#1126837 ITP: markitdown -- A CLI tool for converting files and office documents to Markdown

#1126837#5
Date:
2026-02-02 08:21:35 UTC
From:
To:

* Package name    : markitdown
  Version         : 0.1.4
  Upstream Author: Microsoft
* URL             : https://github.com/microsoft/markitdown
* License         : MIT
  Programming Lang: Python
  Description     : Python tool for converting files and office documents to Markdown.


MarkItDown is a lightweight Python utility for converting
various files to Markdown for use with LLMs and related text analysis
pipelines. To this end, it is most comparable to textract<https://github.com/deanmalmgren/textract>,
 but with a focus on preserving important document structure and content
 as Markdown (including: headings, lists, tables, links, etc.) While the
 output is often reasonably presentable and human-friendly, it is meant
to be consumed by text analysis tools -- and may not be the best option
for high-fidelity document conversions for human consumption.

This is a dependency for Alpaca. Me and Arian Ott will be maintaining this in the Python team