https://github.com/brianprost/markitdown-lambda
https://github.com/brianprost/markitdown-lambda
Last synced: about 1 year ago
JSON representation
- Host: GitHub
- URL: https://github.com/brianprost/markitdown-lambda
- Owner: brianprost
- Created: 2025-04-06T16:21:03.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-06T18:01:19.000Z (about 1 year ago)
- Last Synced: 2025-04-06T19:19:52.823Z (about 1 year ago)
- Language: Python
- Size: 46.9 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# markitdown-aws-lambda
A Lambda function wrapper for Microsoft's [markitdown library](https://github.com/microsoft/markitdown) to convert various file formats to Markdown Text.
## Supported File Types
- Documents: PDF, DOC, DOCX, PPT, PPTX
- Spreadsheets: XLS, XLSX, CSV
- Images: JPG, JPEG, PNG, TIFF
- Audio: MP3, WAV, OGG
- Web: HTML, HTM
- Data: JSON, XML
- Other: ZIP, EPUB, YouTube links
## Usage
```bash
uv run main.py
```
The converted Markdown content will be saved to `output.md`.
## Requirements
- Python 3.13+
- I'm using `uv` for managing the package, but you can also use `pip` to install dependencies if you prefer.
- Dependencies listed in pyproject.toml