{"id":25686130,"url":"https://github.com/felipefontoura/doc2md","last_synced_at":"2025-06-21T08:34:08.219Z","repository":{"id":272681997,"uuid":"917418515","full_name":"felipefontoura/doc2md","owner":"felipefontoura","description":"Convert documents to Markdown format through a simple API service.","archived":false,"fork":false,"pushed_at":"2025-02-11T12:27:32.000Z","size":18,"stargazers_count":27,"open_issues_count":2,"forks_count":16,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-19T05:13:37.709Z","etag":null,"topics":["audio","csv","docx","html","image","json","markdown","ocr","pdf","xlxs","xml"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/felipefontoura.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-01-16T00:11:52.000Z","updated_at":"2025-06-12T19:52:07.000Z","dependencies_parsed_at":"2025-01-16T01:28:01.127Z","dependency_job_id":"aa5cb453-3d6f-4c74-87fa-556d554e1c8e","html_url":"https://github.com/felipefontoura/doc2md","commit_stats":null,"previous_names":["felipefontoura/doc2md"],"tags_count":3,"template":false,"template_full_name":null,"purl":"pkg:github/felipefontoura/doc2md","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/felipefontoura%2Fdoc2md","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/felipefontoura%2Fdoc2md/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/felipefontoura%2Fdoc2md/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/felipefontoura%2Fdoc2md/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/felipefontoura","download_url":"https://codeload.github.com/felipefontoura/doc2md/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/felipefontoura%2Fdoc2md/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":261093067,"owners_count":23108581,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio","csv","docx","html","image","json","markdown","ocr","pdf","xlxs","xml"],"created_at":"2025-02-24T19:07:26.005Z","updated_at":"2025-06-21T08:34:03.144Z","avatar_url":"https://github.com/felipefontoura.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Doc2MD\n\n[![Docker Pulls](https://img.shields.io/docker/pulls/felipefontoura/doc2md)](https://hub.docker.com/r/felipefontoura/doc2md)\n[![Docker Image Size](https://img.shields.io/docker/image-size/felipefontoura/doc2md)](https://hub.docker.com/r/felipefontoura/doc2md)\n\nConvert documents to Markdown format through a simple API service.\n\n## 🚀 Quick Start\n\n### Using Docker (Recommended)\n\n```bash\ndocker run -d -p 5000:5000 felipefontoura/doc2md\n```\n\n### API Usage\n\nConvert a document to Markdown:\n\n```bash\ncurl -X POST \\\n  -H \"Content-Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet\" \\\n  --data-binary \"@your_document.xlsx\" \\\n  http://localhost:5000/convert\n```\n\n## ✨ Features\n\n- Convert multiples files to Markdown (PDF, PowerPoint, Word, Excel, Images, Audio, HTML, CSV, JSON, XML and ZIP).\n- OCR for PDF files.\n- Simple REST API interface\n- Docker support\n- Easy deployment with Docker Stack\n\n## 🛠️ Installation\n\n### Using Docker Hub\n\n1. Pull the image:\n\n```bash\ndocker pull felipefontoura/doc2md\n```\n\n2. Run the container:\n\n```bash\ndocker run -d -p 5000:5000 felipefontoura/doc2md\n```\n\n## 💻 Usage\n\n### API Endpoints\n\n#### Convert Document\n\n```bash\ncurl -X POST \\\n -H \"Content-Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet\" \\\n --data-binary \"@your_document.xlsx\" \\\n http://localhost:5000/convert?ocr=true/false\n```\n\n## 📦 Deployment\n\n### Docker Stack Deployment\n\nDeploy using [Docker Stack](stack.yml):\n\n```bash\ndocker stack deploy --prune --resolve-image always -c stack.yml doc2md\n```\n\nExample `doc2md.yml`:\n\n```yaml\nversion: \"3.7\"\nservices:\n  doc2md:\n    image: felipefontoura/doc2md:latest\n    environment:\n      - OPENAI_API_KEY=sk-xxx\n      - LLM_MODEL=gpt-4o-mini\n      - WORKERS=4\n      - TIMEOUT=0\n    ports:\n      - \"5000:5000\"\n    deploy:\n      replicas: 1\n      restart_policy:\n        condition: on-failure\n```\n\n## 🔧 Development\n\n1. Clone the repository\n2. Build the Docker image locally\n3. Run tests\n4. Submit pull requests\n\n## 📝 License\n\n[MIT License](https://opensource.org/licenses/MIT)\n\n---\n\nMade with ❤️ by Felipe Fontoura\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffelipefontoura%2Fdoc2md","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ffelipefontoura%2Fdoc2md","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffelipefontoura%2Fdoc2md/lists"}