{"id":44630484,"url":"https://github.com/hddevteam/vscode-md-converter","last_synced_at":"2026-02-27T10:16:26.291Z","repository":{"id":302742861,"uuid":"1013474896","full_name":"hddevteam/vscode-md-converter","owner":"hddevteam","description":"一键转换Word、Excel、PDF、PowerPoint文档为Markdown格式的VS Code扩展。支持批量处理、智能文本提取、演讲者备注提取，提供中英文双语界面。OneClick Markdown Converter VS Code Extension with PowerPoint support, batch processing, and bilingual interface.","archived":false,"fork":false,"pushed_at":"2026-01-14T03:04:50.000Z","size":6561,"stargazers_count":10,"open_issues_count":3,"forks_count":1,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-02-14T22:48:35.113Z","etag":null,"topics":["batch-processing","bilingual","chinese","document-converter","excel","excel-to-markdown","i18n","markdown","pdf","pdf-to-text","powerpoint-to-markdown","typescript","vscode-extension","word","word-to-markdown"],"latest_commit_sha":null,"homepage":"https://hddevteam.github.io/vscode-md-converter/","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/hddevteam.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-07-04T01:07:46.000Z","updated_at":"2026-01-14T03:04:41.000Z","dependencies_parsed_at":"2025-07-04T02:28:46.389Z","dependency_job_id":"06c47d72-0535-4a65-bdde-e995dbda2937","html_url":"https://github.com/hddevteam/vscode-md-converter","commit_stats":null,"previous_names":["hddevteam/vscode-md-converter"],"tags_count":14,"template":false,"template_full_name":null,"purl":"pkg:github/hddevteam/vscode-md-converter","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hddevteam%2Fvscode-md-converter","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hddevteam%2Fvscode-md-converter/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hddevteam%2Fvscode-md-converter/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hddevteam%2Fvscode-md-converter/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/hddevteam","download_url":"https://codeload.github.com/hddevteam/vscode-md-converter/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hddevteam%2Fvscode-md-converter/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29890861,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-27T09:48:51.284Z","status":"ssl_error","status_checked_at":"2026-02-27T09:48:43.992Z","response_time":57,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["batch-processing","bilingual","chinese","document-converter","excel","excel-to-markdown","i18n","markdown","pdf","pdf-to-text","powerpoint-to-markdown","typescript","vscode-extension","word","word-to-markdown"],"created_at":"2026-02-14T16:26:08.646Z","updated_at":"2026-02-27T10:16:26.268Z","avatar_url":"https://github.com/hddevteam.png","language":"TypeScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"# OneClick Markdown Converter - VS Code Extension\n\n**English** | [中文](README.zh-cn.md) | **[🌐 Website](https://hddevteam.github.io/vscode-md-converter/)**\n\nA powerful VS Code extension for converting various document formats to Markdown and text formats with just one click.\n\n## 🚀 Features\n\n### Supported Document Types\n- **Word Documents** (.docx, .doc) → Markdown\n- **Markdown Files** (.md, .markdown) → Word Documents (.docx) ⭐ **NEW in v0.4.0**\n- **Excel Spreadsheets** (.xlsx, .xls, .csv) → Markdown Tables\n- **Excel Spreadsheets** (.xlsx, .xls) → CSV Files\n- **PDF Documents** (.pdf) → Text Files\n- **PDF Documents** (.pdf) → PNG Images *(requires poppler-utils)*\n- **PowerPoint Presentations** (.pptx, .ppt) → Markdown\n\n### 🆕 v0.5.3 Highlights\n- **✅ Windows Compatibility Improvements** - Improved runtime stability and command reliability on Windows.\n- **🎯 User Value** - Fewer interruptions, smoother conversion workflows, and better cross-platform consistency.\n\n### 🆕 v0.5.1 Highlights\n- **✅ Word to Markdown** - Numbered lists keep the right text (no repeated \"$1\").\n\n### 🆕 v0.5.0 Highlights\n- **📝 Enhanced Markdown to Word** - Significant improvements to the DOCX generator:\n  - **Smart List Numbering** - Ordered lists now restart numbering for separate list blocks\n  - **Nested List Support** - Bullets under ordered items no longer break numbering\n  - **Blockquote Isolation** - Lists inside blockquotes have independent numbering\n  - **Preserve Line Breaks** - Hard line breaks in paragraphs are now correctly preserved\n  - **Improved Blockquote Rendering** - Multi-paragraph blockquotes are no longer collapsed\n- **📝 Markdown to Word Conversion** - Convert Markdown files to professional Word documents with full formatting support\n- **📋 Better Lists in Word (v0.4.1)** - Bullet lists with \"*\" now render as real symbol lists with proper indent; nested items look clean and structured\n- **📋 HTML List Support** - Complete support for HTML lists in table cells with proper Word formatting\n- **🔥 Multi-File Selection** - Select multiple files in VS Code Explorer and convert them all to Markdown at once\n- **🎛️ Configurable Info Blocks** - Choose what information to include in converted Markdown:\n  - File title headers and source notices\n  - File metadata (size, modification date, creation date)\n  - Document-specific metadata (author, sheet count, slide count)\n  - Conversion warnings and content headings\n  - Section separators and custom formatting\n- ** PDF Page Range Export** - Extract specific pages as text or images\n- **📊 Excel Worksheet Range Export** - Export selected worksheets to Markdown/CSV\n- **🎨 PowerPoint Slide Range Export** - Convert specific slides to Markdown\n- **� Mixed File Type Support** - Process Word, Excel, and PowerPoint files in a single operation\n- **💾 Preference Persistence** - Remember your info block choices for future conversions\n- **📊 Smart Progress Reporting** - Real-time conversion progress with detailed status updates\n\n### Core Features\n- ✅ **Context Menu Integration** - Convert directly from file explorer\n- ✅ **Command Palette Support** - Access via Cmd+Shift+P\n- ✅ **Multi-Select Conversion** - Convert multiple files at once with customizable output\n- ✅ **Batch Conversion** - Select folders for bulk processing\n- ✅ **Range Selection** - Choose specific pages, worksheets, or slides to convert\n- ✅ **Smart Text Processing** - Automatically optimize conversion quality\n- ✅ **Progress Indicators** - Real-time conversion progress display\n- ✅ **Error Handling** - Comprehensive error messages and handling\n- ✅ **Multi-language Support** - Automatic language switching between English and Chinese\n- ✅ **Performance Optimized** - Efficient processing of large documents\n\n## 📦 Installation\n\n### From VS Code Marketplace (Recommended)\n1. Open VS Code Extensions Marketplace (`Ctrl+Shift+X` or `Cmd+Shift+X`)\n2. Search for \"Document Converter\" or \"hddevteam\"\n3. Click **Install**\n\n### Direct Link\nVisit the [VS Code Marketplace](https://marketplace.visualstudio.com/items?itemName=luckyXmobile.document-md-converter) and click **Install**.\n\n### Manual Installation (For Development)\n```bash\n# Clone the project\ngit clone https://github.com/hddevteam/vscode-md-converter.git\ncd vscode-md-converter\n\n# Install dependencies\nnpm install\n\n# Compile extension\nnpm run compile\n\n# Press F5 in VS Code for debugging\n```\n\n## 🎯 Usage\n\n### Single File Conversion\n1. **Context Menu**: Right-click on document files in file explorer and select appropriate conversion option\n2. **Command Palette**: \n   - Press `Cmd+Shift+P` (macOS) or `Ctrl+Shift+P` (Windows/Linux)\n   - Type conversion commands (e.g., \"Convert Word to Markdown\")\n\n### Batch Conversion\n1. Right-click on a folder containing documents\n2. Select \"Batch Convert Documents\"\n3. Follow prompts to select conversion types and options\n\n### Available Commands (v0.4.0)\n- `Convert Word to Markdown` - Convert Word documents to Markdown\n- `Convert Markdown to Word` - Convert Markdown files to Word documents (.docx)\n- `Convert Excel to Markdown` - Convert Excel files to Markdown tables\n- `Convert Excel to CSV` - Convert Excel files to CSV format\n- `Convert PDF to Text` - Convert PDF to text files\n- `Convert PDF to Images` - Convert PDF pages to PNG images *(requires poppler-utils)*\n- `Convert PowerPoint to Markdown` - Convert PowerPoint presentations to Markdown\n- `Extract Word Tables to CSV` - Extract tables from Word documents to CSV format\n- `Extract PDF Tables to CSV` - Extract tables from PDF documents to CSV format\n- `Export Specified PDF Pages to Text` - Convert selected PDF pages to text\n- `Export Specified PDF Pages to Images` - Convert selected PDF pages to images\n- `Export Selected Excel Worksheets to Markdown` - Convert chosen worksheets to Markdown\n- `Export Selected Excel Worksheets to CSV` - Convert chosen worksheets to CSV\n- `Export PowerPoint Slides to Markdown` - Convert selected slides to Markdown\n- `Convert Selected to Markdown` - Convert multiple selected files to Markdown with customizable info blocks\n- `Batch Convert Documents` - Batch convert documents\n- `Open Document Converter` - Open converter interface\n\n## 🔧 Conversion Features\n\n### Markdown Document Conversion ⭐ **NEW (v0.4.0)**\n- **Comprehensive Markdown Support**:\n  - All heading levels (H1-H6)\n  - Text formatting (bold, italic, strikethrough)\n  - Inline code and code blocks with syntax highlighting\n  - Ordered and unordered lists with nesting\n  - Tables with alignment support\n  - Blockquotes and horizontal rules\n  - Links and images\n- **Professional Output**:\n  - Clean Word document (.docx) structure\n  - Proper formatting preservation\n  - Unicode and multilingual support\n  - Efficient file size optimization\n- **Performance Optimized**:\n  - Fast conversion (typically \u003c 50ms per document)\n  - Efficient handling of large documents\n  - Streaming output for memory efficiency\n- **Quality Features**:\n  - Preserves document structure and hierarchy\n  - Maintains text formatting across all elements\n  - Handles edge cases and complex nested structures\n  - Consistent output across platforms\n\n**How to use**:\n1. Right-click on a `.md` or `.markdown` file in VS Code Explorer\n2. Select \"Convert Markdown to Word\" from context menu\n3. Or use Command Palette: `Cmd+Shift+P` → \"Convert Markdown to Word\"\n4. Output Word document will be created in the same directory\n\n### Word Document Conversion\n- Preserve text formatting (bold, italic, etc.)\n- Convert heading levels\n- Handle lists and tables\n- Support .docx and .doc formats\n- Smart timeout handling (prevents .doc file freezing)\n\n### Excel Spreadsheet Conversion\n- **To Markdown**: Convert to Markdown table format\n- **To CSV**: Convert to CSV format with configurable options\n  - Choose separate files (one per worksheet) or combined file\n  - Configurable encoding (UTF-8 or GBK for Chinese Excel compatibility)\n  - Configurable delimiter (comma, semicolon, or tab)\n  - Optional metadata comments\n- Preserve cell data types\n- Handle multiple worksheets\n- Support .xlsx, .xls, .csv formats\n- Automatic data formatting\n\n### PDF Document Conversion\n\n#### Text Extraction\n- **Advanced Text Processing Algorithms**:\n  - Smart space correction\n  - Word boundary detection\n  - Punctuation formatting\n  - Hyphenated word reconstruction\n- **Text Quality Optimization**:\n  - Remove extra whitespace\n  - Fix common spacing issues\n  - Protect URL and email formats\n  - Sentence structure optimization\n- **Output Enhancement**:\n  - Add document metadata\n  - Organize content by paragraphs\n  - Markdown format output\n\n#### Image Conversion *(New Feature)*\n- **PDF to Images**: Convert PDF pages to high-quality PNG images\n- **Tool Requirement**: Requires poppler-utils installation\n- **Standard Settings**: 300 DPI resolution for optimal quality\n- **Batch Processing**: Convert multiple PDFs with progress tracking\n- **Cross-Platform**: Automatic tool detection with installation guidance\n- **Organized Output**: Creates structured folder hierarchy for images\n- **One-Click Setup**: Simple installation guidance for missing tools\n\n**Installation Guide for poppler-utils**:\n- **macOS**: `brew install poppler`\n- **Windows**: Download portable version or use package manager\n- **Linux**: `sudo apt-get install poppler-utils`\n\n### PowerPoint Presentation Conversion\n- **Slide Content Extraction**:\n  - Extract text from all slides\n  - Intelligent paragraph grouping\n  - Header detection and formatting\n  - List item identification\n- **Presentation Structure**:\n  - Maintain slide order and hierarchy\n  - Extract presentation metadata (author, title, subject)\n  - Include slide count and file information\n- **Speaker Notes Support**:\n  - Extract and organize speaker notes\n  - Link notes to corresponding slides\n  - Preserve note formatting\n- **Format Compatibility**:\n  - Support both .pptx (modern) and .ppt (legacy) formats\n  - Provide guidance for .ppt format conversion\n  - Handle complex presentation layouts\n\n## 🌐 Multi-language Support\n\nThe extension automatically switches interface language based on VS Code language settings:\n- **English** (default) - For English environments\n- **Chinese** - Automatically detects Chinese environments\n\nSupported language features:\n- Command titles and descriptions\n- User interface text\n- Error messages and prompts\n- Configuration option descriptions\n\n## 📁 Output Formats\n\nAll converted files are saved in the same directory as the original files with appropriate formats:\n- Word → `.md` files\n- Excel → `.md` files (with tables)\n- PDF → `.txt` files\n\n## 🚀 Latest Features (v0.4.1)\n\n### Smoother Lists in Word ✨\n- Bulleted lists (using \"*\") keep their dot symbols instead of turning into numbers\n- Sub-items自动缩进，层级更清晰，阅读更舒适\n- 表格和HTML中的列表也能一致呈现，复制粘贴后不乱形\n\n## 🚀 Upcoming Features (v0.5.0+)\n\nWe're actively working on expanding format support and adding new capabilities:\n\n- **🖼️ PDF to Images**: Convert PDF pages to individual image files (PNG/JPG)\n- **📤 Enhanced Export**: Export PowerPoint slides and PDF pages as high-quality images\n- **🏗️ Architecture Improvements**: Better plugin system for future format extensions\n- **🎨 Custom Templates**: User-defined output templates for different conversion scenarios\n\n[View our complete roadmap](ROADMAP.md) | [Track progress on GitHub](https://github.com/hddevteam/vscode-md-converter/issues)\n\n## ⚙️ Configuration Options\n\nThe extension supports the following configuration options (search for \"Document Converter\" in settings):\n- Output directory settings\n- Excel maximum rows limit\n- Format preservation options\n- Auto-open result files\n\n## 🛠️ Technical Implementation\n\n### Dependencies\n- **mammoth.js** - Word document processing\n- **xlsx** - Excel file processing  \n- **pdf-parse** - PDF text extraction\n- **VS Code API** - Extension integration\n\n### Architecture Design\n```\nsrc/\n├── converters/           # Core converters\n│   ├── wordToMarkdown.ts\n│   ├── excelToMarkdown.ts\n│   └── pdfToText.ts\n├── commands/            # VS Code command handlers\n├── i18n/               # Internationalization support\n│   ├── index.ts        # I18n manager\n│   ├── en.ts          # English language pack\n│   └── zh-cn.ts       # Chinese language pack\n├── utils/              # Utility functions\n├── types/              # TypeScript type definitions\n└── extension.ts        # Extension entry point\n```\n\n## 🐛 Troubleshooting\n\n### Common Issues\n\n1. **Conversion Failed**\n   - Check if file is corrupted\n   - Ensure file is not open in other programs\n   - Check VS Code Developer Console for error messages\n\n2. **.doc File Conversion Freezing**\n   - Extension has built-in timeout mechanism\n   - Recommend converting .doc files to .docx before processing\n\n3. **Poor PDF Text Quality**\n   - Some PDFs may use image text, recommend using OCR tools\n   - Check if PDF is a scanned document\n\n4. **Interface Language Issues**\n   - Extension automatically detects VS Code language settings\n   - Supports Chinese (zh-cn) and English environments\n   - Can change display language in VS Code settings\n\n### Debug Mode\nPress F5 in VS Code to run extension in debug mode and view detailed log information.\n\n## 🤝 Contributing\n\nWelcome to submit Issues and Pull Requests to improve this extension!\n\nSee our [Development Roadmap](ROADMAP.md) for planned features and upcoming improvements.\n\n### Development Environment Setup\n```bash\n# Clone repository\ngit clone https://github.com/hddevteam/vscode-md-converter.git\ncd vscode-md-converter\n\n# Install dependencies\nnpm install\n\n# Development mode compilation\nnpm run watch\n\n# Run tests\nnpm test\n```\n\n### Adding New Language Support\n1. Create new language file in `src/i18n/` directory\n2. Add language detection logic in `src/i18n/index.ts`\n3. Create corresponding `package.nls.{language}.json` file\n\n## 📄 License\n\nMIT License - See [LICENSE](LICENSE) file for details\n\n## 🔄 Changelog\n\n### v0.1.5 - 2025-07-09\n- ✨ **NEW: PDF Table Extraction** - Complete implementation of PDF table extraction to CSV\n- 📊 **NEW: Word Table Extraction** - Enhanced Word table extraction with improved CSV output\n- 🎯 Advanced table detection algorithms for PDF documents with single-space separation support\n- 🌏 Support for complex Chinese forex tables and multi-column data extraction\n- ⚙️ Table output mode selection (separate files vs combined file)\n- 📝 Excel compatibility with UTF-8 BOM for proper Chinese character display\n- 🧹 Simplified PDF parsing architecture using only reliable pdf-parse library\n- 🔧 Enhanced table detection for various table formats (tab, comma, space-separated)\n- 🛠️ Better error handling and user feedback for table extraction operations\n- 📖 Code quality improvements with English comments throughout codebase\n\n### v0.1.4\n- ✨ **NEW: PowerPoint Support** - Full conversion of .pptx presentations to Markdown\n- 🎯 Intelligent slide content extraction with proper text formatting\n- 📝 Speaker notes support with automatic linking to slides\n- 🔧 Enhanced user experience with proper progress reporting\n- ⚠️ User-friendly warnings for legacy .ppt format with conversion guidance\n- 📊 Comprehensive presentation metadata extraction (author, title, subject, slide count)\n- 🧹 Codebase cleanup - removed unnecessary test commands for cleaner interface\n- 🌐 Updated translations and documentation for PowerPoint features\n\n### v0.1.3\n- ✨ Added directory structure preservation in batch conversion\n- 🗂️ Enhanced subfolder handling with original hierarchy\n- 🔄 Improved conversion workflow with better progress reporting\n- 🌐 Updated translations for subfolder structure features\n- 🧩 Fixed additional context menu integration issues\n\n### v0.1.2\n- ✨ Added enhanced folder batch conversion\n- 🔍 Added file count preview before batch conversion\n- 🚀 Added conversion confirmation dialog\n- ⏱️ Added total duration tracking for batch operations\n- 🧩 Fixed folder context menu integration\n- 🌐 Updated internationalization for new UI elements\n\n### v0.1.1\n- 🎨 Updated extension branding to \"OneClick Markdown Converter\"\n- 🌐 Enhanced GitHub Pages website with Chinese version\n- 🎨 Improved color scheme with modern purple-teal gradients\n- 📚 Updated internationalization support\n- 🔧 Enhanced documentation and user guides\n\n### v0.1.0\n- 🚀 Published to VS Code Marketplace\n- 🌐 Added comprehensive GitHub Pages website\n- 📦 Professional packaging and distribution\n- 🎨 Created custom extension icon\n- 📖 Complete documentation and guides\n\n### v0.0.1\n- ✨ Initial release\n- ✅ Word to Markdown conversion\n- ✅ Excel to Markdown conversion  \n- ✅ PDF to text conversion\n- ✅ Batch conversion functionality\n- ✅ VS Code integration\n- ✅ English and Chinese bilingual support\n\n---\n\n**Enjoy the convenience of document conversion!** 🎉\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhddevteam%2Fvscode-md-converter","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhddevteam%2Fvscode-md-converter","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhddevteam%2Fvscode-md-converter/lists"}