Projects in Awesome Lists tagged with docx
A curated list of projects in awesome lists tagged with docx .
https://docling-project.github.io/docling/
Get your documents ready for gen AI
ai convert document-parser document-parsing documents docx html markdown pdf pdf-converter pdf-to-json pdf-to-text pptx tables xlsx
Last synced: 26 Jun 2025
https://github.com/docling-project/docling
Get your documents ready for gen AI
ai convert document-parser document-parsing documents docx html markdown pdf pdf-converter pdf-to-json pdf-to-text pptx tables xlsx
Last synced: 09 Sep 2025
https://github.com/ds4sd/docling
Get your documents ready for gen AI
ai convert document-parser document-parsing documents docx html markdown pdf pdf-converter pdf-to-json pdf-to-text pptx tables xlsx
Last synced: 08 Mar 2025
https://github.com/ether/etherpad-lite
Etherpad: A modern really-real-time collaborative document editor.
collaboration collaborative collaborative-editing collaborative-framework collaborative-research collaborative-writing document documents docx etherpad libreoffice microsoft pdf pdf-generation rich-text-editor video-conference video-conferencing web-editor word
Last synced: 28 Dec 2025
https://ds4sd.github.io/docling/
Get your documents ready for gen AI
ai convert document-parser document-parsing documents docx html markdown pdf pdf-converter pdf-to-json pdf-to-text pptx tables xlsx
Last synced: 07 Sep 2025
https://github.com/kekingcn/kkfileview
Universal File Online Preview Project based on Spring-Boot
docx fileview fileviewer java kkfileview office office-view pdf word
Last synced: 14 May 2025
https://github.com/unstructured-io/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Last synced: 09 Sep 2025
https://github.com/kekingcn/kkFileView
Universal File Online Preview Project based on Spring-Boot
docx fileview fileviewer java kkfileview office office-view pdf word
Last synced: 24 Mar 2025
https://github.com/Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Last synced: 26 Mar 2025
https://github.com/kalcaddle/kodexplorer
A web based file manager,web IDE / browser based code editor
archive collaboration docx file-browser file-explorer file-sharing file-upload filemanager free-software ide markdown-editor s3 text-editor webdav xlsx zip
Last synced: 14 May 2025
https://github.com/kalcaddle/KODExplorer
A web based file manager,web IDE / browser based code editor
archive collaboration docx file-browser file-explorer file-sharing file-upload filemanager free-software ide markdown-editor s3 text-editor webdav xlsx zip
Last synced: 27 Mar 2025
https://github.com/kalcaddle/KodExplorer
A web based file manager,web IDE / browser based code editor
archive collaboration docx file-browser file-explorer file-sharing file-upload filemanager free-software ide markdown-editor s3 text-editor webdav xlsx zip
Last synced: 13 Mar 2025
https://github.com/onlyoffice/documentserver
ONLYOFFICE Docs is a free collaborative online office suite comprising viewers and editors for texts, spreadsheets and presentations, forms and PDF, fully compatible with Office Open XML formats: .docx, .xlsx, .pptx and enabling collaborative editing in real time.
collaboration collaborative doc docx excel node node-js nodejs odp ods odt office onlyoffice ppt pptx presentation spreadsheet word xls xlsx
Last synced: 18 Dec 2025
https://github.com/ONLYOFFICE/DocumentServer
ONLYOFFICE Docs is a free collaborative online office suite comprising viewers and editors for texts, spreadsheets and presentations, forms and PDF, fully compatible with Office Open XML formats: .docx, .xlsx, .pptx and enabling collaborative editing in real time.
collaboration collaborative doc docx excel node node-js nodejs odp ods odt office onlyoffice ppt pptx presentation spreadsheet word xls xlsx
Last synced: 03 Apr 2025
https://github.com/dolanmiu/docx
Easily generate and modify .docx files with JS/TS with a nice declarative API. Works for Node and on the Browser.
angular angular2 browser create docs document docx docx-generator express expressjs generate javascript microsoft nodejs office react typescript vuejs word
Last synced: 13 May 2025
https://github.com/QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
docx llm parser pdf powerpoint
Last synced: 30 Aug 2025
https://github.com/501351981/vue-office
支持word(.docx)、excel(.xlsx,.xls)、pdf、pptx等各类型office文件预览的vue组件集合,提供一站式office文件预览方案,支持vue2和3,也支持React等非Vue框架。Web-based pdf, excel, word, pptx preview library
docx docx-preview excel pdf pdf-preview pdf-viewer vue xlsx xlsx-preview
Last synced: 16 May 2025
https://github.com/unidoc/unioffice
Pure go library for creating and processing Office Word (.docx), Excel (.xlsx) and Powerpoint (.pptx) documents
docx ecma-376 excel g go golang ooxml openoffice powerpoint pptx spreadsheet word xlsx
Last synced: 15 May 2025
https://github.com/dotnet/open-xml-sdk
Open XML SDK by Microsoft
docx office openxml-format pptx
Last synced: 18 Dec 2025
https://github.com/dotnet/Open-XML-SDK
Open XML SDK by Microsoft
docx office openxml-format pptx
Last synced: 16 Mar 2025
https://github.com/ONLYOFFICE/DesktopEditors
Open-source office suite pack that comprises all the tools you need to work with documents, spreadsheets, presentations, PDFs, and PDF forms on Windows, Linux, and macOS
collaboration desktop docx excel form-builder office onlyoffice pdf pdf-form pptx presentation spreadsheet word xlsx
Last synced: 05 May 2025
https://github.com/onlyoffice/desktopeditors
Open-source office suite pack that comprises all the tools you need to work with documents, spreadsheets, presentations, PDFs, and PDF forms on Windows, Linux, and macOS
collaboration desktop docx excel form-builder office onlyoffice pdf pdf-form pptx presentation spreadsheet word xlsx
Last synced: 29 Apr 2025
https://github.com/open-xml-templating/docxtemplater
Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxtemplater.com/demo. #docx #office #generator #templating #report #json #generate #generation #template #create #pptx #docx #xlsx #react #vuejs #angularjs #browser #typescript #image #html #table #chart
docx docx-generation docx-generator docx-template excel javascript microsoft office powerpoint pptx template word xlsx
Last synced: 30 Dec 2025
https://github.com/artifexsoftware/pdf2docx
Open source Python library for converting PDF to DOCX.
docx extract-table pdf-converter pdf-to-word pymupdf
Last synced: 14 May 2025
https://github.com/ArtifexSoftware/pdf2docx
Open source Python library for converting PDF to DOCX.
docx extract-table pdf-converter pdf-to-word pymupdf
Last synced: 28 Mar 2025
https://github.com/kalcaddle/kodbox
kodbox is a file manager for web. It is a newly designed product based on kodexplorer. It is also a web code editor, which allows you to develop websites directly within the web browser.You can run kodbox either online or locally,on Linux, Windows or Mac based platforms
docx file-explorer file-manager file-sharing file-upload file-viewer filemanager filesystem free-software javascript markdown-editor php text-editor webdav xlsx
Last synced: 14 May 2025
https://github.com/rstudio/gt
Easily generate information-rich, publication-quality tables from R
docx easy-to-use html latex r rtf summary-tables
Last synced: 12 May 2025
https://github.com/xceedsoftware/docx
Fast and easy to use .NET library that creates or modifies Microsoft Word files without installing Word.
c-sharp docx microsoft-word office
Last synced: 11 May 2025
https://github.com/xceedsoftware/DocX
Fast and easy to use .NET library that creates or modifies Microsoft Word files without installing Word.
c-sharp docx microsoft-word office
Last synced: 09 Apr 2025
https://github.com/n4ze3m/dialoqbase
Create chatbots with ease
anthropic chatbot chatgpt claude code-llama codellama cohere docx google-palm gpt-3 gpt-4 huggingface langchain llama localai mistral pdf private-gpt privategpt
Last synced: 14 May 2025
https://github.com/goldziher/kreuzberg
A text extraction library supporting PDFs, images, office documents and more
asyncio docx ocr pdf text-extraction
Last synced: 14 May 2025
https://github.com/sajari/docconv
Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text
conversion docs docx go html pdf pdf-converter rtf rtf-files word xml
Last synced: 18 Dec 2025
https://github.com/jesselau76/ebook-GPT-translator
Enjoy reading with your favorite style.
docx epub mobi pdf python translation translator
Last synced: 24 Mar 2025
https://github.com/jesselau76/ebook-gpt-translator
Enjoy reading with your favorite style.
docx epub mobi pdf python translation translator
Last synced: 15 May 2025
https://github.com/guigrpa/docx-templates
Template-based docx report creation
docx docx-file docx-generator ms-word reporting template
Last synced: 13 May 2025
https://github.com/skywalkerdarren/chatweb
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
ai chatgpt crawler docx embedding faiss gpt gpt-35-turbo news-extractor newspaper openai pdf pgvector postgresql vector-database
Last synced: 25 Oct 2025
https://github.com/SkywalkerDarren/chatWeb
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
ai chatgpt crawler docx embedding faiss gpt gpt-35-turbo news-extractor newspaper openai pdf pgvector postgresql vector-database
Last synced: 30 Mar 2025
https://github.com/quivrhq/megaparse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
docx llm parser pdf powerpoint
Last synced: 02 Aug 2025
https://github.com/mattn/docx2md
Convert Microsoft Word Document to Markdown
Last synced: 15 May 2025
https://github.com/davidgohel/flextable
table farming
cran docx html5 ms-office-documents r rmarkdown rstats table
Last synced: 21 Oct 2025
https://github.com/nolze/msoffcrypto-tool
Python tool and library for decrypting and encrypting MS Office files using passwords or other keys
command-line decryption doc docx encryption ms-offcrypto ole ooxml ppt pptx xls xlsx
Last synced: 15 Dec 2025
https://github.com/fiduswriter/fiduswriter
Fidus Writer is an online collaborative editor for academics.
academics biblatex bibtex citations docx jats jats-publishing jats-xml latex math opendocument text-editor word
Last synced: 14 May 2025
https://github.com/explosion/spacy-layout
📚 Process PDFs, Word documents and more with spaCy
document-layout document-layout-analysis docx generative-ai natural-language-processing nlp pdf pdf-converter rag spacy
Last synced: 14 May 2025
https://github.com/gen2brain/go-fitz
Golang wrapper for the MuPDF Fitz library
docx epub fitz golang golang-wrapper mupdf pdf pptx xlsx
Last synced: 15 May 2025
https://github.com/tobya/DocTo
Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF
conversion converter csv csv-export delphi delphi10 docto docx excel pdf pdf-generation windows word-documents xls xls-files xlsx
Last synced: 19 Mar 2025
https://github.com/amiremohamadi/DuckX
C++ library for creating and modifying Microsoft Word (.docx) files
cpp cpp-library docx docx-files office
Last synced: 08 May 2025
https://github.com/amiremohamadi/duckx
C++ library for creating and modifying Microsoft Word (.docx) files
cpp cpp-library docx docx-files office
Last synced: 04 Apr 2025
https://github.com/ruby-docx/docx
a ruby library/gem for interacting with .docx files
docx hacktoberfest office-open-xml ooxml ooxml-parser ruby rubygem word
Last synced: 14 Jul 2025
https://github.com/onizet/html2openxml
Html2OpenXml is a small .Net library that convert simple or advanced HTML to plain OpenXml components. This program has started in 2009, initially to convert user's comments into templated Word.
docx dotnet-core openxml openxml-sdk
Last synced: 14 May 2025
https://github.com/rmzelle/ref-extractor
Reference Extractor - Extract Zotero/Mendeley references from Microsoft Word files
citations docx mendeley microsoft-word zotero
Last synced: 09 Apr 2025
https://github.com/evotecit/officeimo
Fast and easy to use cross-platform .NET library that creates or modifies Microsoft Word (DocX) and later also Excel (XLSX) files without installing any software. Library is based on Open XML SDK
c-sharp docx office openxml openxml-sdk word
Last synced: 16 May 2025
https://github.com/shcherbak-ai/contextgem
ContextGem: Effortless LLM extraction from documents
ai contract-analysis data-extraction document-intelligence docx docx2md docx2txt generative-ai legaltech llm llm-extraction llm-framework llm-pipeline llms nlp prompt-engineering text-analysis unstructured-data
Last synced: 13 May 2025
https://github.com/gotenberg/gotenberg-php
🐘 A PHP client for interacting with Gotenberg.
chromium conversion csv docx excel google-chrome gotenberg html libreoffice markdown pdf pdftk php pptx puppeteer unoconv wkhtmltopdf word xlsx
Last synced: 14 May 2025
https://github.com/maehr/academic-pandoc-template
Write beautifully typeset academic texts with distraction-free Markdown and Pandoc.
academic-publishing docx docx-generator github-action github-actions latex latex-template markdown markdown-converter pandoc pandoc-template pdf pdf-generator tectonic
Last synced: 22 Aug 2025
https://github.com/lukasjarosch/go-docx
Replace placeholders inside docx documents with speed and confidence.
docx golang microsoft ooxml word wordprocessingml
Last synced: 05 Apr 2025
https://github.com/EvotecIT/PSWriteWord
PSWriteWord is powershell module to create Microsoft Word documents without Microsoft Word installed...
docx hacktoberfest microsoft microsoft-office microsoft-word powershell
Last synced: 10 Apr 2025
https://github.com/ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
doc document-analysis document-content-extraction documents docx docx-parser excel html html-parser logical-structure-extraction ocr odt pdf pdf-parser scanned-documents table-of-contents table-recognition txt
Last synced: 15 May 2025
https://github.com/brsloan/warewoolf
A minimalist novel-writing system/rich text editor designed to be usable without a mouse. For desktop and standalone word processors/digital typewriters/writerDecks.
docx editor fiction markdown novel-writing quill rich-text-editor text-editor word-processor writerdeck writing-software
Last synced: 17 Oct 2025
https://github.com/nabilanavab/ilovepdf
Telegram Bot that helps you to convert Images to pdf, pdf to images, 45+ file formats to pdf, more features Soon..
doc docx heroku ilovepdf image-to-pdf nabil-a-navab nabilanavab pdf pdf-to-image pillow pyrogram pyrogram-bot pytelegrambotapi python python3 telegram telegram-api telegram-bot
Last synced: 22 Jul 2025
https://github.com/blaspsoft/doxswap
📄 🔄 Doxswap is a Laravel package for seamless document conversion using LibreOffice. Effortlessly convert DOCX, PDF, ODT, and more with a simple, elegant API. Supports Laravel storage, configurable settings, and secure file handling.
converter document-conversion document-converter docx file-conversion laravel libreoffice odt pdf php xlsx
Last synced: 22 Apr 2025
https://github.com/fumiama/go-docx
One of the most functional libraries to partially read and write .docx files (a.k.a. Microsoft Word documents or ECMA-376 Office Open XML) in Go.
docx docx-converter docx-files docx-generator openxml openxml-format openxml-sdk openxml-word word xml
Last synced: 12 Apr 2025
https://github.com/hrbrmstr/docxtractr
:scissors: Extract Tables from Microsoft Word Documents with R
docx extract-tables microsoft-word r rstats table-extraction
Last synced: 08 May 2025
https://github.com/foliant-docs/foliant
Comprehensive markdown-based documentation toolkit
docs documentation docx latex markdown python
Last synced: 29 Jul 2025
https://github.com/kmwallio/ThiefMD
The markdown editor worth stealing. Inspired by Ulysses, based on code from Quilter
docx editor epub flathub fountain fountain-editor ghost-client markdown markdown-editor novel novel-writing pandoc screenplay ulysses vala wordpress-client writeas-client writing-software writing-tool
Last synced: 21 Jul 2025
https://github.com/kmwallio/thiefmd
The markdown editor worth stealing. Inspired by Ulysses, based on code from Quilter
docx editor epub flathub fountain fountain-editor ghost-client markdown markdown-editor novel novel-writing pandoc screenplay ulysses vala wordpress-client writeas-client writing-software writing-tool
Last synced: 08 Aug 2025
https://github.com/erdos/stencil
templating engine for DOCX and PPTX files
docx ooxml ooxml-parser pptx template-engine
Last synced: 21 Oct 2025
https://github.com/mnvx/lowrapper
PHP wrapper over LibreOffice converter
docx docx-generator html libreoffice libreoffice-converter pdf pdf-converter pdf-generation
Last synced: 15 May 2025
https://github.com/curvenote/prosemirror-docx
Export a prosemirror document to a Microsoft Word file, using docx.
docx docx-generator prosemirror writing
Last synced: 16 May 2025
https://github.com/ONLYOFFICE/DocumentBuilder
ONLYOFFICE Document Builder is powerful text, spreadsheet, presentation and PDF generating tool
doc documentbuilder docx excel node node-js nodejs odp ods odt onlyoffice ppt pptx presentation spreadsheet word xls xlsx
Last synced: 10 May 2025
https://github.com/onlyoffice/documentbuilder
ONLYOFFICE Document Builder is powerful text, spreadsheet, presentation and PDF generating tool
doc documentbuilder docx excel node node-js nodejs odp ods odt onlyoffice ppt pptx presentation spreadsheet word xls xlsx
Last synced: 04 Apr 2025
https://github.com/sonnenkern/quip-export
Export all folders and documents from Quip
503 annotations api automation backup chat comments conversation docx docx-export embedded-images export folders html-export office quip quip-api quip-export xlsx zero-configuration
Last synced: 10 Apr 2025
https://github.com/cherfia/chromiumly
A lightweight Typescript library that interacts with Gotenberg's different modules to convert a variety of document formats to PDF files.
chromium docx gotenberg html libreoffice markdown pdf pptx screenshots xlsx
Last synced: 16 May 2025
https://github.com/harbour-enterprises/superdoc
🦋️ SuperDoc - modern document editing
collaborative docx editor harbour javascript ms-word ms-word-docx-file pdf real-time rich-text-editor wysiwyg
Last synced: 06 Jan 2026
https://github.com/liumapp/workable-converter
基于libreoffice实现的文档转换项目,无框架依赖,即插即用
convert doc docx java library libreoffice maven pdf ppt
Last synced: 15 Apr 2025
https://github.com/code-boxx/invoicr
Invoicr is a free and open-source PHP invoice generator that is capable of churning out HTML, PDF, DOCX, and PNG invoices.
docx invoice invoice-generator invoice-template pdf php xlsx
Last synced: 14 Mar 2025
https://github.com/asutosh11/documentreader
This library reads word documents (.doc and .docx), txt and PDF files, and gives the output content of the document as a String.
android-library docparser docx filereader kotlin kotlin-android pdf pdf-document pdfreader txtreader
Last synced: 28 Sep 2025
https://github.com/hufe921/canvas-editor-plugin
plugins for canvas-editor
barcode canvas-editor codeblock docx docx-generator excel floating-toolbar plugin
Last synced: 09 Apr 2025
https://github.com/TomBener/pandoc-templates
Pandoc templates to convert Markdown files to DOCX, PDF and HTML for academic writing, especially for Chinese writing.
docx html latex pandoc pandoc-templates pdf
Last synced: 06 May 2025
https://github.com/trinker/textreadr
Tools to uniformly read in text data including semi-structured transcripts
doc docx pdf-reading r read-transcripts text-data text-mining
Last synced: 16 Mar 2025
https://github.com/greyovo/markdocx
Convert Markdown to Word (.docx). / 将 markdown 文件转换为 Word(.docx)
docx markdown markdown-to-word md2word ms-word word
Last synced: 23 Oct 2025
https://github.com/gamemaker1/office-text-extractor
Yet another library to extract text from MS Office and PDF files
docx get-text ms-excel ms-office ms-powerpoint ms-word parser pdf pptx text-extraction xlsx
Last synced: 26 Dec 2025