Projects in Awesome Lists tagged with pdf-documents
A curated list of projects in awesome lists tagged with pdf-documents .
https://github.com/py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
help-wanted pdf pdf-documents pdf-manipulation pdf-parser pdf-parsing pypdf2 python
Last synced: 15 Jan 2026
https://github.com/py-pdf/PyPDF2
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
help-wanted pdf pdf-documents pdf-manipulation pdf-parser pdf-parsing pypdf2 python
Last synced: 17 Aug 2025
https://github.com/mstamy2/PyPDF2
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
help-wanted pdf pdf-documents pdf-manipulation pdf-parser pdf-parsing pypdf2 python
Last synced: 02 Apr 2025
https://github.com/pymupdf/pymupdf
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 01 Apr 2026
https://github.com/pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 08 Apr 2025
https://github.com/pypdfium2-team/pypdfium2
Python bindings to PDFium
pdf pdf-documents pdf-to-image pdfium python rasterisation
Last synced: 05 Jan 2026
https://github.com/michelcrypt4d4mus/pdfalyzer
Analyze PDFs with colors (and YARA)
malicious-pdf-files malware-analysis pdf pdf-documents pdf-format pdf-parser yara yara-rules yara-scanner
Last synced: 25 Jan 2026
https://github.com/mnishihan/laravel-docs-in-pdf
Up-to-date Laravel documentation in PDF format (all versions)
docs documentation laravel laravel-pdf-docs pdf pdf-documents php
Last synced: 23 Jan 2026
https://github.com/podofo/podofo
A C++17 PDF manipulation library
cplusplus cpp pdf pdf-documents pdf-files pdf-generation
Last synced: 05 Apr 2025
https://github.com/rffrasca/pdfkeeper
Open Source PDF Document Management
csharp database dotnet-framework dotnet-framework-48 full-text-search mysql ocr oracle oracle-cloud oracle-database oracledb pdf pdf-document pdf-document-management pdf-documents sql-server sqlite sqlserver winforms
Last synced: 11 Oct 2025
https://github.com/erikkastelec/pdfscraper
CLI program for searching inside text and tables in PDF documents and displaying results in HTML.
camelot ocr ocr-analysis pdf-documents pdfminer
Last synced: 26 Jun 2025
https://github.com/sjain882/ocrmypdf-wingui
Simple frontend for OCRmyPDF (Windows only).
csharp desktop-app dotnet ocr-pdf ocrmypdf pdf pdf-document pdf-documents search-pdf windows wpf
Last synced: 19 Apr 2026
https://github.com/alexreg/pdfmarks
Utility for modifying page markings of PDF documents
page-markings pdf-documents scripts tools utilities
Last synced: 29 Apr 2026
https://github.com/cajuncoding/pdfhelpers
Lightweight Helper classes based on iTextSharp for scaling and resizing Pdf Documents & Pages.
concatenate-files concatenate-image concatenate-pdf merge-files pdf pdf-conversion pdf-converter pdf-documents pdf-generation pdf-merge pdf-resize
Last synced: 01 May 2025
https://github.com/slootjes/dms
Dockerized PHP app to search PDF documents.
docker docker-compose document-management pdf-documents php
Last synced: 19 Apr 2026
https://github.com/timothy-bartlett/pymupdf
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction text-processing text-shaping xps
Last synced: 18 May 2026
https://github.com/nilostolte/pdfbox
This project offers several versions of PDFBox source code that can be compiled with Eclipse. The complete version is a complete unmodified PDFBox with all packages normally not included in PDFBox source code. The other versions are modified versions offering more capabilities.
eclipse java pdf-converter pdf-debugger pdf-documents pdf-viewer pdfbox pdfbox-source
Last synced: 10 Feb 2026