An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with pdf-documents

A curated list of projects in awesome lists tagged with pdf-documents.

https://github.com/py-pdf/pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

help-wanted pdf pdf-documents pdf-manipulation pdf-parser pdf-parsing pypdf2 python

Last synced: 15 Jan 2026

https://github.com/py-pdf/PyPDF2

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

help-wanted pdf pdf-documents pdf-manipulation pdf-parser pdf-parsing pypdf2 python

Last synced: 17 Aug 2025

https://github.com/mstamy2/PyPDF2

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

help-wanted pdf pdf-documents pdf-manipulation pdf-parser pdf-parsing pypdf2 python

Last synced: 02 Apr 2025

https://github.com/pymupdf/pymupdf

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps

Last synced: 01 Apr 2026

https://github.com/pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps

Last synced: 08 Apr 2025

https://github.com/mnishihan/laravel-docs-in-pdf

Up-to-date Laravel documentation in PDF format (all versions)

docs documentation laravel laravel-pdf-docs pdf pdf-documents php

Last synced: 23 Jan 2026

https://github.com/podofo/podofo

A C++17 PDF manipulation library

cplusplus cpp pdf pdf-documents pdf-files pdf-generation

Last synced: 05 Apr 2025

https://github.com/erikkastelec/pdfscraper

CLI program for searching inside text and tables in PDF documents and displaying results in HTML.

camelot ocr ocr-analysis pdf-documents pdfminer

Last synced: 26 Jun 2025

https://github.com/alexreg/pdfmarks

Utility for modifying page markings of PDF documents

page-markings pdf-documents scripts tools utilities

Last synced: 29 Apr 2026

https://github.com/cajuncoding/pdfhelpers

Lightweight helper classes based on iTextSharp for scaling and resizing PDF documents and pages.

concatenate-files concatenate-image concatenate-pdf merge-files pdf pdf-conversion pdf-converter pdf-documents pdf-generation pdf-merge pdf-resize

Last synced: 01 May 2025

https://github.com/slootjes/dms

Dockerized PHP app to search PDF documents.

docker docker-compose document-management pdf-documents php

Last synced: 19 Apr 2026

https://github.com/timothy-bartlett/pymupdf

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

data-science extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction text-processing text-shaping xps

Last synced: 18 May 2026

https://github.com/nilostolte/pdfbox

This project offers several versions of the PDFBox source code that can be compiled with Eclipse. The complete version is an unmodified PDFBox that includes all the packages not normally included in the PDFBox source code. The other versions are modified and offer more capabilities.

eclipse java pdf-converter pdf-debugger pdf-documents pdf-viewer pdfbox pdfbox-source

Last synced: 10 Feb 2026