Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-pdf
A curated list of resources around PDF files
https://github.com/py-pdf/awesome-pdf
Last synced: 4 days ago
JSON representation
-
Multi-Purpose Libraries
- pypdf - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive): a free and open-source pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
- pdftk - line tool for working with PDFs. It is commonly used for client-side scripting or server-side processing of PDFs.
- pikepdf - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive): a Python library for reading and writing PDF, powered by qpdf
- PyMuPDF - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-render-inactive): Python bindings to MuPDF.
- pdftk - line tool for working with PDFs. It is commonly used for client-side scripting or server-side processing of PDFs.
- pypdf - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive): a free and open-source pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
- pikepdf - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive): a Python library for reading and writing PDF, powered by qpdf
- PyMuPDF - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-render-inactive): Python bindings to MuPDF.
- pypdfium2 - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive) ![](https://shields.io/badge/-render-inactive): Python bindings to PDFium.
- borb - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive): reading, creating and manipulating PDF files in python
- pdfcpu - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive): batch processing and scripting via a rich command line
- pdf-lib - manipulate-inactive) ![](https://shields.io/badge/-create-inactive): Create and modify PDF documents in any JavaScript environment
- HexaPDF - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive): A pure Ruby PDF creation and manipulation library
- pypdfium2 - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive) ![](https://shields.io/badge/-render-inactive): Python bindings to PDFium.
- borb - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive): reading, creating and manipulating PDF files in python
- pdfcpu - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive): batch processing and scripting via a rich command line
- pdf-lib - manipulate-inactive) ![](https://shields.io/badge/-create-inactive): Create and modify PDF documents in any JavaScript environment
- HexaPDF - extract-inactive) ![](https://shields.io/badge/-manipulate-inactive) ![](https://shields.io/badge/-create-inactive): A pure Ruby PDF creation and manipulation library
-
The File Format
- PDF Reference 1.6
- PDF Reference 1.7 (ISO 32000, 2008)
- PDF Reference 2.0 (ISO 32000-2:2020)
- XMP Specification Part 3
- PDF Specification Index
- X is not enough! Grab the PDF by the tail!
- PDF Specification Index
- X is not enough! Grab the PDF by the tail!
- Adobe Open Source Reference
- PDF Reference 1.0
- PDF Reference 1.2
- PDF Reference 1.3
- PDF Reference 1.4
- PDF Reference 1.5 (v6)
- Adobe Open Source Reference
- PDF Reference 1.0
- PDF Reference 1.2
- PDF Reference 1.3
- PDF Reference 1.4
- PDF Reference 1.5 (v6)
- PDF Reference 1.6
- PDF Reference 1.7 (ISO 32000, 2008)
- PDF Reference 2.0 (ISO 32000-2:2020)
- XMP Specification Part 3
- PDF Reference 1.5 (v6)
- PDF Reference 1.0
- PDF Reference 1.3
- PDF Reference 1.4
- PDF Reference 1.2
-
Viewers
- KOReader - ink readers
- react-native-pdf
- PdfViewPager
- vue-pdf
- KOReader - ink readers
- react-native-pdf
- PdfViewPager
- vue-pdf
-
Data Extraction
-
Generators
-
Manipulators
- pdfarranger - gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using a graphical interface
- OCRmyPDF
- pdfarranger - gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using a graphical interface
- OCRmyPDF
-
File Analysis / Security
- Pdfalyzer
- Malicious PDF Generator - home functionality
- pdfbox - app-x.y.z.jar debug pdf_file`
- Pdfalyzer
- Malicious PDF Generator - home functionality
- pdfbox - app-x.y.z.jar debug pdf_file`
Programming Languages
Categories
Sub Categories
Keywords
pdf
44
python
12
pdf-generation
8
pdf-documents
8
library
6
ocr
6
table-extraction
4
epub
4
android
4
tesseract
4
python3
4
document-understanding
2
document-layout-analysis
2
document-intelligence
2
document-analysis
2
document-ai
2
deep-learning
2
awesome-list
2
awesome
2
table
2
for-humans
2
extract
2
tables
2
scraping
2
excel
2
information-extraction
2
intelligent-processing
2
key-information-extraction
2
machine-learning
2
natural-language-processing
2
nlp
2
robotic-process-automation
2
rpa
2
unstructured-data
2
pdf-generator
2
prawn
2
ruby
2
flexbox
2
react
2
renderer
2
ubuntu-touch
2
remarkable-tablet
2
reflow
2
reader
2
pocketbook
2
pdf-reflow
2
opds
2
luajit
2
kobo
2
kindle
2