awesome-pdf
A curated list of resources around PDF files
https://github.com/py-pdf/awesome-pdf
Last synced: 3 days ago
JSON representation
-
Multi-Purpose Libraries
- pypdf - extract-inactive) : a free and open-source pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
- pdftk - line tool for working with PDFs. It is commonly used for client-side scripting or server-side processing of PDFs.
- borb - extract-inactive)  : reading, creating and manipulating PDF files in python
- HexaPDF - extract-inactive)  : A pure Ruby PDF creation and manipulation library
- pdfcpu - extract-inactive)  : batch processing and scripting via a rich command line
- pdf-lib - manipulate-inactive) : Create and modify PDF documents in any JavaScript environment
- pypdf - extract-inactive) : a free and open-source pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
- HexaPDF - extract-inactive)  : A pure Ruby PDF creation and manipulation library
- pypdfium2 - extract-inactive)   : Python bindings to PDFium.
- pikepdf - extract-inactive) : a Python library for reading and writing PDF, powered by qpdf
- PyMuPDF - extract-inactive)  : Python bindings to MuPDF.
-
Data Extraction
-
Generators
-
The File Format
- PDF Reference 1.0
- PDF Reference 1.2
- PDF Reference 1.6
- PDF Reference 1.7 (ISO 32000, 2008)
- PDF Specification Index
- Adobe Open Source Reference
- PDF Reference 1.0
- PDF Reference 1.2
- PDF Reference 1.3
- PDF Reference 1.4
- PDF Reference 1.5 (v6)
- PDF Reference 1.5 (v6)
- PDF Reference 1.3
- PDF Specification Index
- PDF Specification Index
- PDF Reference 1.4
- Adobe Open Source Reference
- X is not enough! Grab the PDF by the tail!
- PDF Reference 2.0 (ISO 32000-2:2020)
- XMP Specification Part 3
-
File Analysis / Security
- pdfbox - app-x.y.z.jar debug pdf_file`
- Malicious PDF Generator - home functionality
- Pdfalyzer
-
Viewers
- PdfViewPager
- KOReader - ink readers
- react-native-pdf
- vue-pdf
-
Manipulators
- OCRmyPDF
- pdfarranger - gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using a graphical interface
Programming Languages
Categories
Sub Categories
Keywords
pdf
22
python
6
pdf-documents
4
pdf-generation
4
ocr
3
library
3
table-extraction
2
python3
2
epub
2
android
2
tesseract
2
extract
1
pdf-parsing
1
ruby
1
prawn
1
pdf-generator
1
image-processing
1
renderer
1
react
1
flexbox
1
tables
1
penetration-test
1
bugbounty-tool
1
bugbounty
1
qpdf
1
pypdf2
1
pikepdf
1
pdf-manipulation
1
existing-pdfs
1
template
1
md
1
markdown
1
converter
1
utf-8
1
php
1
html
1
css
1
cjk
1
table
1
for-humans
1
remarkable-tablet
1
reflow
1
reader
1
pocketbook
1
pdf-reflow
1
opds
1
luajit
1
kobo
1
kindle
1
fb2
1