Projects in Awesome Lists tagged with pdfs
A curated list of projects in awesome lists tagged with pdfs .
https://github.com/tabulapdf/tabula-java
Extract tables from PDF files
extracting-tables extraction-engine pdfs
Last synced: 08 Apr 2025
https://github.com/trimstray/technical-whitepapers
Collection of IT whitepapers, presentations, pdfs; hacking, web app security, db, reverse engineering and more; EN/PL.
bsd databases devops hacking linux manuals pdfs pentesters security security-researchers sysadmin whitepapers
Last synced: 06 Apr 2025
https://github.com/harishdeivanayagam/rowfill
Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers
document document-extraction document-parsing image-ocr langgraph llama llm nextjs ocr ocr-javascript ollama openai pdf pdfs unstructured unstructured-data vision vision-api
Last synced: 13 Apr 2025
https://github.com/bobld/tabula-sharp
Extract tables from PDF files (port of tabula-java)
csharp dotnet extract extract-table extracting-tables extraction extraction-engine netstandard pdf-table-extract pdf-table-extraction pdfparser pdfpig pdfs table table-extraction tabula tabula-java tabula-sharp
Last synced: 15 Apr 2025
https://github.com/timvink/mkdocs-print-site-plugin
MkDocs Plugin that adds an additional page that combines all pages, allowing easy exports to PDF and standalone HTML.
mkdocs mkdocs-plugin pdf pdfs print
Last synced: 07 Apr 2025
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 04 Apr 2025
https://github.com/apurvmishra99/pdf-to-scan
Make your PDFs look like they were scanned
ghostscript imagemagick pdfs scan scanned-documents
Last synced: 03 Apr 2025
https://github.com/lzkelley/kalepy
Kernel Density Estimation and (re)sampling
kde kernel-density-estimation pdfs python python3 resample resampling statistical-analysis statistics
Last synced: 13 Feb 2025
https://github.com/bobld/camelot-sharp
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).
camelot camelot-sharp csharp dotnet extract-table extracting-tables extraction extraction-engine netstandard opencv pdf-table-extract pdf-table-extraction pdfparser pdfpig pdfs table table-extraction
Last synced: 14 Apr 2025
https://github.com/kairavkkp/merge-pdf
My first PyPi Package. Merge Image and PDF files using customizations within a folder using the Command line.
epubs hacktoberfest hacktoberfest2021 merge-images merge-pdf merge-pdf-images pdf-files pdfs
Last synced: 19 Mar 2025
https://github.com/divinemonk/notes-for-hackers
Study material (pdfs, notes, free course download links etc) for HACKERS
api articles books computer cyber-security cybersecurity documents ebook ebooks ethical-hacking files hacker hackers hacking multimedia notes pdfs reads study study-notes
Last synced: 30 Mar 2025
https://github.com/thawkin3/html-to-pdf-demo
Demo of exporting HTML content as PDFs using various html-to-pdf libraries
html javascript pdf-demo pdf-libraries pdfs
Last synced: 14 Apr 2025
https://github.com/lundegaard/gatsby-plugin-pdf
Gatsby plugin that is able generate PDFs out of your gatsby web pages
gatsby gatsby-plugin pdfs puppetter react
Last synced: 10 Feb 2025
https://github.com/hadro/new-york-city-directories
Some basic data and text extraction from the New York City Directories
brooklyn digital-humanities hocr manhattan new-york-city-directories nypl ocr pdfs
Last synced: 12 Mar 2025
https://github.com/the-swarm-corporation/doc-master
A powerful, lightweight Python library for automated file reading and content extraction. Doc Master simplifies the process of reading various file formats into string representations, making it perfect for data processing, content analysis, and document management systems.
agents ai chatagents chatwithyourpdf doc-agents docs pdfs production-grade-aegnts swarms swarms-of-agents txts
Last synced: 30 Jan 2025
https://github.com/txuswashere/snes-manuals
Collection of All US SNES Manuals and All PAL Exclusive manuals.
documentation documents game-manual gamemanuals manual manuals pdf pdf-files pdfs snes snes-manuals snesmanuals super-nintendo super-nintendo-entertainment-system supernintendo
Last synced: 25 Mar 2025
https://github.com/akshaybadola/ref-man
Emacs plugin to manage bibliography having tight integration with org-mode and eww.
bibliography bibtex elisp emacs eww org-mode pdfs science-parse semanticscholar
Last synced: 23 Feb 2025
https://github.com/dave-007/presentations
Repository for source files in my past and upcoming presentations. Powershell, SQL, Azure or AWS command line script files, code for examples and demos, supporting slides as PDFs.
awscli azure-cli demos pdfs powershell sql supporting-slides
Last synced: 04 Mar 2025
https://github.com/txuswashere/nes-manuals
Manuals for the games released on the NES (Nintendo Entertainment System)
documentation documents manual manuals nes nintendo-entertainment-system pdf pdfs
Last synced: 25 Mar 2025
https://github.com/imvickykumar999/unacademy
Storage for PDFs and other Study Materials... PLUSNQ45S https://unacademy.com/goal/gate-ese/PESHE/practice?subject_uid=PQQFK
2023 air1 gate pdfs practice sanchit-jain unacademy
Last synced: 24 Mar 2025
https://github.com/conflictingtheories/contractjs
A javascript framework for drafting up contracts using javascript. *See License file for information*
contract contracts engine javascript javascript-framework latex latex-document latex-pdf latex-template pdf pdf-document pdf-generation pdfs transpiler
Last synced: 05 Mar 2025
https://github.com/biglocalnews/usc-crime-reports-scraper
A GitHub Action workflow for automating the collection of crime and fire logs posted by the University of Southern California's Department of Public Safety.
actions crime data-journalism journalism news pdfs python web-scraper
Last synced: 18 Apr 2025
https://github.com/txuswashere/yo-tenia-un-juego
https://www.yoteniaunjuego.com/
arcade arcade-game arcade-games games gaming magazine magazines pdf pdfs retro-gaming retrogame retrogames retrogaming yoteniaunjuego
Last synced: 25 Mar 2025
https://github.com/kshithijiyer/mynotes
This is a repository where I'll upload all my notes which can be helpful for anyone and everyone who needs help.
jpg-images markdown notes pdfs
Last synced: 13 Apr 2025
https://github.com/democritus-project/d8s-pdfs
Democritus functions for working with PDFs.
democritus pdf pdfs pdfs-utility python utility
Last synced: 01 Apr 2025
https://github.com/niedsonemanoel/lib-geradora-de-certificado-pdf
Biblioteca simples feita com o ejs com o instuito de gerar certificados em pdf.
Last synced: 24 Mar 2025
https://github.com/ansh420/fastapi
API links for the text Extract from Url's and pdf's and finding similar words using cosine.
docker-container fastapi ma pdfs python3 urls
Last synced: 05 Apr 2025
https://github.com/kavex/pdf-combine
Allows you to combine multi pdfs into into one pdf. Add, Rearrange, or Delete Pages.
add combine combiner edit export pdf pdf-converter pdf-document pdf-files pdfs python python-3 python-app python-script python3 remove
Last synced: 07 Mar 2025
https://github.com/dulajkavinda/pdf-lecture
📄Convert PDFs into vanilla text.
nodejs npm npm-package open-source pdf-converter pdf-lecture pdf2json pdfs
Last synced: 01 Apr 2025
https://github.com/applitools/example-image-tester-cli
Applitools Example: Image Tester CLI
applitools applitools-example applitools-image-tester applitools-tutorial image-files images pdfs screenshots test-automation tutorial
Last synced: 06 Mar 2025
https://github.com/benweston/course-certificates
Serves course certificate PDFs to the benweston repo.
cloud-storage cloud-storage-bucket course-certificates pdfs storage terraform
Last synced: 06 Mar 2025
https://github.com/hitthecodelabs/pdfs_textreplacement
A Python utility for replacing specific text in a PDF file.
Last synced: 29 Mar 2025
https://github.com/hbiede/voter-tokens
A Ruby tool to generate tokens for voters based on delegate counts for online elections and then validate those tokens when counting votes
ballots counting-votes google-sheets latex online-elections parse-ballots pdfs voter-tokens
Last synced: 03 Apr 2025
https://github.com/gabrielzschmitz/research
A repository showcasing my research papers, summaries, and bibliographic resources with organized access and reference materials.
academic bibliography latex papers pdfs research summaries
Last synced: 07 Apr 2025