An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with pdfs

A curated list of projects in awesome lists tagged with pdfs .

https://github.com/tabulapdf/tabula-java

Extract tables from PDF files

extracting-tables extraction-engine pdfs

Last synced: 08 Apr 2025

https://github.com/trimstray/technical-whitepapers

Collection of IT whitepapers, presentations, pdfs; hacking, web app security, db, reverse engineering and more; EN/PL.

bsd databases devops hacking linux manuals pdfs pentesters security security-researchers sysadmin whitepapers

Last synced: 06 Apr 2025

https://github.com/harishdeivanayagam/rowfill

Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers

document document-extraction document-parsing image-ocr langgraph llama llm nextjs ocr ocr-javascript ollama openai pdf pdfs unstructured unstructured-data vision vision-api

Last synced: 13 Apr 2025

https://github.com/timvink/mkdocs-print-site-plugin

MkDocs Plugin that adds an additional page that combines all pages, allowing easy exports to PDF and standalone HTML.

mkdocs mkdocs-plugin pdf pdfs print

Last synced: 07 Apr 2025

https://github.com/apurvmishra99/pdf-to-scan

Make your PDFs look like they were scanned

ghostscript imagemagick pdfs scan scanned-documents

Last synced: 03 Apr 2025

https://github.com/kairavkkp/merge-pdf

My first PyPi Package. Merge Image and PDF files using customizations within a folder using the Command line.

epubs hacktoberfest hacktoberfest2021 merge-images merge-pdf merge-pdf-images pdf-files pdfs

Last synced: 19 Mar 2025

https://github.com/thawkin3/html-to-pdf-demo

Demo of exporting HTML content as PDFs using various html-to-pdf libraries

html javascript pdf-demo pdf-libraries pdfs

Last synced: 14 Apr 2025

https://github.com/lundegaard/gatsby-plugin-pdf

Gatsby plugin that is able generate PDFs out of your gatsby web pages

gatsby gatsby-plugin pdfs puppetter react

Last synced: 10 Feb 2025

https://github.com/hadro/new-york-city-directories

Some basic data and text extraction from the New York City Directories

brooklyn digital-humanities hocr manhattan new-york-city-directories nypl ocr pdfs

Last synced: 12 Mar 2025

https://github.com/the-swarm-corporation/doc-master

A powerful, lightweight Python library for automated file reading and content extraction. Doc Master simplifies the process of reading various file formats into string representations, making it perfect for data processing, content analysis, and document management systems.

agents ai chatagents chatwithyourpdf doc-agents docs pdfs production-grade-aegnts swarms swarms-of-agents txts

Last synced: 30 Jan 2025

https://github.com/akshaybadola/ref-man

Emacs plugin to manage bibliography having tight integration with org-mode and eww.

bibliography bibtex elisp emacs eww org-mode pdfs science-parse semanticscholar

Last synced: 23 Feb 2025

https://github.com/dave-007/presentations

Repository for source files in my past and upcoming presentations. Powershell, SQL, Azure or AWS command line script files, code for examples and demos, supporting slides as PDFs.

awscli azure-cli demos pdfs powershell sql supporting-slides

Last synced: 04 Mar 2025

https://github.com/txuswashere/nes-manuals

Manuals for the games released on the NES (Nintendo Entertainment System)

documentation documents manual manuals nes nintendo-entertainment-system pdf pdfs

Last synced: 25 Mar 2025

https://github.com/imvickykumar999/unacademy

Storage for PDFs and other Study Materials... PLUSNQ45S https://unacademy.com/goal/gate-ese/PESHE/practice?subject_uid=PQQFK

2023 air1 gate pdfs practice sanchit-jain unacademy

Last synced: 24 Mar 2025

https://github.com/conflictingtheories/contractjs

A javascript framework for drafting up contracts using javascript. *See License file for information*

contract contracts engine javascript javascript-framework latex latex-document latex-pdf latex-template pdf pdf-document pdf-generation pdfs transpiler

Last synced: 05 Mar 2025

https://github.com/biglocalnews/usc-crime-reports-scraper

A GitHub Action workflow for automating the collection of crime and fire logs posted by the University of Southern California's Department of Public Safety.

actions crime data-journalism journalism news pdfs python web-scraper

Last synced: 18 Apr 2025

https://github.com/anuraganalog/365-data-science

A Repository which contains lecture notes, exercise, solutions

365 data exercises ipynb lecture notes pdfs python python3 science solutions sql

Last synced: 12 Mar 2025

https://github.com/kshithijiyer/mynotes

This is a repository where I'll upload all my notes which can be helpful for anyone and everyone who needs help.

jpg-images markdown notes pdfs

Last synced: 13 Apr 2025

https://github.com/democritus-project/d8s-pdfs

Democritus functions for working with PDFs.

democritus pdf pdfs pdfs-utility python utility

Last synced: 01 Apr 2025

https://github.com/niedsonemanoel/lib-geradora-de-certificado-pdf

Biblioteca simples feita com o ejs com o instuito de gerar certificados em pdf.

certificates ejs events pdfs

Last synced: 24 Mar 2025

https://github.com/ansh420/fastapi

API links for the text Extract from Url's and pdf's and finding similar words using cosine.

docker-container fastapi ma pdfs python3 urls

Last synced: 05 Apr 2025

https://github.com/kavex/pdf-combine

Allows you to combine multi pdfs into into one pdf. Add, Rearrange, or Delete Pages.

add combine combiner edit export pdf pdf-converter pdf-document pdf-files pdfs python python-3 python-app python-script python3 remove

Last synced: 07 Mar 2025

https://github.com/dulajkavinda/pdf-lecture

📄Convert PDFs into vanilla text.

nodejs npm npm-package open-source pdf-converter pdf-lecture pdf2json pdfs

Last synced: 01 Apr 2025

https://github.com/benweston/course-certificates

Serves course certificate PDFs to the benweston repo.

cloud-storage cloud-storage-bucket course-certificates pdfs storage terraform

Last synced: 06 Mar 2025

https://github.com/hitthecodelabs/pdfs_textreplacement

A Python utility for replacing specific text in a PDF file.

pdf pdfs python

Last synced: 29 Mar 2025

https://github.com/hbiede/voter-tokens

A Ruby tool to generate tokens for voters based on delegate counts for online elections and then validate those tokens when counting votes

ballots counting-votes google-sheets latex online-elections parse-ballots pdfs voter-tokens

Last synced: 03 Apr 2025

https://github.com/gabrielzschmitz/research

A repository showcasing my research papers, summaries, and bibliographic resources with organized access and reference materials.

academic bibliography latex papers pdfs research summaries

Last synced: 07 Apr 2025