Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with pdf
A curated list of projects in awesome lists tagged with pdf .
https://github.com/clonemeagain/attachment_preview
osTicket Plugin: Allows inline view of attachments
attachment osticket pdf plugin
Last synced: 09 Nov 2024
https://github.com/GerHobbelt/qiqqa-open-source
The open-sourced version of the award-winning Qiqqa research management tool for Windows (a bleeding edge dev fork) ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ☞☞☞ File any issues you find in the main repo issue tracker at https://github.com/jimmejardine/qiqqa-open-source/issues
citations document-classification document-management meta-analysis metadata mupdf pdf qiqqa tesseract
Last synced: 06 Nov 2024
https://github.com/vladocar/pdfsave
Convert websites into readable PDFs
cli node node-js nodejs pdf pdf-converter pdf-generation readability readable
Last synced: 20 Nov 2024
https://github.com/songsterq/react-native-pdf-thumbnail
react native module for generating thumbnail for PDF files
Last synced: 10 Jan 2025
https://github.com/wazzabeee/copy-spotter
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
beautifulsoup bs4 docx odt pdf plagiarism plagiarism-check plagiarism-checker plagiarism-detection plagiarism-detector python side-by-sidediff similarity similarity-detection similarity-score txt
Last synced: 16 Dec 2024
https://github.com/zagoli/simple-typst-thesis
A template to use for a simple thesis written in Typst.
Last synced: 14 Jan 2025
https://github.com/carleslc/readteractive
Tool for writing and generating interactive books.
book e-book epub gamebook html interactive interactive-fiction interactive-storytelling kindle latex mobi narrative pandoc pdf reading story storybook storytelling writing writing-tool
Last synced: 28 Oct 2024
https://github.com/aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
extraction machine-learning pdf
Last synced: 29 Jan 2025
https://github.com/gaborvecsei/pdf-split-merge
simple pdf file split and merge tool
pdf pdf-document pypdf2 python python3
Last synced: 28 Oct 2024
https://github.com/mbrn/filefy
A javascript library to produce downloadable files such as in CSV, PDF, XLSX, DOCX formats
csv excel export file-export javascript pdf word
Last synced: 19 Dec 2024
https://github.com/gaborvecsei/Pdf-Split-Merge
simple pdf file split and merge tool
pdf pdf-document pypdf2 python python3
Last synced: 27 Nov 2024
https://github.com/schorschii/simple-signer
Sign and certify PDF files on Linux with optional visual stamp using a .p12/.pfx certificate file
certificate certify p12 pdf sign signature
Last synced: 20 Dec 2024
https://github.com/linuxerwang/pdf-quench
A visual tool for cropping pdf files
Last synced: 26 Jan 2025
https://github.com/j3-tech/office-document-converter
Office Document Convertor (ODC) is an online convertor for office document which runs as a web service. Its aim is to provide the facility of converting almost all office documents into image which make office documents viewable even without any office suite software installed on your machines.
javascript odc office-document office-document-converter office-documents-viewable office-suite pdf
Last synced: 13 Dec 2024
https://github.com/tanaikech/pdfapp
This is a Google Apps Script library for managing PDFs.
gas gas-library google-apps-script google-apps-script-library googleappsscript library pdf pdf-forms pdf-generation pdf-management pdf-manager
Last synced: 11 Nov 2024
https://github.com/strukturag/pdfdraw
Nextcloud app to annotate PDF documents
app javascript nextcloud node pdf socket-io socketio
Last synced: 13 Nov 2024
https://github.com/wolfgangfahl/pdfindexer
Index and search PDF files using Apache Lucene and PDF Box
Last synced: 19 Dec 2024
https://github.com/rubensworks/scholarmarkdown
A framework for writing markdown-based scholarly articles.
article html markdown pdf scholarly-articles
Last synced: 02 Feb 2025
https://github.com/lfoppiano/structure-vision
Viewer for the structure extracted by Grobid on PDF documents
documents grobid hamburger-to-cow pdf streamlit structure
Last synced: 20 Dec 2024
https://github.com/augus1999/pypdfeditor-gui
A desktop application to merge, split, add watermark, rotate and rearrange pages, and set password to PDF files. 🖥️ pip install pyPDFeditor-GUI
desktop-application gui pdf pyqt6 python
Last synced: 29 Jan 2025
https://github.com/J3-Tech/Office-Document-Converter
Office Document Convertor (ODC) is an online convertor for office document which runs as a web service. Its aim is to provide the facility of converting almost all office documents into image which make office documents viewable even without any office suite software installed on your machines.
javascript odc office-document office-document-converter office-documents-viewable office-suite pdf
Last synced: 21 Nov 2024
https://github.com/captn3m0/cosmere-books
Script to build EPUB/MOBI/PDF versions of various books by Brandon Sanderson
brandon-sanderson calibre converts cosmere ebook-generator epub html mobi mythwalker pdf stormlightachives tor warbreaker
Last synced: 31 Dec 2024
https://github.com/bsorrentino/pdf-tools
Extract Markdown + Images from PDF
Last synced: 27 Oct 2024
https://github.com/siddhantsadangi/pdf-workdesk
A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.
pdf pdf-document pdf-document-processor pdf-files pdf-viewer pdfkit python streamlit webapp
Last synced: 27 Oct 2024
https://github.com/pxlrbt/laravel-pdfable
Keep your PDFs logic in one place.
laravel laravel-package pdf pdf-generation php
Last synced: 29 Jan 2025
https://github.com/thoglu/jammy_flows
A package to describe amortized (conditional) normalizing-flow PDFs defined jointly on tensor products of manifolds with coverage control. The connection between different manifolds is fixed via an autoregressive structure.
manifolds normalizing-flows pdf probability-distribution
Last synced: 17 Nov 2024
https://github.com/codedotjs/interviewcake
:unicorn: Read Interview Cake questions and solutions.
algorithms challenge challenges competitive-programming datastructures interview interview-practice interviewcake-questions javascript pdf pdf-files questions solve
Last synced: 18 Dec 2024
https://github.com/NielsSteensma/Dhalang
Generate PDFs and make screenshots of HTML using Puppeteer in Ruby
pdf pdf-generation puppeteer screenshot
Last synced: 24 Nov 2024
https://github.com/jp-fosterson/pandoc-novel
Markdown text to a novel in ePub and PDF.
epub latex markdown pandoc pdf writing writing-tool
Last synced: 16 Nov 2024
https://github.com/rimiti/html-invoice-template
Simple invoice html template
html invoice invoice-it order pdf template
Last synced: 28 Dec 2024
https://github.com/itext/itext-pdfsweep-java
Remove content from your digital documents irretrievably instead of just covering it up. Redact text, images, parts of images or drawings for complete confidentiality. iText pdfSweep complies with GDPR for data redaction.
Last synced: 19 Dec 2024
https://github.com/ashutoshvarma/pyxpdf
Fast and memory-efficient Python PDF Parser based on xpdf sources
cython pdf pdf-converter pdf-parser pdfparser pdftohtml pdftopng pdftotext python xpdf xpdf-reader
Last synced: 07 Nov 2024
https://github.com/ddomingof/pubmed2pdf
A Python package to download full article PDFs from OA publications
articles-google-scholar open-access pdf pubmed scientific-literature scientific-publications
Last synced: 30 Jan 2025
https://github.com/thelinuxuser-choice/anoncracker
A single tool to bruteforce pdf , zip and hashes very super fast tool developed with python3
accurate bruteforce hash-bruteforce hashes pdf pdf-bruteforce python3 superfast zip zip-bruteforce
Last synced: 10 Nov 2024
https://github.com/contributte/pdf
📮 PDF toolbox for Nette Framework using mPDF
contributte mpdf nette pdf php
Last synced: 29 Jan 2025
https://github.com/isuruwa/PDF-TOOLBOX
A Multi Purpose PDF Toolkit
pdf pdf-bruteforce pdf-decrypt pdf-encryption pdf-info pdf-merger pdf-splitter pdf-to-audio pdf-to-text pdf-tools pdf-watermark text-to-pdf
Last synced: 06 Nov 2024
https://github.com/andyclifton/accessibility
A CTAN-compliant version of the LaTeX `accessibility` package
accessibility latex-package pdf
Last synced: 11 Oct 2024
https://github.com/restorecommerce/pdf-rendering-srv
HTML to PDF rendering Microservice based on Chrome. Self-hosted service for rendering PDFs.
chrome grpc microservice pdf renderer rust
Last synced: 09 Nov 2024
https://github.com/itext/itext-pdfocr-dotnet
pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract
Last synced: 19 Dec 2024
https://github.com/nyg/htmlwithimagestopdf
Generate a PDF from an HTML file containing images (iOS).
html html-to-pdf ios pdf uimarkuptextprintformatter uiprintinteractioncontroller uiwebview wkwebview
Last synced: 12 Nov 2024
https://github.com/dipankardas011/pdf-editor
A web-based PDF Editor tool
argocd docker dockercompose express-js go golang hacktoberfest helm helm-charts html-css jest kubernetes nodejs npm pdf redis tekton-pipelines
Last synced: 21 Jan 2025
https://github.com/darkfriend/dev2fun.imagecompress
Оптимизация картинок - автоматически и без сторонних сервисов для 1С-Битрикс
avif bitrix gif image-compression jpeg module optimization pdf png solution svg webp
Last synced: 18 Dec 2024
https://github.com/mtgrosser/pdfunite
Merge PDF files with Ruby, no Java required
joining-pdf-files pdf pdf-files pdfunite ruby
Last synced: 07 Nov 2024
https://github.com/nurupo/paper-store
Cold store small files on paper as QR codes -- PGP keys, Bitcoin keys, Tox keys or any other small files in general.
bitcoin-wallet cold-storage gpg pdf pgp printing qr qr-code qrcode tox
Last synced: 11 Oct 2024
https://github.com/alhassy/agdacheatsheet
Basics of the dependently-typed functional language Agda ^_^
agda cheatsheet emacs html pdf
Last synced: 28 Oct 2024
https://github.com/mariosgkmeng/straightforward-obsidian2latex
A complete converter from Obsidian to Latex (write your scientific papers in Obsidian --> print to LateX)
automation latex obsidian pdf pdf-generation scientific-writing
Last synced: 22 Dec 2024
https://github.com/alhassy/AgdaCheatSheet
Basics of the dependently-typed functional language Agda ^_^
agda cheatsheet emacs html pdf
Last synced: 11 Nov 2024
https://github.com/kiwi0fruit/pyppdf
Pyppeteer PDF. Print html sites and files to pdf via pyppeteer (uses patched pyppeteer that by default downloads updated Chromium revision via https with certifi).
html-to-pdf html2pdf pdf puppeteer pyppeteer python
Last synced: 12 Oct 2024
https://github.com/AuburnSounds/printed
Generate PDF/SVG/HTML with D, with a Canvas-style API. Now with a flow document emitter.
canvas generation pdf printable svg vector-graphics
Last synced: 05 Nov 2024
https://github.com/danilofreire/rmarkdown-templates
A collection of personal templates for RMarkdown
article letter pdf rmarkdown rmarkdown-templates syllabus template title-page
Last synced: 25 Dec 2024
https://github.com/poteto/peedy
PDF watermarking microservice written in Elixir. For educational purposes only - not meant for production
Last synced: 20 Oct 2024
https://github.com/shivaprsd/doq
The missing reader mode/color schemes tool for PDFs. 👓
color-schemes dark-mode es6-module eye-comfort pdf pdfjs reader-mode
Last synced: 27 Oct 2024
https://github.com/z88dk/techdocs
Reference documents for z88dk
documentation pdf technical z180 z80 z88dk
Last synced: 16 Nov 2024
https://github.com/kermitt2/biblio_glutton_harvester
Open Access PDF harvester
harvester openaccess pdf pubmed-central s3-storage swift-storage unpaywall
Last synced: 31 Oct 2024
https://github.com/LudwigCRON/undulate
digital-timing-diagrams eps json pdf png svg timing-diagram wavedrom waveform
Last synced: 17 Nov 2024
https://github.com/mypebble/rest-framework-latex
A LaTeX renderer for Django REST Framework
django django-rest-framework latex latex-renderer pdf rest rest-framework tex
Last synced: 25 Nov 2024
https://github.com/docspedia/docspedia
Chat with your pdf using your local LLM, OLLAMA client.
chatbot chatwithdocs chatwithpdf docspedia document documents llm localllm ollama ollama-chat ollama-client pdf privategpt
Last synced: 01 Nov 2024
https://github.com/wenyalintw/google-patents-scraper
Automatically download all PDF files of searching results & their patent families found on Google Patents.
crawler google-patents patent patents pdf scraper scraping scrapy web-scraping
Last synced: 11 Nov 2024
https://github.com/kermitt2/biblio-glutton-harvester
Open Access PDF harvester
harvester openaccess pdf pubmed-central s3-storage swift-storage unpaywall
Last synced: 21 Dec 2024
https://github.com/compdfkit/compdfkit-api-java
A Java component library for integrating with ComPDFKit API to build a PDF Viewer and Editor.
api compdfkit-api java pdf pdf-converter pdf-document pdf-editor pdf-viewer
Last synced: 12 Nov 2024
https://github.com/peetzweg/papr
Command line tool to generate a PDF Calendars
cairo calendar calendaring manual pango paper pdf printer
Last synced: 05 Jan 2025
https://github.com/mindsdb/aipdf
A tool to OCR PDFs using gen-AI models
ai gpt groq hacktoberfest ocr ollama openai pdf
Last synced: 31 Jan 2025
https://github.com/navjotdhanawat/dynamic-html-pdf
Create dynamic PDF from HTML template using handlebars
content dynamic dynamic-html-pdf handlebar-template html pdf render
Last synced: 24 Nov 2024
https://github.com/pontedilana/weasyprintbundle
Easily create PDF in Symfony by converting HTML using WeasyPrint
bundle pdf pdf-generation php symfony symfony-bundle weasyprint
Last synced: 02 Feb 2025
https://github.com/randombenj/modern-latex-theme
A simple and modern LaTeX theme which can be used with pandoc
hacktoberfest latex latex-template modern modern-latex-theme pandoc pdf pdf-theme
Last synced: 08 Nov 2024
https://github.com/tanaikech/PDFApp
This is a Google Apps Script library for managing PDFs.
gas gas-library google-apps-script google-apps-script-library googleappsscript library pdf pdf-forms pdf-generation pdf-management pdf-manager
Last synced: 27 Oct 2024
https://github.com/aminya/tocpdf
Generates bookmarks from the table of contents already available at the beginning of pdf files.
bookmark bookmarker ocr outline pdf table-of-contents tableofcontents toc
Last synced: 13 Jan 2025
https://github.com/nucleos/nucleosdompdfbundle
📜 This bundle provides a wrapper for using dompdf inside symfony.
bundle dompdf hacktoberfest pdf php symfony symfony-bundle wrapper
Last synced: 23 Jan 2025
https://github.com/jghauser/auto-pandoc.nvim
Use pandoc to convert markdown files according to options from a yaml block
conversion lua markdown neovim nvim pandoc pdf plugin preview
Last synced: 27 Oct 2024
https://github.com/devleoko/painless-pdf
Painless PDF generation for node and the browser
browser nodejs pdf pdf-generation typescript
Last synced: 02 Jan 2025
https://github.com/norconex/importer
Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
extract html java java-library manipulation norconex-importer parse pdf
Last synced: 31 Jan 2025
https://github.com/toolbuddy/docogen
A document tool within your masterpiece.
documentation-generator json latex-pdf markdown miktex pdf standalone
Last synced: 24 Nov 2024
https://github.com/navchandar/look-like-scanned
Python package to make documents look like they were scanned
image-to-pdf pdf pdf-scanner pdf-to-scan python scanner
Last synced: 27 Dec 2024
https://github.com/supermemo/SuperMemoAssistant.Plugins.PDF
Incremental PDF for SuperMemo
incremental incremental-reading knowledge pdf supermemo
Last synced: 11 Nov 2024
https://github.com/jasonkhew96/docs2pdf
Bulk convert word/powerpoint/excel file to pdf.
excel office pdf powerpoint visual-basic word
Last synced: 21 Jan 2025
https://github.com/javiercanon/pdf-oeditor.js
📑 HTML5 Javascript Online Offline Browser Editor 📜 for PDF files for insert sign, images, photos, comments or annotations
asp-net-core html5 insert-sign javascript js mozilla-pdf net-core pdf pdf-document pdf-files pdf-oeditor pdf-sign pdf-signature pdf-signer pdf-viewer
Last synced: 16 Nov 2024
https://github.com/baltpeter/scanprep
Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.
hacktoberfest image-processing pdf scanned-documents scanning
Last synced: 27 Dec 2024
https://github.com/goerz/bmconverter.py
Work with text files describing PDF bookmarks (stable)
Last synced: 11 Oct 2024
https://github.com/JavierCanon/PDF-OEditor.js
📑 HTML5 Javascript Online Offline Browser Editor 📜 for PDF files for insert sign, images, photos, comments or annotations
asp-net-core html5 insert-sign javascript js mozilla-pdf net-core pdf pdf-document pdf-files pdf-oeditor pdf-sign pdf-signature pdf-signer pdf-viewer
Last synced: 04 Dec 2024
https://github.com/electrovir/statement-parser
Parse bank and credit card statements
bank chase citi credit-card finances financial parser pdf pdf-to-json statement usaa
Last synced: 16 Oct 2024
https://github.com/icinga/icingaweb2-module-pdfexport
PDF export functionality for Icinga Web 2
Last synced: 15 Nov 2024
https://github.com/hhhrrrttt222111/resume-builder
Resume maker made using React.
material-ui pdf pdf-generation pdf-react react react-pdf reactjs resume resume-builder resume-creator
Last synced: 14 Dec 2024
https://github.com/termuxhackz/pdfbruter
This is an offline password cracking Tools which cracks any PDF which are password protected using an inbuilt wordlist Dictionary.!
bruteforce pdf pdf-cracker python3
Last synced: 21 Nov 2024
https://github.com/ColonelBuendia/rgpipe
lesspipe for ripgrep for common new filetypes using few dependencies
ebooks excel full-text-search indexing office pdf powerpoint ripgrep search word
Last synced: 06 Nov 2024
https://github.com/abbaye/rdlcprinter
Wpf ReportViewer control and RDLCPrinter work with SQL Server LocalReport. You can export your report to PDF, Word or PNG Image...
functionnality pdf printer rdl rdlc report reporting server sql usercontrol wpf
Last synced: 07 Nov 2024
https://github.com/shield-wall/myprofile
Generate your resume in markdown easily from Github actions ✅ using discussion section 📃 🚀
curriculum curriculum-vitae cv cv-template github-actions hacktoberfest job jobs markdown pdf pipelines resume resume-template
Last synced: 25 Nov 2024
https://github.com/cajuncoding/apachefop.serverless
A ready to use serverless implementation of Apache FOP via Azure Functions. This provides a micro-service for dynamically rendering quality PDF binary outputs from XSL-FO source using Apache FOP. When combined with the ease and simplicity of Azure Functions this project is a powerful, efficient, and scalable PDF Reporting Service that generates high quality, true paged media, reports for any environment and any client technology (.Net, NodeJS/JavaScript, Ruby, Mobile iOS/Android, Powershell, even Windows/Mac apps, etc.)!
apache-fop azure-functions fonet fop micro-service microservice pdf pdf-as-a-service pdf-files pdf-generation pdf-rendering pdf-templating reports reportserver reportservice serverless serverless-architectures xsl-fo xslfo xslt
Last synced: 28 Oct 2024
https://github.com/agentcooper/pdf-annotation-service
Client and service for embedding highlights into PDF documents
Last synced: 13 Nov 2024
https://github.com/pykong/pypergrabber
Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
crawler email-inbox google-scholar pdf pmid pubmed python sci-hub scraper
Last synced: 08 Nov 2024
https://github.com/chazeon/pdf-watermark-remover
Remove PDF watermarks from academic papers using pypdf
academic-papers pdf pdf-manipulation pdf-watermark pypdf2 watermark-remover
Last synced: 29 Nov 2024