Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/intellisense/tc_pdf
Thumbor PDF Preview
pdf pdf-preview python thumbor
Last synced: 03 Jul 2024
https://github.com/huysentruitw/print-it
Windows service for printing PDF files to a local or network printer in the background
api api-service background pdf pdfium printing windows-service
Last synced: 03 Jul 2024
https://github.com/vinay0410/tectonic-action
Compile Tex files easily
action github-action latex latex-action pdf tectonic tex xelatex
Last synced: 03 Jul 2024
https://github.com/pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 03 Jul 2024
https://github.com/RosiePuddles/mdbook-compress
An mdBook single PDF generator using pure Rust and some Node.js
mdbook pdf pdf-generation rust
Last synced: 02 Jul 2024
https://github.com/BetaHuhn/vercel-pdf-converter
📄▲ Vercel function which generates PDFs from Webpages.
pdf pdf-converter vercel vercel-pdf-converter vercel-serverless-functions
Last synced: 02 Jul 2024
https://github.com/RD17/ambar
:mag: Ambar: Document Search Engine
ambar ambar-search ocr pdf search search-engine search-in-text self-hosted
Last synced: 02 Jul 2024
https://github.com/camelot-dev/excalibur
A web interface to extract tabular data from PDFs
Last synced: 02 Jul 2024
https://github.com/AmirTugi/tea-school
Simplified HTML + CSS --> PDF Generator for Nodejs
css html nodejs pdf pdf-generation puppeteer
Last synced: 02 Jul 2024
https://github.com/schibsted/jsx-pdf
Generate PDFs using JSX! 🎯
jsx pdf pdf-generation pdfmake
Last synced: 02 Jul 2024
https://github.com/datasciencecampus/mobius
Scripts to extract data from the COVID-19 Google Community Mobility Reports
covid dsc-projects google mobility pdf python svg
Last synced: 01 Jul 2024
https://github.com/xhtml2pdf/xhtml2pdf
A library for converting HTML into PDFs using ReportLab
html-pdf html-pdf-converter html-to-pdf html-to-pdf-converter pdf pdf-converter pdf-generation pypdf python reportlab reportlab-pdf
Last synced: 30 Jun 2024
https://github.com/MSzturc/obsidian-better-pdf-plugin
Goal of this Plugin in to implement a native PDF handling workflow in Obsidian
obsidian-md obsidian-plugin pdf
Last synced: 29 Jun 2024
https://github.com/ollm/OpenComic
Comic and Manga reader, written with Node.js and using Electron
comic comic-viewer comics-reader electron epub-reader image-viewer javascript linux macos manga manga-reader manga-viewer manwha nodejs novel-reader pdf pdf-viewer webtoon webtoon-reader windows
Last synced: 29 Jun 2024
https://github.com/schild-report/schild.report
Desktop Anwendung zum Erstellen von Schild-Reports. Installationsdateien unter 'releases'.
dokument electron nrw pdf reporting reporting-engine reporting-tool schild schule svelte svelte-components zeugnisse
Last synced: 29 Jun 2024
https://github.com/danburzo/percollate
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
cli epub html markdown pdf puppeteer readability
Last synced: 29 Jun 2024
https://github.com/bhavita/Auto-Audio-Books
Convert pdf to audiobooks 📚
audiobooks google-text-to-speech pdf pdf-to-audiobook pytesseract-ocr
Last synced: 28 Jun 2024
https://github.com/nuhmanpk/pyDF-Bot
Pydf - Pyrogram Document File Bot, a modular Telegram Bot which provides Pdf Tools Works using Pypdf2
bot pdf pypdf pypdf2 pypdf2-lib pyrogram pyrogram-bot telegram telegram-bot tools
Last synced: 28 Jun 2024
https://github.com/m4mallu/compress-pdf-tg-bot
A Telegram bot can compress the size of PDF documents. Useful for digital documentations.
Last synced: 27 Jun 2024
https://github.com/marcbachmann/node-html-pdf
This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.
html pdf pdf-converter phantomjs
Last synced: 27 Jun 2024
https://github.com/humu2009/candy-pdf-reader
Native PDF reader based on pdf.js and node-webkit.
Last synced: 27 Jun 2024
https://github.com/Helias/EPUB-to-PDF
Telegram bot EPUB to PDF converter
converter epub pdf python-telegram-bot telegram telegram-bot
Last synced: 27 Jun 2024
https://github.com/kiwi0fruit/pyppdf
Pyppeteer PDF. Print html sites and files to pdf via pyppeteer (uses patched pyppeteer that by default downloads updated Chromium revision via https with certifi).
html-to-pdf html2pdf pdf puppeteer pyppeteer python
Last synced: 27 Jun 2024
https://github.com/Mathpix/mpx-cli
CLI for document conversion for scientific documents, powered by Mathpix OCR
cli converter document-conversion format-converter html latex markdown pdf science static-site-generator
Last synced: 27 Jun 2024
https://github.com/JustinTheWhale/PDF-Dark-Mode
Converts PDF's to have a grey background to be easier on the eyes
converter linux macos numba pdf pdf-converter pdf-document pdf-document-processor pillow poppler python python3 python310 python36 python37 python38 python39 windows
Last synced: 27 Jun 2024
https://github.com/tensult/cloud-reports
Scans your AWS cloud resources and generates reports. Check out free hosted version:
analyzer aws best-practices cloud cloud-computing html json pdf puppeteer reports scans security
Last synced: 27 Jun 2024
https://github.com/venthur/gscholar
Query Google Scholar with Python
bibtex cli google-scholar gscholar pdf python
Last synced: 27 Jun 2024
https://github.com/zhaoterryy/mkdocs-pdf-export-plugin
An MkDocs plugin to export content pages as PDF files
Last synced: 26 Jun 2024
https://github.com/jstockwin/py-pdf-parser
A Python tool to help extracting information from structured PDFs.
parsing pdf pdf-parsing py-pdf-parser
Last synced: 26 Jun 2024
https://github.com/Krasjet/pdf.tocgen
A CLI toolset to generate table of contents for PDF files automatically.
cli pdf pdf-document pdf-files pymupdf scraping table-of-contents toc-generator
Last synced: 26 Jun 2024
https://github.com/paulocoutinhox/pdfium-lib
PDFium - Project to compile PDFium library to multiple platforms.
chromium cpp google library pdf pdfium webassembly
Last synced: 26 Jun 2024
https://github.com/ether/etherpad-lite
Etherpad: A modern really-real-time collaborative document editor.
collaboration collaborative collaborative-editing collaborative-framework collaborative-research collaborative-writing document documents docx etherpad libreoffice microsoft pdf pdf-generation rich-text-editor video-conference video-conferencing web-editor word
Last synced: 25 Jun 2024
https://github.com/HendrixString/Android-PdfMyXml
Convert android XML layouts into PDF document, works on all versions of Android.
android android-pdfmyxml pdf pdf-generation
Last synced: 25 Jun 2024
https://github.com/sentinl/sentinl
Kibana Alert & Report App for Elasticsearch
alarm alert alerting anomaly anomaly-detection elastic elasticsearch elk kaae kibana kibana-dashboard kibi pdf plugin reporting scheduler timeseries visualization watchdog watcher
Last synced: 25 Jun 2024
https://github.com/doo/scanbot-sdk-ios-spm
barcode document image-filter image-processing ios mrz ocr pdf qr-code scanner sdk
Last synced: 25 Jun 2024
https://github.com/thomasWeise/bookbuildeR
An R Package for Building Books or Documents using pandoc
automation azw3 calibre continuous-integration docker ebooks epub3 html5 latex markdown pandoc pdf r texlive
Last synced: 25 Jun 2024
https://github.com/atlanhq/camelot
Camelot: PDF Table Extraction for Humans
Last synced: 25 Jun 2024
https://github.com/WZBSocialScienceCenter/pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
data-mining image-processing ocr pdf python tables
Last synced: 24 Jun 2024
https://github.com/mhucka/zowie
Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)
extended-attributes file-metadata finder macos metadata pdf pdf-files zotero zotero-api zotero-link
Last synced: 24 Jun 2024
https://github.com/kermitt2/biblio_glutton_harvester
Open Access PDF harvester
harvester openaccess pdf pubmed-central s3-storage swift-storage unpaywall
Last synced: 24 Jun 2024
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 24 Jun 2024
https://github.com/tobya/DocTo
Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF
conversion converter csv csv-export delphi delphi10 docto docx excel pdf pdf-generation windows word-documents xls xls-files xlsx
Last synced: 24 Jun 2024
https://github.com/sile-typesetter/casile
The CaSILE toolkit, a book publishing workflow employing SILE and other wizardry.
books casile ebook-generator epub-generation kindlegen pandoc pdf pdf-generation publishing sile typesetting typesetting-system
Last synced: 24 Jun 2024
https://github.com/jp-fosterson/pandoc-novel
Markdown text to a novel in ePub and PDF.
epub latex markdown pandoc pdf writing writing-tool
Last synced: 24 Jun 2024
https://github.com/booktype/Booktype
Booktype is a free, open source platform that produces beautiful, engaging books formatted for print, Amazon, iBooks and almost any ereader within minutes.
books booktype django ebooks editing epub ibooks javascript pdf pdf-converter python restful
Last synced: 24 Jun 2024
https://github.com/abhijithvijayan/nice-try-scribd
Unblur Scribd Documents...
chrome-extension firefox-addon pdf scribd unblur
Last synced: 24 Jun 2024
https://github.com/bernaferrari/ChangeDetection
Automatically track websites changes on Android in background.
android android-app android-application androidx architecture-components carousel dagger2 diff jsoup kotlin kotlin-android livedata material-design meyer paging-library pdf room room-persistence-library viewmodel
Last synced: 24 Jun 2024
https://github.com/siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
anki chatgpt electron evernote knowledge-base local-first markdown note-taking notebook notes-app notion obsidian ocr openai pdf pkm s3 self-hosted webdav
Last synced: 24 Jun 2024
https://github.com/JonathanLink/PDFLayoutTextStripper
Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (from the Apache PDFBox library).
data-extraction extract java layout pdf pdfbox text
Last synced: 24 Jun 2024
https://github.com/skuola/pdf-text-parser
Library to parse XML resulting from pdftotext
Last synced: 24 Jun 2024
https://github.com/hy05190134/PdfTextExtract
used to parse pdf and extract text
Last synced: 24 Jun 2024
https://github.com/joseflaviojr/nheengatu
Produza e publique trabalhos com a linguagem popular Markdown. Escreva uma vez e crie livros digitais (e-book), artigos, trabalhos acadêmicos, relatórios, páginas web (site) --- PDF, EPUB, LaTeX, HTML, etc. --- Nheengatu é uma extensão da Pandoc.
epub html latex markdown pandoc pdf
Last synced: 24 Jun 2024
https://github.com/mgieseki/dvisvgm
A fast DVI, EPS, and PDF to SVG converter
dvi latex metafont pdf postscript svg tex vector-graphics xetex
Last synced: 24 Jun 2024
https://github.com/BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
alto alto-xml csharp docstrum document-layout-analysis hocr hocr-documents layout-analysis page-segmentation page-xml pdf pdfpig recursive-xy-cut table-extraction tei xy-cut xycut
Last synced: 24 Jun 2024
https://github.com/marcodebe/dicom-ecg-plot
Dicom ECG Viewer and Converter. Convert to PDF, PNG, JPG, SVG, ...
dicom ecg ecg-signal ecg-signal-python pdf python svg viewer wado waveform-visualizer
Last synced: 24 Jun 2024
https://github.com/cityssm/pdfFlattener
PDF Flattener - Secure PDF documents by making floating redactions and form entries permanent.
flatten java pdf pdf-flattener pdf-forms pdfbox redaction security
Last synced: 24 Jun 2024
https://github.com/weareprestatech/hotpdf
hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six
pdf python text-extraction text-search
Last synced: 23 Jun 2024
https://github.com/Akylas/OSS-DocumentScanner
Android document document scanning app
android document document-scan document-scan-to-text document-scanner document-scanner-app document-scanning image-processing opencv pdf scanner tesseract zxingcpp
Last synced: 22 Jun 2024
https://github.com/HashLoad/horse-octet-stream
octet-stream middleware for horse
horse horse-octet-stream middleware pdf stream
Last synced: 22 Jun 2024
https://github.com/EugenMayer/officeconverter
Convert file formats like docx, xlx to other formats like pdf, png - based on jodconverter and libreoffice
converter jodconverter libreoffice office pdf
Last synced: 22 Jun 2024
https://github.com/sinedied/backslide
:sweat_drops: CLI tool for making HTML presentations with Remark.js using Markdown
cli hacktoberfest html markdown nodejs pdf presentation remarkjs sass server slides slideshow utility
Last synced: 21 Jun 2024
https://github.com/clj-pdf/clj-pdf
PDF generation library for Clojure
clojure-library pdf pdf-generation
Last synced: 21 Jun 2024
https://github.com/ovvn/dom-to-pdf
Generates a printable paginated pdf from DOM node using HTML5 canvas and svg
Last synced: 21 Jun 2024
https://github.com/Yuras/pdf-toolbox
A collection of tools for processing PDF files in Haskell
Last synced: 20 Jun 2024
https://github.com/MatthiasValvekens/pyHanko
pyHanko: sign and stamp PDF files
digital-signature pades pdf pkcs11 signature
Last synced: 20 Jun 2024
https://github.com/mrzool/invoice-boilerplate
Simple automated LaTeX invoicing system
latex makefile pandoc pdf typesetting yaml
Last synced: 20 Jun 2024
https://github.com/mrzool/cv-boilerplate
Programmatic generation of high-quality CVs
latex makefile pandoc pdf typesetting yaml
Last synced: 20 Jun 2024
https://github.com/mrzool/letter-boilerplate
Finest letter typesetting from the command line
latex makefile markdown pandoc pdf typesetting xetex yaml
Last synced: 20 Jun 2024
https://github.com/maehr/academic-pandoc-template
Write beautifully typeset academic texts with distraction-free Markdown and Pandoc.
academic-publishing docx docx-generator github-action github-actions latex latex-template markdown markdown-converter pandoc pandoc-template pdf pdf-generator tectonic
Last synced: 20 Jun 2024
https://github.com/smddzcy/cv
A simple, modern CV / resume template.
beginner-project cv grunt html javascript pdf resume resume-creator resume-template
Last synced: 20 Jun 2024
https://github.com/HollowMan6/mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol. (用 Rust 编写的 mdBook 后端,基于headless chrome和Chrome开发工具协议生成PDF)
book chromium hacktoberfest mdbook pdf rust rust-lang
Last synced: 20 Jun 2024
https://github.com/exalted/PTShowcaseViewController
An initial implementation of a "showcase" view( controller) for iOS apps... Visualizes images, videos and PDF files beautifully! (by @pittleorg) [meta: image, photo, video, document, pdf, album, gallery, showcase, gallery, iOS, iPhone, iPad, component, library, viewer]
album document gallery image ios library pdf photo showcase video
Last synced: 20 Jun 2024
https://github.com/alhassy/ElispCheatSheet
Quick reference to the core language of Emacs ---Editor MACroS.
cheatsheet common-lisp emacs emacs-lisp html lisp pdf reference
Last synced: 19 Jun 2024
https://github.com/2mol/pboy
a small .pdf management tool with a command-line UI
command-line console-application haskell linux macos pdf productivity tui
Last synced: 19 Jun 2024
https://github.com/readium/mobile
📱 Readium Mobile is a toolkit for ebooks, audiobooks and comics written in Swift & Kotlin.
audiobooks ebooks kotlin opds pdf reading swift
Last synced: 19 Jun 2024
https://github.com/readium/swift-toolkit
A toolkit for ebooks, audiobooks and comics written in Swift
audiobook audiobook-player cbz ebook-reader epub ios opds opds-parser pdf pdf-viewer reader readium-lcp swift
Last synced: 19 Jun 2024
https://github.com/bepasty/bepasty-server
universal pastebin server
audio binary drag-and-drop flask highlight image pastebin pdf permissions python video
Last synced: 18 Jun 2024
https://github.com/SakhriHoussem/vue-resume
a live resume builder using VueJs
bootstrap bootstrap4 css3 fontawesome js markdown pdf resume resume-template simple-vue-validate vue vue-bootstrap vue-fontawesome vue-markdown vuejs
Last synced: 16 Jun 2024
https://github.com/alhassy/AgdaCheatSheet
Basics of the dependently-typed functional language Agda ^_^
agda cheatsheet emacs html pdf
Last synced: 16 Jun 2024
https://github.com/olivmonnier/moleculer-pdf
Moleculer service to generate PDF
microservices moleculer pdf puppeteer
Last synced: 15 Jun 2024
https://github.com/ismail-mebsout/Parsing-PDFs-using-YOLOV3
Parsing pdf tables using YOLOV3
Last synced: 15 Jun 2024