Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/intellisense/tc_pdf

Thumbor PDF Preview

pdf pdf-preview python thumbor

Last synced: 03 Jul 2024

https://github.com/huysentruitw/print-it

Windows service for printing PDF files to a local or network printer in the background

api api-service background pdf pdfium printing windows-service

Last synced: 03 Jul 2024

https://github.com/pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps

Last synced: 03 Jul 2024

https://github.com/RosiePuddles/mdbook-compress

An mdBook single PDF generator using pure Rust and some Node.js

mdbook pdf pdf-generation rust

Last synced: 02 Jul 2024

https://github.com/BetaHuhn/vercel-pdf-converter

📄▲ Vercel function which generates PDFs from Webpages.

pdf pdf-converter vercel vercel-pdf-converter vercel-serverless-functions

Last synced: 02 Jul 2024

https://github.com/RD17/ambar

:mag: Ambar: Document Search Engine

ambar ambar-search ocr pdf search search-engine search-in-text self-hosted

Last synced: 02 Jul 2024

https://github.com/camelot-dev/excalibur

A web interface to extract tabular data from PDFs

extract for-humans pdf table

Last synced: 02 Jul 2024

https://github.com/AmirTugi/tea-school

Simplified HTML + CSS --> PDF Generator for Nodejs

css html nodejs pdf pdf-generation puppeteer

Last synced: 02 Jul 2024

https://github.com/schibsted/jsx-pdf

Generate PDFs using JSX! 🎯

jsx pdf pdf-generation pdfmake

Last synced: 02 Jul 2024

https://github.com/datasciencecampus/mobius

Scripts to extract data from the COVID-19 Google Community Mobility Reports

covid dsc-projects google mobility pdf python svg

Last synced: 01 Jul 2024

https://github.com/sainnhe/caj2pdf-qt

CAJ 转 PDF 转换器(GUI 版本)

caj pdf

Last synced: 01 Jul 2024

https://github.com/MSzturc/obsidian-better-pdf-plugin

Goal of this Plugin in to implement a native PDF handling workflow in Obsidian

obsidian-md obsidian-plugin pdf

Last synced: 29 Jun 2024

https://github.com/typst/svg2pdf

Converts SVG files to PDF.

pdf svg

Last synced: 29 Jun 2024

https://github.com/schild-report/schild.report

Desktop Anwendung zum Erstellen von Schild-Reports. Installationsdateien unter 'releases'.

dokument electron nrw pdf reporting reporting-engine reporting-tool schild schule svelte svelte-components zeugnisse

Last synced: 29 Jun 2024

https://github.com/danburzo/percollate

A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.

cli epub html markdown pdf puppeteer readability

Last synced: 29 Jun 2024

https://github.com/nuhmanpk/pyDF-Bot

Pydf - Pyrogram Document File Bot, a modular Telegram Bot which provides Pdf Tools Works using Pypdf2

bot pdf pypdf pypdf2 pypdf2-lib pyrogram pyrogram-bot telegram telegram-bot tools

Last synced: 28 Jun 2024

https://github.com/m4mallu/compress-pdf-tg-bot

A Telegram bot can compress the size of PDF documents. Useful for digital documentations.

compress pdf telegram-bot

Last synced: 27 Jun 2024

https://github.com/marcbachmann/node-html-pdf

This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.

html pdf pdf-converter phantomjs

Last synced: 27 Jun 2024

https://github.com/humu2009/candy-pdf-reader

Native PDF reader based on pdf.js and node-webkit.

node-webkit pdf

Last synced: 27 Jun 2024

https://github.com/Helias/EPUB-to-PDF

Telegram bot EPUB to PDF converter

converter epub pdf python-telegram-bot telegram telegram-bot

Last synced: 27 Jun 2024

https://github.com/kiwi0fruit/pyppdf

Pyppeteer PDF. Print html sites and files to pdf via pyppeteer (uses patched pyppeteer that by default downloads updated Chromium revision via https with certifi).

html-to-pdf html2pdf pdf puppeteer pyppeteer python

Last synced: 27 Jun 2024

https://github.com/Mathpix/mpx-cli

CLI for document conversion for scientific documents, powered by Mathpix OCR

cli converter document-conversion format-converter html latex markdown pdf science static-site-generator

Last synced: 27 Jun 2024

https://github.com/tensult/cloud-reports

Scans your AWS cloud resources and generates reports. Check out free hosted version:

analyzer aws best-practices cloud cloud-computing html json pdf puppeteer reports scans security

Last synced: 27 Jun 2024

https://github.com/venthur/gscholar

Query Google Scholar with Python

bibtex cli google-scholar gscholar pdf python

Last synced: 27 Jun 2024

https://github.com/ECNUdase/pdf

数据科学方向 课件&资料

pdf ppt

Last synced: 26 Jun 2024

https://github.com/zhaoterryy/mkdocs-pdf-export-plugin

An MkDocs plugin to export content pages as PDF files

mkdocs pdf

Last synced: 26 Jun 2024

https://github.com/jstockwin/py-pdf-parser

A Python tool to help extracting information from structured PDFs.

parsing pdf pdf-parsing py-pdf-parser

Last synced: 26 Jun 2024

https://github.com/Krasjet/pdf.tocgen

A CLI toolset to generate table of contents for PDF files automatically.

cli pdf pdf-document pdf-files pymupdf scraping table-of-contents toc-generator

Last synced: 26 Jun 2024

https://github.com/paulocoutinhox/pdfium-lib

PDFium - Project to compile PDFium library to multiple platforms.

chromium cpp google library pdf pdfium webassembly

Last synced: 26 Jun 2024

https://github.com/HendrixString/Android-PdfMyXml

Convert android XML layouts into PDF document, works on all versions of Android.

android android-pdfmyxml pdf pdf-generation

Last synced: 25 Jun 2024

https://github.com/baskerville/plato

Document reader

djvu e-ink e-reader epub kobo pdf

Last synced: 25 Jun 2024

https://github.com/thomasWeise/bookbuildeR

An R Package for Building Books or Documents using pandoc

automation azw3 calibre continuous-integration docker ebooks epub3 html5 latex markdown pandoc pdf r texlive

Last synced: 25 Jun 2024

https://github.com/atlanhq/camelot

Camelot: PDF Table Extraction for Humans

extract for-humans pdf table

Last synced: 25 Jun 2024

https://github.com/mkb79/audible-cli

A command line interface for audible package. With the cli you can download your Audible books, cover, chapter files.

aax aaxc chapter cover download pdf profiles python subcommands toml

Last synced: 24 Jun 2024

https://github.com/WZBSocialScienceCenter/pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

data-mining image-processing ocr pdf python tables

Last synced: 24 Jun 2024

https://github.com/thomasWeise/bookbuilderpy

A python-based environment for the automated compilation of books from markdown.

e-books html latex markdown pandoc pdf python

Last synced: 24 Jun 2024

https://github.com/mhucka/zowie

Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)

extended-attributes file-metadata finder macos metadata pdf pdf-files zotero zotero-api zotero-link

Last synced: 24 Jun 2024

https://github.com/tobya/DocTo

Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF

conversion converter csv csv-export delphi delphi10 docto docx excel pdf pdf-generation windows word-documents xls xls-files xlsx

Last synced: 24 Jun 2024

https://github.com/sile-typesetter/casile

The CaSILE toolkit, a book publishing workflow employing SILE and other wizardry.

books casile ebook-generator epub-generation kindlegen pandoc pdf pdf-generation publishing sile typesetting typesetting-system

Last synced: 24 Jun 2024

https://github.com/jp-fosterson/pandoc-novel

Markdown text to a novel in ePub and PDF.

epub latex markdown pandoc pdf writing writing-tool

Last synced: 24 Jun 2024

https://github.com/koreader/koreader-base

Base framework offering a Lua scriptable environment for creating document readers

djvu emulator epub ffi koreader leptonica lua luajit mupdf pdf sdl tesseract ubuntu

Last synced: 24 Jun 2024

https://github.com/booktype/Booktype

Booktype is a free, open source platform that produces beautiful, engaging books formatted for print, Amazon, iBooks and almost any ereader within minutes.

books booktype django ebooks editing epub ibooks javascript pdf pdf-converter python restful

Last synced: 24 Jun 2024

https://github.com/siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

anki chatgpt electron evernote knowledge-base local-first markdown note-taking notebook notes-app notion obsidian ocr openai pdf pkm s3 self-hosted webdav

Last synced: 24 Jun 2024

https://github.com/JonathanLink/PDFLayoutTextStripper

Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (from the Apache PDFBox library).

data-extraction extract java layout pdf pdfbox text

Last synced: 24 Jun 2024

https://github.com/aesiniath/publish

Publishing tools for papers, books, and presentations

haskell latex markdown pdf rendering

Last synced: 24 Jun 2024

https://github.com/skuola/pdf-text-parser

Library to parse XML resulting from pdftotext

pdf php

Last synced: 24 Jun 2024

https://github.com/hy05190134/PdfTextExtract

used to parse pdf and extract text

golang pdf

Last synced: 24 Jun 2024

https://github.com/bastibense/libharu_ng

Generate PDFs using libharu from Rust

generator haru libharu pdf rust wrapper

Last synced: 24 Jun 2024

https://github.com/wikiti/pandoc-book-template

A simple Pandoc template to build documents and ebooks.

ebook epub html markdown pandoc pdf template

Last synced: 24 Jun 2024

https://github.com/joseflaviojr/nheengatu

Produza e publique trabalhos com a linguagem popular Markdown. Escreva uma vez e crie livros digitais (e-book), artigos, trabalhos acadêmicos, relatórios, páginas web (site) --- PDF, EPUB, LaTeX, HTML, etc. --- Nheengatu é uma extensão da Pandoc.

epub html latex markdown pandoc pdf

Last synced: 24 Jun 2024

https://github.com/mgieseki/dvisvgm

A fast DVI, EPS, and PDF to SVG converter

dvi latex metafont pdf postscript svg tex vector-graphics xetex

Last synced: 24 Jun 2024

https://github.com/marcodebe/dicom-ecg-plot

Dicom ECG Viewer and Converter. Convert to PDF, PNG, JPG, SVG, ...

dicom ecg ecg-signal ecg-signal-python pdf python svg viewer wado waveform-visualizer

Last synced: 24 Jun 2024

https://github.com/cityssm/pdfFlattener

PDF Flattener - Secure PDF documents by making floating redactions and form entries permanent.

flatten java pdf pdf-flattener pdf-forms pdfbox redaction security

Last synced: 24 Jun 2024

https://github.com/weareprestatech/hotpdf

hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six

pdf python text-extraction text-search

Last synced: 23 Jun 2024

https://github.com/HashLoad/horse-octet-stream

octet-stream middleware for horse

horse horse-octet-stream middleware pdf stream

Last synced: 22 Jun 2024

https://github.com/EugenMayer/officeconverter

Convert file formats like docx, xlx to other formats like pdf, png - based on jodconverter and libreoffice

converter jodconverter libreoffice office pdf

Last synced: 22 Jun 2024

https://github.com/hadyang/interview

Java 笔试、面试 知识整理

android gitbook java pdf

Last synced: 21 Jun 2024

https://github.com/sinedied/backslide

:sweat_drops: CLI tool for making HTML presentations with Remark.js using Markdown

cli hacktoberfest html markdown nodejs pdf presentation remarkjs sass server slides slideshow utility

Last synced: 21 Jun 2024

https://github.com/Belval/pdf2image

A python module that wraps the pdftoppm utility to convert PDF to PIL Image object

convert pdf pil pil-image poppler

Last synced: 21 Jun 2024

https://github.com/clj-pdf/clj-pdf

PDF generation library for Clojure

clojure-library pdf pdf-generation

Last synced: 21 Jun 2024

https://github.com/ovvn/dom-to-pdf

Generates a printable paginated pdf from DOM node using HTML5 canvas and svg

dom html javascript pdf

Last synced: 21 Jun 2024

https://github.com/ehamberg/typeclassopedia-md

Markdown version of the Haskell Typeclassopedia

book epub haskell pandoc pdf

Last synced: 20 Jun 2024

https://github.com/Yuras/pdf-toolbox

A collection of tools for processing PDF files in Haskell

haskell pdf

Last synced: 20 Jun 2024

https://github.com/MatthiasValvekens/pyHanko

pyHanko: sign and stamp PDF files

digital-signature pades pdf pkcs11 signature

Last synced: 20 Jun 2024

https://github.com/mrzool/invoice-boilerplate

Simple automated LaTeX invoicing system

latex makefile pandoc pdf typesetting yaml

Last synced: 20 Jun 2024

https://github.com/mrzool/cv-boilerplate

Programmatic generation of high-quality CVs

latex makefile pandoc pdf typesetting yaml

Last synced: 20 Jun 2024

https://github.com/mrzool/letter-boilerplate

Finest letter typesetting from the command line

latex makefile markdown pandoc pdf typesetting xetex yaml

Last synced: 20 Jun 2024

https://github.com/smddzcy/cv

A simple, modern CV / resume template.

beginner-project cv grunt html javascript pdf resume resume-creator resume-template

Last synced: 20 Jun 2024

https://github.com/HollowMan6/mdbook-pdf

A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol. (用 Rust 编写的 mdBook 后端,基于headless chrome和Chrome开发工具协议生成PDF)

book chromium hacktoberfest mdbook pdf rust rust-lang

Last synced: 20 Jun 2024

https://github.com/exalted/PTShowcaseViewController

An initial implementation of a "showcase" view( controller) for iOS apps... Visualizes images, videos and PDF files beautifully! (by @pittleorg) [meta: image, photo, video, document, pdf, album, gallery, showcase, gallery, iOS, iPhone, iPad, component, library, viewer]

album document gallery image ios library pdf photo showcase video

Last synced: 20 Jun 2024

https://github.com/ddzy/fe-necessary-book

:bowtie: A book and software collection about frontend

computer css frontend healthy html http js pdf programmer software

Last synced: 20 Jun 2024

https://github.com/Kozea/CairoSVG

Convert your vector images

cairo converter pdf png python svg

Last synced: 19 Jun 2024

https://github.com/alhassy/ElispCheatSheet

Quick reference to the core language of Emacs ---Editor MACroS.

cheatsheet common-lisp emacs emacs-lisp html lisp pdf reference

Last synced: 19 Jun 2024

https://github.com/2mol/pboy

a small .pdf management tool with a command-line UI

command-line console-application haskell linux macos pdf productivity tui

Last synced: 19 Jun 2024

https://github.com/readium/mobile

📱 Readium Mobile is a toolkit for ebooks, audiobooks and comics written in Swift & Kotlin.

audiobooks ebooks kotlin opds pdf reading swift

Last synced: 19 Jun 2024

https://github.com/readium/kotlin-toolkit

A toolkit for ebooks, audiobooks and comics written in Kotlin

android audiobook cbz epub kotlin pdf reader

Last synced: 19 Jun 2024

https://github.com/readium/swift-toolkit

A toolkit for ebooks, audiobooks and comics written in Swift

audiobook audiobook-player cbz ebook-reader epub ios opds opds-parser pdf pdf-viewer reader readium-lcp swift

Last synced: 19 Jun 2024

https://github.com/yezz123/My-Business

Business management tool featuring accounts, invoices, partners, projects, and server 🦄

application buisness certbot clipboard django fqdn framework gunicorn gunicorn-socket invoices nginx partners pdf python python3

Last synced: 16 Jun 2024

https://github.com/JakubMelka/PDF4QT

Open source PDF editor.

pdf pdf-editor pdf-viewer

Last synced: 16 Jun 2024

https://github.com/alhassy/AgdaCheatSheet

Basics of the dependently-typed functional language Agda ^_^

agda cheatsheet emacs html pdf

Last synced: 16 Jun 2024

https://github.com/olivmonnier/moleculer-pdf

Moleculer service to generate PDF

microservices moleculer pdf puppeteer

Last synced: 15 Jun 2024

https://github.com/ismail-mebsout/Parsing-PDFs-using-YOLOV3

Parsing pdf tables using YOLOV3

computer-vision pdf python

Last synced: 15 Jun 2024