Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with pdf-files

A curated list of projects in awesome lists tagged with pdf-files .

https://github.com/pdf-rs/pdf

Rust library to read, manipulate and write PDF files.

pdf pdf-files rust

Last synced: 17 Dec 2024

https://github.com/adithya-s-k/marker-api

Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.

api fastapi marker pdf-converter pdf-files pdf-parser pdf-parsing rest-api

Last synced: 20 Dec 2024

https://github.com/boazsegev/combine_pdf

A Pure ruby library to merge PDF files, number pages and maybe more...

pdf pdf-files pdf-generation pdf-merge ruby

Last synced: 16 Dec 2024

https://github.com/unidoc/unidoc

This repository has moved! https://github.com/unidoc/unipdf

golang pdf pdf-files pdf-invoice pdf-library text-extraction unidoc

Last synced: 02 Nov 2024

https://github.com/krasjet/pdf.tocgen

A CLI toolset to generate table of contents for PDF files automatically.

cli pdf pdf-document pdf-files pymupdf scraping table-of-contents toc-generator

Last synced: 22 Dec 2024

https://github.com/Krasjet/pdf.tocgen

A CLI toolset to generate table of contents for PDF files automatically.

cli pdf pdf-document pdf-files pymupdf scraping table-of-contents toc-generator

Last synced: 19 Nov 2024

https://github.com/ropensci/pdftools

Text Extraction, Rendering and Converting of PDF Documents

pdf-files pdf-format pdftools poppler poppler-library r r-package rstats text-extraction

Last synced: 11 Nov 2024

https://github.com/podofo/podofo

A C++17 PDF manipulation library

cplusplus cpp pdf pdf-documents pdf-files pdf-generation

Last synced: 15 Dec 2024

https://github.com/benwiggy/PDFsuite

Python scripts, Automator Services and Quartz Filters for MacOS (OS X) that create, manipulate, and query PDF files

apple booklet macos pdf-files pdf-service python quartz-filters

Last synced: 30 Oct 2024

https://github.com/arminstraub/krop

A simple graphical tool to crop the pages of PDF files, written in Python/Qt

crop ebook-readers pdf-files

Last synced: 30 Oct 2024

https://github.com/mhucka/zowie

Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)

extended-attributes file-metadata finder macos metadata pdf pdf-files zotero zotero-api zotero-link

Last synced: 19 Dec 2024

https://github.com/muriloventuroso/pdftricks

A simple, efficient application for small manipulations in PDF files using Ghostscript.

appcenter elementaryos ghostscript pdf pdf-files

Last synced: 12 Dec 2024

https://github.com/R0Wi-DEV/workflow_ocr

This is a Nextcloud Workflow App which enables you to process files via OCR on serverside.

nextcloud nextcloud-workflow-ocr ocr pdf-files

Last synced: 06 Nov 2024

https://github.com/naivehobo/pdfviewer

PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.

pdf pdf-document pdf-document-processor pdf-files pdf-viewer tkinter tkinter-graphic-interface tkinter-gui tkinter-library tkinter-python

Last synced: 09 Nov 2024

https://github.com/hrbrmstr/pdfbox

📄◻️ Create, Maniuplate and Extract Data from PDF Files (R Apache PDFBox wrapper)

pdf-document pdf-files pdfbox pdfbox-wrapper r r-cyber rstats

Last synced: 28 Oct 2024

https://github.com/siddhantsadangi/pdf-workdesk

A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.

pdf pdf-document pdf-document-processor pdf-files pdf-viewer pdfkit python streamlit webapp

Last synced: 27 Oct 2024

https://github.com/gorgitko/molminer

Python library and command-line tool for extracting compounds from scientific literature. Written in Python.

chemical-entities chemspider extract-entities named-entity-recognition natural-language-processing ner ocsr pdf-files pubchem python

Last synced: 06 Nov 2024

https://github.com/mtgrosser/pdfunite

Merge PDF files with Ruby, no Java required

joining-pdf-files pdf pdf-files pdfunite ruby

Last synced: 07 Nov 2024

https://github.com/javiercanon/pdf-oeditor.js

📑 HTML5 Javascript Online Offline Browser Editor 📜 for PDF files for insert sign, images, photos, comments or annotations

asp-net-core html5 insert-sign javascript js mozilla-pdf net-core pdf pdf-document pdf-files pdf-oeditor pdf-sign pdf-signature pdf-signer pdf-viewer

Last synced: 16 Nov 2024

https://github.com/JavierCanon/PDF-OEditor.js

📑 HTML5 Javascript Online Offline Browser Editor 📜 for PDF files for insert sign, images, photos, comments or annotations

asp-net-core html5 insert-sign javascript js mozilla-pdf net-core pdf pdf-document pdf-files pdf-oeditor pdf-sign pdf-signature pdf-signer pdf-viewer

Last synced: 04 Dec 2024

https://github.com/cajuncoding/apachefop.serverless

A ready to use serverless implementation of Apache FOP via Azure Functions. This provides a micro-service for dynamically rendering quality PDF binary outputs from XSL-FO source using Apache FOP. When combined with the ease and simplicity of Azure Functions this project is a powerful, efficient, and scalable PDF Reporting Service that generates high quality, true paged media, reports for any environment and any client technology (.Net, NodeJS/JavaScript, Ruby, Mobile iOS/Android, Powershell, even Windows/Mac apps, etc.)!

apache-fop azure-functions fonet fop micro-service microservice pdf pdf-as-a-service pdf-files pdf-generation pdf-rendering pdf-templating reports reportserver reportservice serverless serverless-architectures xsl-fo xslfo xslt

Last synced: 28 Oct 2024

https://github.com/cajuncoding/pdftemplating.xslfo

This is a C# .NET solution that tests and illustrates the capabilities of Xsl-FO for dynamically generating PDF documents. It provides a way to generate the XSL-FO source xml from Xslt or Razor Templating to address various development teams. And it provides basic WinForms Application that can be used to aid in developing the Xslt and Xsl-FO markup.

apache-fop azure-functions csharp fonet fop java microservice pdf-files pdf-generation pdf-rendering pdf-reports pdf-templating razor-templates razor-views serverless template-engine xsl-fo xslfo xslt xslt-template

Last synced: 27 Oct 2024

https://github.com/wbthomason/pdf-scribe.nvim

Neovim plugin for importing annotations and metadata from PDFs

annotations luajit neovim neovim-plugin notes nvim pdf pdf-files pdf-scribe poppler

Last synced: 11 Oct 2024

https://github.com/bronson/pdfdir

Utilities to operate on lots of PDF files

pdf pdf-files

Last synced: 20 Dec 2024

https://github.com/kairavkkp/merge-pdf

My first PyPi Package. Merge Image and PDF files using customizations within a folder using the Command line.

epubs hacktoberfest hacktoberfest2021 merge-images merge-pdf merge-pdf-images pdf-files pdfs

Last synced: 27 Oct 2024

https://github.com/piomin/sample-jasperreport-boot

sample application that show how to generate large pdf files using spring-boot with jasperreports

jasperreports java pdf pdf-files pdf-generation performance spring-boot

Last synced: 27 Oct 2024

https://github.com/josee9988/compress-pdfs

A python CLI script to 𝗰𝗼𝗺𝗽𝗿𝗲𝘀𝘀 📦 all the 𝗣𝗗𝗙 files 𝗿𝗲𝗰𝘂𝗿𝘀𝗶𝘃𝗲𝗹𝘆 in a directory using the iLovePDF technology 🥰

compress compress-files compress-pdf compressed compression compressor compressors pdf pdf-compression pdf-converter pdf-document-processor pdf-files python python-3 python-compressing python-pdf python3 python3-script python3-scripts size-reduction

Last synced: 12 Oct 2024

https://github.com/george-gca/ai_papers_scrapper

Download papers pdfs and other info from main AI conferences

artificial-intelligence conferences pdf-files python scraping scrapy

Last synced: 28 Oct 2024

https://github.com/bbc-esq/pyside6_pdf_viewer

Simple PDF Viewer Using Pyside6 that can be run standalone or included in a larger program.

pdf pdf-document pdf-files pdf-viewer pdfjs pyside6

Last synced: 12 Nov 2024

https://github.com/orchetect/pdfgadget

Batch PDF operations for Swift

pdf pdf-document-processor pdf-files pdf-merger swift

Last synced: 20 Oct 2024

https://github.com/lstedmanfalls/translatedocument

TypeScript node project to translate documents and export translation

pdf-files translation typescript

Last synced: 13 Nov 2024

https://github.com/yifaneye/1pdf

💫 CLI tool for combining all PDF files in a directory into 1 PDF file 👉 sudo npm i -g 1pdf

combine merge pdf pdf-document pdf-files

Last synced: 07 Nov 2024

https://github.com/shosta/merge-pdfs

Merge Pdfs automatically in a folder based on the file names.

pdf-files python scanner utility

Last synced: 21 Dec 2024

https://github.com/jabalazs/pdfly

Basic CLI tool for extracting and merging PDFs

pdf pdf-files

Last synced: 09 Nov 2024

https://github.com/thomasvanholder/browserless

A Ruby wrapper for the Browserless PDF API with support for modern CSS such as TailwindCSS

pdf pdf-converter pdf-document pdf-document-processor pdf-files pdf-generation

Last synced: 10 Nov 2024

https://github.com/moindalvs/how_to_convert_pdf_to_docx_in_python

How to convert .pdf extension files into .docx file in python?

docx pdf pdf-converter pdf-files pdf2docx

Last synced: 17 Nov 2024

https://github.com/drmccoy/pdftextorizer

Interactively extract text from multi-column PDFs

gui pdf pdf-extractor pdf-files pdf2text pdftotext pyqt5 qt5

Last synced: 12 Oct 2024

https://github.com/jeff-tian/fc-doc-rotary

[doc-rotary](https://github.com/Jeff-Tian/doc-rotary) 的阿里云函数计算版本。

aliyun-fc aliyun-fun doc-giggle docker document fc function-compute libreoffice pdf pdf-converter pdf-document pdf-files pdf-generation python typescript

Last synced: 15 Nov 2024

https://github.com/parzibyte/extraer-texto-imagenes-pdf-php

Ejemplos de uso de PdfParser para extraer texto e imágenes de un documento PDF con PHP

pdf pdf-files pdfparser php

Last synced: 25 Nov 2024

https://github.com/kouisamine/compare-two-pdf-files

Comparing PDFs can be useful in various scenarios, such as document version control, content verification, and collaboration.

comparator compare compare-pdf comparison js online pdf pdf-comparison pdf-diff pdf-difference pdf-document pdf-files php script source-code tools

Last synced: 12 Nov 2024

https://github.com/tfawcett/rename_papers

rename_papers is a simple program to extract the title from a PDF file, manipulate it, and rename the file.

pdf-files python renamer

Last synced: 06 Nov 2024

https://github.com/katahiromz/python-cheatsheet-japanese

Pythonカンニングペーパー(日本語)

cheat-sheets cheatsheets japanese pdf-files python svg-files

Last synced: 13 Nov 2024

https://github.com/geo-y20/enhanced-learning-experience

IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.

chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding

Last synced: 19 Dec 2024

https://github.com/hugoamoraes/contadorpdf

A small system that counts the number of pages in a PDF

javascript pdf pdf-files

Last synced: 12 Nov 2024

https://github.com/omaxel/pdf-overlap

Overlaps two PDF files.

csharp itextsharp pdf pdf-files pdf-overlap

Last synced: 19 Nov 2024

https://github.com/ejdecena/my-technical-docs

My Technical Docs is a repository where additional material for technical computer learning is collected.

docs documentation pdf-document pdf-files

Last synced: 11 Dec 2024

https://github.com/feliwir/charta-pdf

A modern C++ library for processing PDF documents.

cpp17 pdf pdf-files pdf-generation

Last synced: 07 Nov 2024

https://github.com/secret-guest/font_lister

List the fonts that are installed on your computer as a PDF list with corresponding images.

font fonts front-end-development frontend list listes lists-python listsort otf otf-fonts pdf pdf-files pdf-generation police python ttf ttf-fonts

Last synced: 10 Nov 2024

https://github.com/madmax2506/pdf-tools

Operations to handle some operations on pdf files.

pdf pdf-files pdf-generation pdf-merge pdf-rotate pdf-split

Last synced: 11 Nov 2024

https://github.com/xxdashtixx/pdfmultitool

PDF Multi-Tool is a .NET Framework Windows Forms application written in C# that provides functionalities for PDF conversions using the GhostScript library, Supporting conversions between various file formats and offering a user-friendly interface for managing and configuring conversion tasks.

c-sharp csharp dotnet ghostscript graphical-user-interface gui open-source pdf pdf-converter pdf-document pdf-files pdf-generation pdf-viewer user-interface windows-forms

Last synced: 28 Nov 2024