Projects in Awesome Lists tagged with pdfparser
A curated list of projects in awesome lists tagged with pdfparser .
https://github.com/lazyFrogLOL/llmdocparser
A package for parsing PDFs and analyzing their content using LLMs.
chunking document-analysis llm nlp ocr pdf-parser pdfparser rag text-chunking
Last synced: 01 Apr 2025
https://github.com/bobld/tabula-sharp
Extract tables from PDF files (port of tabula-java)
csharp dotnet extract extract-table extracting-tables extraction extraction-engine netstandard pdf-table-extract pdf-table-extraction pdfparser pdfpig pdfs table table-extraction tabula tabula-java tabula-sharp
Last synced: 15 May 2025
https://github.com/ashutoshvarma/pyxpdf
Fast and memory-efficient Python PDF Parser based on xpdf sources
cython pdf pdf-converter pdf-parser pdfparser pdftohtml pdftopng pdftotext python xpdf xpdf-reader
Last synced: 13 Jul 2025
https://github.com/bobld/camelot-sharp
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).
camelot camelot-sharp csharp dotnet extract-table extracting-tables extraction extraction-engine netstandard opencv pdf-table-extract pdf-table-extraction pdfparser pdfpig pdfs table table-extraction
Last synced: 14 Jun 2025
https://github.com/parzibyte/extraer-texto-imagenes-pdf-php
Ejemplos de uso de PdfParser para extraer texto e imágenes de un documento PDF con PHP
Last synced: 12 Apr 2025
https://github.com/l2ysho/afpp
A fast, efficient, and minimal PDF parser for Node.js. Zero bloat. One dependency. Production-ready.
pdf pdfjs pdfparser pdftoimage pdftoimg pdftotext
Last synced: 01 Feb 2026
https://github.com/jsmatias/aiod-paper-metadata-extractor
A python service to retrieve metadata extract keywords from scientific papers
Last synced: 27 Nov 2025
https://github.com/ethannschwartz/gpt-api
Node.js implementation of OpenAI's GPT API.
gpt nodejs openai-api pdfparser
Last synced: 02 Aug 2025
https://github.com/rayeesrather99/notomatic
An AI-driven web app that generates structured notes from uploaded syllabi using OpenAI's API. Built with React, Node.js, Express.js, and MongoDB, it offers customizable note formats, downloads, and user notifications. Future updates include collaborative notes and LMS integration
gemini-api javascript jwt-authentication multer nodejs pdfparser reactjs tailwindcss textextracting
Last synced: 22 Feb 2025