Projects in Awesome Lists tagged with content-indexing
A curated list of projects in awesome lists tagged with content-indexing.
https://github.com/fayazk/document-metadata-extractor
A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents. It saves the results to Excel for easy analysis and stores the raw responses as JSON files.
content-indexing data-extraction document-management document-processing docx-parser excel-export gemini-ai-project generative-ai json-output metadata-extraction nlp pdf-parser python-automation text-analysis
Last synced: 01 Apr 2025