An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with content-indexing

A curated list of projects in awesome lists tagged with content-indexing .

https://github.com/fayazk/document-metadata-extractor

A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents, saving results to Excel for easy analysis and organizing raw responses as JSON files.

content-indexing data-extraction document-management document-processing docx-parser excel-export gemini-ai-project generative-ai json-output metadata-extraction nlp pdf-parser python-automation text-analysis

Last synced: 01 Apr 2025