https://github.com/fayazk/document-metadata-extractor
A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents. It saves the extracted metadata to an Excel file for analysis and stores the raw model responses as JSON files.
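The pipeline described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's actual code: the field names in `FIELDS`, the `ask_model` callable (a stand-in for the Gemini request), and the output layout are all hypothetical, and a CSV file stands in for the Excel export so the sketch needs only the standard library.

```python
import csv
import json
from pathlib import Path
from typing import Callable

# Hypothetical columns; the repository's real metadata schema is not shown here.
FIELDS = ["file", "title", "author", "summary"]
FENCE = "`" * 3  # models often wrap JSON replies in a Markdown code fence


def parse_response(raw: str) -> dict:
    """Parse a model reply as JSON, tolerating a code fence around it."""
    text = raw.strip()
    if text.startswith(FENCE):
        # Drop the opening fence line, then everything from the closing fence on.
        text = text.split("\n", 1)[1] if "\n" in text else ""
        text = text.rsplit(FENCE, 1)[0]
    parsed = json.loads(text)
    return parsed if isinstance(parsed, dict) else {}


def run(documents: dict[str, str], ask_model: Callable[[str], str], out_dir: Path) -> list[dict]:
    """Query the model per document, keep raw replies, and tabulate the metadata.

    `documents` maps a file name to its already-extracted text; `ask_model` is an
    assumed wrapper around whatever call sends that text to the model.
    """
    raw_dir = out_dir / "raw"
    raw_dir.mkdir(parents=True, exist_ok=True)
    rows = []
    for name, text in documents.items():
        raw = ask_model(text)
        # Store the untouched reply in a small JSON wrapper, one file per document.
        record = {"file": name, "raw_response": raw}
        (raw_dir / f"{Path(name).name}.json").write_text(json.dumps(record, indent=2), encoding="utf-8")
        try:
            meta = parse_response(raw)
        except ValueError:
            meta = {}  # unparseable reply: keep the row, leave the fields blank
        rows.append({"file": name, **{k: meta.get(k, "") for k in FIELDS if k != "file"}})
    # CSV stands in for the Excel export in this sketch.
    with open(out_dir / "metadata.csv", "w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)
    return rows
```

Passing the model call in as a function keeps the sketch runnable without an API key; in practice `ask_model` would wrap a request to Gemini, and the table would be written with an Excel-capable library rather than `csv`.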
- Host: GitHub
- URL: https://github.com/fayazk/document-metadata-extractor
- Owner: FayazK
- Created: 2025-03-20T10:59:54.000Z
- Default Branch: main
- Last Pushed: 2025-03-21T09:23:12.000Z
- Last Synced: 2025-03-21T10:39:12.738Z
- Topics: content-indexing, data-extraction, document-management, document-processing, docx-parser, excel-export, gemini-ai-project, generative-ai, json-output, metadata-extraction, nlp, pdf-parser, python-automation, text-analysis
- Language: Python
- Homepage: https://fayazk.com
- Size: 11.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0