https://github.com/fayazk/document-metadata-extractor
A Python tool that uses Google's Gemini AI to extract structured metadata from PDF and DOCX documents. It saves the results to an Excel file for analysis and stores the raw model responses as JSON files.
- Host: GitHub
- URL: https://github.com/fayazk/document-metadata-extractor
- Owner: FayazK
- Created: 2025-03-20T10:59:54.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-21T09:23:12.000Z (about 1 year ago)
- Last Synced: 2025-03-21T10:39:12.738Z (about 1 year ago)
- Topics: content-indexing, data-extraction, document-management, document-processing, docx-parser, excel-export, gemini-ai-project, generative-ai, json-output, metadata-extraction, nlp, pdf-parser, python-automation, text-analysis
- Language: Python
- Homepage: https://fayazk.com
- Size: 11.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0