https://github.com/undp-data/dsc-energy-text-extraction
A data pipeline that processes PDF and other documents to extract and clean text for the Energy Moonshot AI text database.
- Host: GitHub
- URL: https://github.com/undp-data/dsc-energy-text-extraction
- Owner: UNDP-Data
- Created: 2023-12-20T00:34:19.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-06-17T07:54:57.000Z (about 2 years ago)
- Last Synced: 2025-08-18T03:40:19.374Z (10 months ago)
- Language: Jupyter Notebook
- Size: 297 MB
- Stars: 2
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md