https://github.com/vbprojects/powerpoint-search-engine
Uses pyTesseract, Spacy, ChatGPT, and Python to create an Search Engine for contents of multiple powerpoint presentations, returning the relevant slides of a query even if the text is images.
https://github.com/vbprojects/powerpoint-search-engine
Last synced: 2 months ago
JSON representation
Uses pyTesseract, Spacy, ChatGPT, and Python to create an Search Engine for contents of multiple powerpoint presentations, returning the relevant slides of a query even if the text is images.
- Host: GitHub
- URL: https://github.com/vbprojects/powerpoint-search-engine
- Owner: vbprojects
- Created: 2024-01-25T21:01:44.000Z (about 2 years ago)
- Default Branch: master
- Last Pushed: 2024-03-19T02:29:07.000Z (almost 2 years ago)
- Last Synced: 2025-03-12T13:11:58.200Z (11 months ago)
- Language: Jupyter Notebook
- Size: 751 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Replicating an old tool I made, for an example of it functioning look at the example notebook.