Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/salman-khan-mohammed/image-captioning-to-speech-generation
This project develops a web app integrating image captioning and text-to-speech features. Using CNNs for image feature extraction and GRUs for generating captions, users can upload images via a Streamlit interface. Captions are created and converted into speech using gTTS, providing a tool for visually impaired users and image-to-text conversion.
https://github.com/salman-khan-mohammed/image-captioning-to-speech-generation
captions cnn feature-extraction generation gru gtts image-processing image-recognition streamlit
Last synced: about 1 month ago
JSON representation
This project develops a web app integrating image captioning and text-to-speech features. Using CNNs for image feature extraction and GRUs for generating captions, users can upload images via a Streamlit interface. Captions are created and converted into speech using gTTS, providing a tool for visually impaired users and image-to-text conversion.
- Host: GitHub
- URL: https://github.com/salman-khan-mohammed/image-captioning-to-speech-generation
- Owner: Salman-Khan-Mohammed
- Created: 2024-09-15T18:50:46.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2024-09-15T19:17:38.000Z (2 months ago)
- Last Synced: 2024-10-12T23:42:37.364Z (about 1 month ago)
- Topics: captions, cnn, feature-extraction, generation, gru, gtts, image-processing, image-recognition, streamlit
- Language: Jupyter Notebook
- Homepage:
- Size: 1.01 GB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md