https://github.com/bhoomikaniranjan/depiction-of-image-features-with-audio-to-aid-visually-impaired-persons
This project transforms visual content into vivid audio narratives for visually impaired individuals. Using advanced image recognition and text-to-speech technologies, it generates detailed captions and provides audio output in English, Kannada, and Hindi, fostering inclusivity and independence.
https://github.com/bhoomikaniranjan/depiction-of-image-features-with-audio-to-aid-visually-impaired-persons
image-captioning language-translation pillow python python-project
Last synced: 4 months ago
JSON representation
This project transforms visual content into vivid audio narratives for visually impaired individuals. Using advanced image recognition and text-to-speech technologies, it generates detailed captions and provides audio output in English, Kannada, and Hindi, fostering inclusivity and independence.
- Host: GitHub
- URL: https://github.com/bhoomikaniranjan/depiction-of-image-features-with-audio-to-aid-visually-impaired-persons
- Owner: bhoomikaniranjan
- Created: 2025-01-20T15:36:12.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2025-05-06T11:20:26.000Z (6 months ago)
- Last Synced: 2025-05-06T12:44:58.755Z (6 months ago)
- Topics: image-captioning, language-translation, pillow, python, python-project
- Language: Python
- Homepage:
- Size: 8.79 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Depiction of image features with audio to aid visually impaired person🖼️🔊
## About
This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.
## 🚀Key Features
Image Captioning: Generate meaningful captions based on the content of images.
Language Translation: Translate captions from English to Kannada and Hindi.
Speech Conversion: Convert captions to audio files using gTTS for ease of access.
Multi-modal Application: Supports both visual and auditory outputs for different use cases.
## 🔍💡Use Cases
Accessibility Aid: Helps visually impaired users by describing images via audio.
Language Learning Tool: Supports language translation for educational purposes.
Interactive Learning: Enhances digital learning tools with multi-language support.
🎯 The goal of this project is to:
1. **Generate meaningful captions** for images.
2. **Translate captions** into regional languages (English,Kannada & Hindi).
3. **Convert captions to audio** for accessibility.
## This tool can be useful in various applications such as:
- Assisting visually impaired individuals with image descriptions.
- Learning language translations through images.
- Enhancing interactive educational tools.
## 📸Outputs

