An open API service indexing awesome lists of open source software.

https://github.com/ay-ritesh/sign-language-recognition

Crafted a deep learning-based system for real-time sign language recognition, leveraging computer vision and neural networks. The system translates hand gestures into text to bridge communication gaps for the hearing-impaired.
https://github.com/ay-ritesh/sign-language-recognition

cnn keras nlp opencv python real-time-detection tenserflow yolov5

Last synced: about 1 month ago
JSON representation

Crafted a deep learning-based system for real-time sign language recognition, leveraging computer vision and neural networks. The system translates hand gestures into text to bridge communication gaps for the hearing-impaired.

Awesome Lists containing this project

README

          

# ๐ŸคŸ An Enhanced Deep Learning Approach for American Sign Language (ASL) Recognition

A real-time sign language recognition system using **YOLOv5 + CNN + OpenCV**, designed to translate hand gestures into readable text.
The project helps bridge the communication gap between **sign language users and verbal speakers**, promoting inclusivity in healthcare, education, and public services.

---

## ๐Ÿ‘จโ€๐Ÿ”ฌ Team Details

| Name | Reg. No. | Contribution |
|------|----------|--------------|
| Aayush Amritesh | 21BCE2331 | Data Preprocessing, Model Training & Testing, Evaluation |
| Dev Chandrakar | 21BCE2960 | Literature Survey, Dataset Collection, Documentation |
| Aditya Kumar Jha | 21BCE3759 | Model Selection, UI/Backend Integration |

๐Ÿง‘โ€๐Ÿซ **Guided by:** *Dr. Viswanathan A., Associate Professor (Sr.)*
School of Computer Science and Engineering, VIT

---

## ๐ŸŽฏ Objective

Develop a **real-time ASL Recognition System** that:
- Detects gestures using **YOLOv5**
- Classifies gestures using **CNN**
- Translates gestures to text through a simple user interface
- Supports lowโ€cost consumer hardware like webcams & laptops

---

## ๐Ÿ” Problem Motivation

Over **70M people** rely on sign language. Most verbal speakers donโ€™t understand it, causing a *communication barrier* in critical areas such as hospitals, classrooms, and public services.
Deep learning enables **accessible, affordable, and real-time solutions** for gesture recognition.

---

## ๐Ÿ—๏ธ System Architecture

**Modules:**
1. ๐Ÿ“ท *Data Acquisition* โ€“ Webcam or image input
2. ๐Ÿงน *Preprocessing (OpenCV)* โ€“ Background removal, segmentation
3. ๐ŸŽฏ *Gesture Detection (YOLOv5)* โ€“ Identify hand gesture
4. ๐Ÿง  *Classification (CNN)* โ€“ Recognize gesture category
5. ๐Ÿ’ฌ *Sign-to-Text Translation* โ€“ Output real-time prediction
6. ๐Ÿ–ฅ๏ธ *User Interface* โ€“ Minimal & user-friendly

---

## ๐Ÿงฐ Tools & Technologies

| Category | Tools |
|----------|-------|
| Programming | Python |
| Deep Learning | YOLOv5, CNN (TensorFlow/PyTorch) |
| Image Processing | OpenCV |
| UI/Development | Jupyter/Colab/VS Code |
| Hardware | Standard webcam, i5/8GB RAM laptop |

---

## ๐Ÿ“Š Dataset & Parameters

| Feature | Value |
|--------|--------|
| Image Size | 224ร—224 |
| Classes | 6 (Hello, Yes, No, Thanks, Please, I Love You) |
| Model | YOLOv5s + Custom CNN |
| Optimizer | Adam |
| Latency | **< 100ms real-time** |

---

## ๐Ÿš€ Result Summary

โœ” **Real-time gesture detection**
โœ” Works effectively in **low-light & diverse backgrounds**
โœ” Runs smoothly on **consumer-grade hardware**
โœ” Achieved **95% accuracy (static)** & **90% dynamic** in adequate lighting conditions

---

## ๐Ÿ Conclusion

The system successfully bridges the communication gap by converting American Sign Language (ASL) gestures to readable text in real time using a hybrid deep learning model (YOLOv5 + CNN). It is affordable, portable, and scalable, making it suitable for deployment in schools, hospitals, and public service centers. Future work includes sentence-level recognition, multi-language support, and mobile deployment.

---

## ๐Ÿ”ฎ Future Enhancements

- ๐Ÿ“Œ Support for complete sentence-level gesture recognition
- ๐ŸŒŽ Extension to multiple sign languages (e.g., ISL, BSL, etc.)
- ๐Ÿ“ฑ Mobile application deployment on Android/iOS
- ๐ŸŽ™๏ธ Voice output for recognized text (sign-to-speech)
- ๐Ÿง  Use of Transformer-based or sequence models for better temporal gesture understanding

---