https://github.com/muhammedhossam/asl-detection
Real-time American Sign Language (ASL) letters detection, via PyTorch, OpenCV, YOLOv11, Roboflow and LabelImg
https://github.com/muhammedhossam/asl-detection
asl-recognizer computer-vision deep-learning labelimg opencv python pythorch real-time torch yolo yolov11
Last synced: 2 months ago
JSON representation
Real-time American Sign Language (ASL) letters detection, via PyTorch, OpenCV, YOLOv11, Roboflow and LabelImg
- Host: GitHub
- URL: https://github.com/muhammedhossam/asl-detection
- Owner: muhammedhossam
- Created: 2024-12-09T08:08:51.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-12-09T08:33:08.000Z (10 months ago)
- Last Synced: 2025-02-24T07:16:22.111Z (7 months ago)
- Topics: asl-recognizer, computer-vision, deep-learning, labelimg, opencv, python, pythorch, real-time, torch, yolo, yolov11
- Language: Jupyter Notebook
- Homepage:
- Size: 41.4 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# American Sign Language (ASL) Detection Using YOLO-V11
> **"أما بعد، فإن خير الحديث كتاب الله، وخير الهديِ هديُ محمد، وشر الأمور محدَثاتها، وكل بدعة ضلالة"**
---

## **Objective**
This project aims to create a **real-time ASL detector** to assist with certain tasks by identifying and interpreting hand gestures in videos. While not designed for professional applications, this project serves as a foundational step toward leveraging computer vision for accessible communication.
---
## **Methodology**
1. **Model**: YOLO-V11 (Large Model)
- Selected for its cutting-edge performance in object detection tasks.
2. **Training Details**:
- **Epochs**: 150
- **Batch Size**: 16
- **Image Size**: 640 × 640
- **Classes**: 26 (One for each ASL alphabet)
- **Tracking**: Bounding Box3. **Dataset**:
- [Sign Language Dataset for YOLO](https://www.kaggle.com/datasets/daskoushik/sign-language-dataset-for-yolov7)
4. **Evaluation Metrics**:
- **Confusion Matrix**: Provides insight into classification accuracy across classes.
- **Performance Plots**: Tracks precision, recall, and mAP over training epochs.5. **Output**:
- Real-time detection of ASL gestures.
- Visualized predictions with bounding boxes and class labels.
- Evaluation graphs for precision and recall.---
## **Project Outputs**
### **Confusion Matrix**

### **Training Results**

---
## **Resources**
1. **Dataset**:
Download the training data from Kaggle: [Sign Language Dataset for YOLO](https://www.kaggle.com/datasets/daskoushik/sign-language-dataset-for-yolov7)2. **YOLO Documentation**:
Learn more about the YOLO-V11 model: [YOLO-V11 Docs](https://docs.ultralytics.com/ar/models/yolo11/)---
## **Next Steps**
To make the project even more comprehensive:
1. **Data Augmentation**: Apply techniques like rotation, scaling, and lighting changes to improve robustness.
2. **Hyperparameter Tuning**: Optimize parameters such as learning rate, momentum, and IoU thresholds.
3. **Real-Time Application**: Integrate the model into a live video pipeline using OpenCV or similar libraries.
4. **Deployment**: Host the model on edge devices or cloud services for practical use.
5. **Feedback Loop**: Incorporate user feedback for iterative improvements.---
## **Prayers for Success**
> **اللهم بارك لي في وقتي وأصلح لي شأني ولا تكلني إلى نفسي طرفة عين**
> **اللَّهُمَّ إِنِّى أَعُوذُ بِكَ مِنَ الْعَجْزِ وَالْكَسَلِ، وَالْجُبْنِ وَالْبُخْلِ، وَالْهَرَمِ وَعَذَابِ الْقَبْرِ، اللَّهُمَّ آتِ نَفْسِى تَقْوَاهَا، وَزَكِّهَا أَنْتَ خَيْرُ مَنْ زَكَّاهَا، أَنْتَ وَلِيُّهَا وَمَوْلاَهَا، اللَّهُمَّ إِنِّى أَعُوذُ بِكَ مِنْ عِلْمٍ لاَ يَنْفَعُ، وَمِنْ قَلْبٍ لاَ يَخْشَعُ، وَمِنْ نَفْسٍ لاَ تَشْبَعُ، وَمِنْ دَعْوَةٍ لاَ يُسْتَجَابُ لَهَا**