https://github.com/dirkzon/gesture-recognition
Real-time gesture recognition using OpenCV & MediaPipe.
- Host: GitHub
- URL: https://github.com/dirkzon/gesture-recognition
- Owner: dirkzon
- License: mit
- Created: 2021-06-15T14:03:18.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2024-12-31T10:40:25.000Z (about 1 year ago)
- Last Synced: 2024-12-31T11:31:33.004Z (about 1 year ago)
- Topics: artificial-intelligence, gesture-recognition, mediapipe-hands, opencv-python, python, sklearn
- Language: Jupyter Notebook
- Homepage:
- Size: 25.5 MB
- Stars: 6
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# gesture recognition
Real-time gesture recognition using OpenCV & MediaPipe
## Contents
- [About](#about)
- [Features](#features)
- [Setup](#setup)
- [Prerequisites](#prerequisites)
- [Starting the final application](#starting-the-final-application)
- [Approach](#approach)
- [Dataset](#dataset)
- [Work with the data](#work-with-the-data)
- [Use your own data](#use-your-own-data)
- [License](#license)
## About
Made with Python for my fourth semester. The goal of this project was to create a real-time gesture recognition app that can recognize American Sign Language gestures. The [model](./model) this app uses can recognize the gestures for the digits '0' to '9'. The [notebook](Gesture%20recognition.ipynb) explains how this model was made.
## Features
- __real time:__ recognizes gestures in real time.
- __multiple hands:__ recognizes multiple hands at once.
- __ambidextrous:__ recognizes gestures on both the left and right hand.
- __debug:__ visualize the hand skeleton, confidence scores & more.
## Setup
### Prerequisites
- Camera/Webcam
- [Python](https://www.python.org/)
- [Jupyter](https://jupyter.org/install)
### Starting the final application
1. `cd` into the project folder.
```
cd gesture-recognition
```
2. Create and activate a new Python virtual environment.
```
python -m venv venv
```
On Windows:
```
venv\Scripts\activate
```
On Linux/macOS:
```
source venv/bin/activate
```
3. Install all requirements using the [requirements.txt](requirements.txt) file.
```
pip install -r requirements.txt
```
4. Run the [gesture.py](gesture.py) file (a sketch of the kind of loop it runs follows these steps).
```
python gesture.py
```
5. Close the application by pressing `q`, then deactivate the environment.
```
deactivate
```
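For reference, the sketch below shows the general shape of such a capture loop: read webcam frames with OpenCV, detect hand landmarks with MediaPipe Hands, draw them, and quit on `q`. This is a minimal illustration, not the repository's actual `gesture.py`, and it omits the gesture classification step.
```
import cv2
import mediapipe as mp

mp_hands = mp.solutions.hands
mp_draw = mp.solutions.drawing_utils

cap = cv2.VideoCapture(0)  # default webcam
with mp_hands.Hands(max_num_hands=2) as hands:
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB, OpenCV delivers BGR
        results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.multi_hand_landmarks:
            for landmarks in results.multi_hand_landmarks:
                mp_draw.draw_landmarks(frame, landmarks, mp_hands.HAND_CONNECTIONS)
        cv2.imshow("gesture recognition", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
cap.release()
cv2.destroyAllWindows()
```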
## Approach
The [approach](approach.pdf) document explains the scope of the project and my approach to different topics, like choosing a hand pose estimation model, getting a dataset and training a model. It also contains some recommendations for all these subjects.
## Dataset
The [Sign Language Digits Dataset](https://github.com/ardamavi/Sign-Language-Digits-Dataset) was used to train and test the model.
>*Mavi, A., (2020), “A New Dataset and Proposed Convolutional Neural Network Architecture for Classification of American Sign Language Digits”, arXiv:2011.08927 [cs.CV]*
This repository does __NOT__ contain this data, so if you want to process the images yourself you will have to download the dataset and put all of its subfolders into a folder called `images`.
The structure should look like this:
```
.
├── 📄 Gesture recognition.ipynb
└── 📁 images/
    ├── 📁 0/
    ├── 📁 1/
    ├── 📁 2/
    ├── 📁 3/
    ├── 📁 4/
    ├── 📁 5/
    └── ...
```
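Once the folders are in place, iterating over this layout is straightforward. A minimal sketch, where the variable names are illustrative assumptions and not taken from the notebook:
```
from pathlib import Path

# Pair every image in images/<digit>/ with its digit label.
samples = []
for digit_dir in sorted(Path("images").iterdir()):
    if digit_dir.is_dir():
        label = int(digit_dir.name)  # the folder name is the class
        for image_path in digit_dir.glob("*"):
            samples.append((image_path, label))

print(f"Found {len(samples)} labelled images")
```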
If you wish to use your own dataset, see [Use your own data](#use-your-own-data).
## Work with the data
If you want to work on pre-processing the dataset yourself, you can use the [notebook](Gesture%20recognition.ipynb). It is not necessary to process all the images yourself: the [`dataframes`](dataframes) directory contains both the raw points data and the pre-processed data.
The [gesture-points-raw](dataframes/gesture-points-raw.csv) dataframe is not cleaned and contains a number of missing values. It can be used if you want to try your own pre-processing technique, for example as sketched below.
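A minimal pre-processing pass with pandas might look like this; the `label` column name and the cleaning choices are assumptions, not the notebook's actual steps:
```
import pandas as pd

raw = pd.read_csv("dataframes/gesture-points-raw.csv")

# Assumed schema: one "label" column plus numeric landmark columns.
feature_cols = [c for c in raw.columns if c != "label"]

# Interpolate missing landmark coordinates, then drop rows that are
# still incomplete.
raw[feature_cols] = raw[feature_cols].interpolate()
raw = raw.dropna()

# Min-max normalize every feature column to the 0-1 range.
features = raw[feature_cols]
raw[feature_cols] = (features - features.min()) / (features.max() - features.min())
```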
The [gesture-points-processed](dataframes/gesture-points-processed.csv) dataframe is the pre-processed version of the raw dataframe: all missing values have been fixed, the data has been normalized and a flipped version of the dataframe has been appended. It can be used if you want to try your own modeling technique.
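As a starting point for your own modeling, something like the following could be trained on the processed dataframe with scikit-learn (again, the `label` column name is an assumption):
```
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

df = pd.read_csv("dataframes/gesture-points-processed.csv")
X = df.drop(columns=["label"])  # "label" is an assumed column name
y = df["label"]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

clf = RandomForestClassifier(random_state=42).fit(X_train, y_train)
print(f"Test accuracy: {clf.score(X_test, y_test):.2f}")
```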
## Use your own data
The [notebook](Gesture%20recognition.ipynb) should also work with other datasets of gesture images, but you have to make sure the images are put into a folder called `images`. The structure should look [the same](#dataset) as with the original dataset. The OpenPose hand model is __NOT__ included in this repository; it can be downloaded [here](https://www.kaggle.com/changethetuneman/openpose-model?select=pose_iter_102000.caffemodel) and needs to be put in the [`openpose`](openpose) directory.
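Loading a Caffe model like this with OpenCV's `dnn` module typically looks as follows. This is a hedged sketch: the prototxt file name is an assumption, while the weights file is the Kaggle download named above.
```
import cv2

# Load the hand pose network with OpenCV's dnn module.
net = cv2.dnn.readNetFromCaffe(
    "openpose/pose_deploy.prototxt",         # network definition (assumed name)
    "openpose/pose_iter_102000.caffemodel",  # downloaded weights
)
```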
There are some limitations for the dataset that need to be taken into account:
- The hand in the image must be larger than 60x60 pixels.
- Lower exposure images work better.
The [Dataset restrictions](Dataset%20restrictions.ipynb) notebook shows how I found these limitations. The [approach](approach.pdf) document also contains some more recommendations for the dataset.
## License
This software is licensed under the [MIT license](LICENSE).