Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mateusaquino/subvocalization-emg

🧠 Project for recording and training subvocalization EMG data with the Cyton Board.
https://github.com/mateusaquino/subvocalization-emg

emg openbci python subvocalization

Last synced: about 2 months ago
JSON representation

🧠 Project for recording and training subvocalization EMG data with the Cyton Board.

Awesome Lists containing this project

README

        







Subvocalization EMG


Project for recording and training subvocalization EMG data with the Cyton Board.

by Mateus de Aquino Batista for the Bachelor's Degree Final Project.











## đź“„ Abstract

> Dysarthria is a change in the normal pronunciation of words, usually caused by neurological disturbances. In advanced cases, when speech therapy is not enough to enable communication in a practical way, a silent speech interface can be used to carry out a conversation with a smaller vocabulary, through the identification of subtle movements in speech captured by an electromyography device of surface.

> The present study deals with an instrumentation project of a low-cost silent speech interface and the development of a neural network to identify subvocalized words together plus an experimental case study for the equipment validation. An OpenBCI electromyography modular board will be used to read muscle activity data from the face, and these data will be later on processed, and classified by a convolutional neural network for the identification of subvocalized words.

> Regarding the validation, the success rates of the neural network of the silent speech interface will be used to evaluate the general and individual performance for each participant. A hit rate between 90 and 95% is expected from the general validation.

> Henceforth, with positive results coming from the interface, more words can be added in the advance, in order to guarantee greater usefulness in the daily lives of patients with communication difficulties.

## 🔧 Hardware Requirements

Cyton Biosensing Board

This project requires setting up a [Cyton Biosensing Board (8-Channels)](https://shop.openbci.com/products/cyton-biosensing-board-8-channel), which is a neural interface board developed by OpenBCI with a 32-bit processor that can be used to sample EEG, EMG, and ECG activity. Follow the starter guide to make sure you get it right.

- [Getting Starter Guide](https://docs.openbci.com/GettingStarted/Boards/CytonGS/)

This project also allows using the Synthetic board as a mock to the Cyton board. However, as it generates random data, training and validation of the Neural Networks with the `Synthetic` option will not work.

## :warning: Important

This study is currently underway, and as such, the findings outlined in this article are preliminary and subject to change. The ongoing development phase of the project means that its present iteration is intended solely for research purposes. If you're interested, you can explore the progress in the [Papers](#-papers) section (Portuguese).

## 🚀 Setup

After setting up your Cyton Board, you'll need to install the package dependencies:

```bash
python -m venv .venv # optional: install requirements into a virtual env
source .venv/bin/activate # optional: activate virtual env
pip install -r requirements.txt
```

Once you're done simply run:

```py
python ./start.py
```

It should be accessible at [`localhost:8000`](http://localhost:8000). In case the Cyton dongle is not available **you might need to run with administrator privileges**.

## 🧠 How to use

The main page includes the Time Series (filtered) for all 8 channels, you can also see some logging information and access to the board session on top of the page. Once the session is started you'll have access to the Recording tab, a page to setup the words and amount of information you'll want to train later. Note that all the default existing words are currently hardcoded into the HTML file, but they can be changed anytime:

| EMG Tab | Recording Tab |
| :---------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------------------------------: |
| ![EMG Tab](https://user-images.githubusercontent.com/16140783/218353244-99ca24d5-75c7-496a-9e3e-63f4a62a2ae9.png) | ![Recording Tab](https://user-images.githubusercontent.com/16140783/218360435-4d74c0dc-af67-4815-9a15-9b2f6b2efbb8.png) |

After recording your first session (automatically saved as a csv file), the Neural Network tab will be available for training. This is where you include all recordings and setup all the training configs. Once started, you can check the training progress in real time. After the training is complete, you'll have access to the Evaluation tab, where you can test the predicting capability of the models you've trained.

| Neural Network Tab | Evaluator Tab |
| :--------------------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------------------------------: |
| ![Neural Network Tab](https://user-images.githubusercontent.com/16140783/218360970-d672cc7c-84f6-45b2-b0d8-07ab01c3c57f.png) | ![Evaluator Tab](https://user-images.githubusercontent.com/16140783/218363114-6beeef22-7dee-4b15-8b75-3813ef2cd1fe.png) |

## đź“Ś Electrodes Placement

The montage was defined using [MIT's preliminary study](https://www.semanticscholar.org/paper/AlterEgo%3A-A-Personalized-Wearable-Silent-Speech-Kapur-Kapur/9e2af148acbf7d4623ca8a946be089a774ce5258) about top muscle regions evaluated in a pilot user study:

Electrodes placement schema on Cyton Board
:-------------------------:
![image](https://github.com/MateusAquino/subvocalization-emg/assets/16140783/054e7d42-3d2f-4a5b-b91d-b085af1cd919)

| Region | Color | Pin |
| :-----------------: | :----: | :---------: |
| Earlobe | Black | BIAS |
| Mental | Yellow | N3P (upper) |
| Inner laryngeal | Blue | N1P (upper) |
| Outer laryngeal | Red | N1P (lower) |
| Hyoid | Green | N3P (lower) |
| Inner infra-orbital | Purple | N2P (upper) |
| Outer infra-orbital | Brown | N4P |
| Buccal | Orange | N2P (lower) |

Selection of the final electrode target areas through feature selection on muscular areas of interest
:-------------------------:
![image](https://github.com/MateusAquino/subvocalization-emg/assets/16140783/21a0c13a-d37e-4e36-b32a-07e4ff094187)
Arnav Kapur, S. Kapur, P. Maes, et al. (2018)

## đź“» Proof of Concept (PoC)

The PoC containing the steps of processing and training can be accessed by [Complete processing.ipynb](./Proof%20of%20Concept/Complete%20processing.ipynb). This Jupyter Notebook has all the important pieces of code to reproduce the experiment, and also some visual graphs for a better understanding.

| Synthetic 8-Channels Input | Words visualization |
| :----------------------------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------------------------------------: |
| ![Synthetic 8-Channels Input](https://user-images.githubusercontent.com/16140783/218466330-a7a2dcd3-b697-4bd3-943f-6236011e74d7.png) | ![Words visualization](https://user-images.githubusercontent.com/16140783/218466062-cf9fd2a4-2335-4acd-87df-b1b78e89fd98.png) |

If you want to see the PoC with public EMG data instead, you can check [Public data.ipynb](./Proof%20of%20Concept/Public%20data.ipynb), processing a public [EMG hand gesture dataset](https://zenodo.org/record/7668251).

Also, if you want to run the ipynb notebook in a virtualenv, make sure you setup jupyter correctly:

```bash
source .venv/bin/activate
python -m pip install ipykernel # install ipykernel / jupyter in the venv if not present
python -m ipykernel install --user --name=venv # self-install
# > Then, open Jupyter Notebook and select venv in "Switch kernel" option
```

## đź“š Papers

- [x] [**_Desenvolvimento de um software para gravação e processamento de dados de eletromiografia para reconhecimento de comandos e termos_**](https://ctic.univap.br/ctic/qualivitae/resumos/anais_qualivitae_2023.pdf#page=28):
- This paper was showcased during the [XXII Congresso SaĂşde e Qualidade de Vida - Qualivitae](https://ctic.univap.br/ctic/qualivitae/index.php) (2023), marking the initial phase of our research. The primary objective of this phase was the creation of a customizable backend and frontend capable of capturing and analyzing EMG data. It's important to note that this paper exclusively employed synthetic data; no human data was involved in this particular study.
- [x] [**_Desenvolvimento de uma interface de fala silenciosa utilizando deep learning e emg no processamento de subvocalização_**](https://www.inicepg.univap.br/cd/INIC_2023/anais/arquivos/RE_0007_0371_01.pdf)
- This paper is currently in progress and is scheduled for presentation at the [XXVII Encontro Latino Americano de Iniciação Científica](https://www.inicepg.univap.br/home). The main objective of this endeavor is to employ the identical processing algorithms and neural network that were developed during the previous research, this time using publicly available EMG data.

This marks the concluding phase of our research. For this publicly available repository, we included data from human participants and shared the electrodes placement, as we have obtained approval from Brazil's Ethics Committee (CAAE: 65587722.5.0000.5503, Parecer 6112574).

You can view the processing and results obtained from the human sEMG in the third PoC file: [Subvocalization.ipynb](./Proof%20of%20Concept/Subvocalization.ipynb).

## đź“Š EMG Folder Structure

The dataset folder `/saves` is where all the EMG data is stored by default in the application, however this folder was organized to include the data from all 10 participants of the study. The all sessions recorded from all participants, it is grouped by: `Participants Code` -> `Speech style` -> `Words`.

The Speech style could be any of:

- **F**: Normal speech ("Fala")
- **A**: Lip articulation ("Articulação labial")
- **S**: Subvocalization ("Subvocalização")

The four possible words selected in this study was: Yes ("Sim"), No ("NĂŁo"), Maybe ("Talvez") and Silence, whereas the silence itself is stored inside the folders between the words.

## đź“ť Conclusion

The results demonstrate that this processing and training method is effective for detecting strong gestures, such as speech and articulation. However, for more subtle muscle movements, such as subvocalization, further improvements in noise reduction and data augmentation are necessary.

It is important to note that the same methods, montage, and algorithm yielded varying results across different participants. Some exhibited lower accuracy in detecting easily recognizable speech, while others achieved higher accuracy in identifying less discernible subvocalizations.

## đź“ś License

All source code is made available under a BSD 3-clause license. You can freely use and modify the code, without warranty, so long as you provide attribution to the author. See [LICENSE](./LICENSE) for the full license text.

The author reserve the rights to the article content.