Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/j77m/stuffy-nose-recognition

Trained neural network for recognizing speaking with stuffy nose.
https://github.com/j77m/stuffy-nose-recognition

Last synced: 28 days ago
JSON representation

Trained neural network for recognizing speaking with stuffy nose.

Awesome Lists containing this project

README

        

# stuffy-nose-recognition
Trained neural network for recognizing speaking with stuffy nose.

Extension : real time recognition on 2 seconds frames.

Model was trained only on data from one person (me), so there is possibility to record data and train model.

Also record new data and train model if demand is smaller frame (more real time recognition).

(There was extension in training process, instead of training on recording data, to train on analysed wav files, but results on real time were about 60% accuracy, so feel free to try and improve)



|

|__ training

| |_ data # data recorded by data_record

| |_ model_evaluate # folder for tensorboard evaluations

| |_ trained_models # trained models

| |_ data_record.py # record and saves

| |_ model.py # neural network model

| |_ utils.py # useful functions

| |_ audio.py # class for audio processing

|

|__ real_time_analysis.py # real time use of trained neural network






Testing showed, that the best architecture is :

_________________________________________________________________

Layer (type) Output Shape Param #

=================================================================

conv1d (Conv1D) (None, 991, 64) 704

_________________________________________________________________

max_pooling1d (MaxPooling1D) (None, 247, 64) 0

_________________________________________________________________

conv1d_1 (Conv1D) (None, 238, 64) 41024

_________________________________________________________________

max_pooling1d_1 (MaxPooling1 (None, 59, 64) 0

_________________________________________________________________

flatten (Flatten) (None, 3776) 0

_________________________________________________________________

dense (Dense) (None, 256) 966912

_________________________________________________________________

dense_1 (Dense) (None, 1) 257

=================================================================

Total params: 1,008,897

Trainable params: 1,008,897

Non-trainable params: 0

_________________________________________________________________

None


batch_size = 5, epochs=10

100% validation accuracy on my data (i wont share them, because of my privacy :) )

data - total: 4.5 min of recording (2.25 min talking with stuffy nose - nose stuffed with fingers, 2.25 min talking with clear nose)



If you are training model, please test and evaluate your combinations of architecture.

Training code is adapted for tensorboard evaluation.



J.M.