Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/efecanxrd/speech-recognition
Identify speaker from given speech signal using MFCC features and Gaussian Mixture Models
https://github.com/efecanxrd/speech-recognition
gaussian-mixture-models gaussianmixturemodel gmm mfcc mfcc-algorithm mfcc-features python python-speech-features python-speechrecognition python27 recognition sklearn sklearn-gmm speech speech-recognition speech-recognizer tensorflow
Last synced: 20 days ago
JSON representation
Identify speaker from given speech signal using MFCC features and Gaussian Mixture Models
- Host: GitHub
- URL: https://github.com/efecanxrd/speech-recognition
- Owner: efecanxrd
- License: mit
- Created: 2022-01-19T16:55:29.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-02-23T08:46:47.000Z (almost 2 years ago)
- Last Synced: 2024-12-22T03:05:04.081Z (20 days ago)
- Topics: gaussian-mixture-models, gaussianmixturemodel, gmm, mfcc, mfcc-algorithm, mfcc-features, python, python-speech-features, python-speechrecognition, python27, recognition, sklearn, sklearn-gmm, speech, speech-recognition, speech-recognizer, tensorflow
- Language: Python
- Homepage:
- Size: 1.89 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Speech Recognition
![EfecanLogo](https://avatars.githubusercontent.com/u/66366306?s=100&u=dc5e6f5b4a05d07958d9a867b803760aa2b1613e&v=4)
### A project with deep learning networks that recognizes who owns a voice using libraries like scipy, sklearn, and python-speech-features
![XhW](https://i.imgur.com/qHAcfhX.gif)
## Setup This Project
### Install Python2.7
- I recommend that you install Anaconda and install python 2.7 from the environments part of anaconda.
- After this installation, you can run the project by going to the project directory in your terminal and typing ```conda activate python2x```
### Install Libraries
- Switch to **Python2x** environment by typing ```conda activate Python2x```
- Then you can install the modules by typing ```pip install -r requirements.txt``` in the terminal.
- Since you are running the project around the **Python2x** environment, you must also enable **Python2x** for use **pip** command
## How this is working?
A program that recognizes the sound of the specified file using models using methods such as mfcc gmm. **Code comments were entered as # comments on each line.**
- train.py : Used for model audio files in trainData folder
- recognize.py : It is used to define a selected file or all files in the data folder.
- requirements.txt : Text file containing the necessary libraries
- ./models : The folder where train.py outputs and recognize.py uses. Here are the models of the trained audio files.
- ./Data : The files here use recognize.py. Here you should drop the file you want to define .
- ./trainingData : Sound files to be used for modeling are placed here.For each audio file, it should be recorded as **VoiceName-Integer/VoiceFile.wav** Example: **Melissa-005/Melissa.wav** | Since the folder name is split, it should be saved like this.
You can get information by the link below. You might need to translate the page
[](https://efecanxrd.medium.com/mfcc-mel-frequency-cepstral-coefficients-ve-gmm-gaussian-mixture-model-teknikleri-f81a2cc632a5)