https://github.com/arjo129/uSpeech

Speech recognition toolkit for the arduino
https://github.com/arjo129/uSpeech

arduino signal speech-processing speech-recognition

Last synced: about 1 year ago
JSON representation

Speech recognition toolkit for the arduino

Host: GitHub
URL: https://github.com/arjo129/uSpeech
Owner: arjo129
License: mit
Archived: true
Created: 2012-08-12T03:07:14.000Z (almost 14 years ago)
Default Branch: 4.x-workingBranch
Last Pushed: 2021-05-05T10:12:34.000Z (about 5 years ago)
Last Synced: 2025-04-18T21:26:21.655Z (about 1 year ago)
Topics: arduino, signal, speech-processing, speech-recognition
Language: C++
Homepage: https://arjo129.wordpress.com/experiments/%C2%B5speech/
Size: 482 KB
Stars: 474
Watchers: 66
Forks: 102
Open Issues: 6
Metadata Files:
- Readme: README.md
- License: LICENSE.txt

Awesome Lists containing this project

Awesome-arduino - uSpeech - Speech recognition toolkit for the Arduino (Libraries)
awesome-arduino - uSpeech - Speech recognition toolkit for the Arduino (Libraries)

README

# uSpeech library #
The uSpeech library provides an interface for voice recognition using the Arduino. It currently produces phonemes, often the library will produce junk phonemes. Please bare with it for the time being. A noise removal function is underway.

## Minimum Requirements ##
The library is quite intensive on the processor. Each sample collection takes about 3.2 milliseconds so pay close attention to the time. The library has been tested on the Arduino Uno (ATMega32). Each signal object uses up 160bytes. No real time scheduler should be used with this.

## Features ##
- Letter based recognition
- Small memory footprint
- Arduino Compatible
- No training required (not anymore)
- Fixed point arithmetic
- 30% - 40% accuracy if based on phonemes, up to 80% if based on words.
- Plugs directly into an ``analogRead()`` port

## Documentation ##

Head over to the [wiki](https://github.com/arjo129/uSpeech/wiki) and you will find most of the documentation required.

## Algorithm ##
The library utilizes a special algorithm to enable speech detection. First the complexity of the signal is determined by taking
the absolute derivative of the signal multiplying it by a fixed point saclar and then dividing it by the absolute integral of the signal.
Consonants (other than R,L,N and M) have a value above 40 and vowels have a value below 40. Consonants, they can be divided into frictaves and plosives. Plosives are like p or b whereas frictaves are like
s or z. Generally each band of the complexity coeficient (abs derivative over abs integral) can be matched to a small set of frictaves
and plosives. The signal determines if it is a plosive or a frictave by watching the length of the utterance (plosives occur over short periods while frictaves over long).
Finally the most appropriate character is chosen.

- [Return to main page](http://arjo129.github.com)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/arjo129/uSpeech

Awesome Lists containing this project

README