Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/taf2/speech2text
Using Google Speech to Text API Provide a Simple Interface to Convert Audio Files
https://github.com/taf2/speech2text
Last synced: 5 days ago
JSON representation
Using Google Speech to Text API Provide a Simple Interface to Convert Audio Files
- Host: GitHub
- URL: https://github.com/taf2/speech2text
- Owner: taf2
- Created: 2011-03-24T21:40:32.000Z (over 13 years ago)
- Default Branch: master
- Last Pushed: 2021-03-24T07:01:19.000Z (over 3 years ago)
- Last Synced: 2024-10-15T13:45:01.079Z (19 days ago)
- Language: Ruby
- Homepage: http://calltrackingmetrics.com/
- Size: 1.84 MB
- Stars: 361
- Watchers: 35
- Forks: 86
- Open Issues: 14
-
Metadata Files:
- Readme: README.rdoc
Awesome Lists containing this project
- starred-awesome - speech2text - Using Google Speech to Text API Provide a Simple Interface to Convert Audio Files (Ruby)
README
= Speech2Text
Using the power of ffmpeg/flac/Google and ruby here is a simple interface to play with to convert speech to text.
Using a new undocumentd speech API from Google with the help of this article: http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/
We're able to provide a very simple API in Ruby to decode simple audio to text.
The API from Google is not yet public and so may change. It also seems to be very fragile as more times than not it will return a 500, so the library has retry code built in - for larger audio files 10+ failures may return before a successful result is retrieved...
It also appears that the API only likes smaller audio files so there is a built in chunker that allows us to split the audio up into smaller chunks.
== Installation
Add this line to your application's Gemfile:
gem 'speech2text'
And then execute:
$ bundle
Or install it yourself as:
$ gem install speech2text
You must also install the ffmpeg utility on your local machine. If you are using OS X, the easiest way to do this is to install brew. You may need to instal XQuartz before installing ffmpeg:
$ ruby -e "$(curl -fsSL https://raw.github.com/mxcl/homebrew/go)"
# You may be prompted to install XQuartz. Get it at https://xquartz.macosforge.org
$ brew install ffmpeg== Example
require 'speech'
audio = Speech::AudioToText.new("i-like-pickles.wav")
puts audio.to_text.inspect
=> {"captured_json"=>[["I like pickles", 0.92731786]], "confidence"=>0.92731786}== Command Line
speech2text i-like-pickles.wav
cat i-like-pickles.json
{"captured_json"=>[["I like pickles", 0.92731786]], "confidence"=>0.92731786}