Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/zevaverbach/tatt
Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.
https://github.com/zevaverbach/tatt
amazon-transcribe-api asr automatic-speech-recognition cli speech-to-text stt
Last synced: 3 months ago
JSON representation
Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.
- Host: GitHub
- URL: https://github.com/zevaverbach/tatt
- Owner: zevaverbach
- License: gpl-3.0
- Created: 2019-02-07T15:21:25.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2019-07-18T10:22:43.000Z (over 5 years ago)
- Last Synced: 2024-07-16T08:47:15.992Z (4 months ago)
- Topics: amazon-transcribe-api, asr, automatic-speech-recognition, cli, speech-to-text, stt
- Language: Python
- Homepage:
- Size: 186 KB
- Stars: 11
- Watchers: 3
- Forks: 0
- Open Issues: 42
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Transcribe All The Things™
tatt is a CLI for creating and managing speech-to-text transcripts.
![demo](demo.gif)
## Installation
pip install tatt
## Dependencies
1. A recording to transcribe.
2. a) An AWS account or b) a Google Cloud account with the speech-to-text API and
Cloud Storage enabled.
3. `ffmpeg` for some operations, but `tatt` will let you know when it's needed.## Usage
### List All Commands
$ transcribe --helpUsage: transcribe [OPTIONS] COMMAND [ARGS]...
Options:
--help Show this message and exit.Commands:
get Downloads and/or saves completed transcript.
languages
list Lists available STT services.
services Lists available speech-to-text services.
status Check the status of a transcription job.
this Sends a media file to be transcribed.### List All STT Services
$ transcribe servicesHere are all the available speech-to-text services:
amazon $0.006 per 15 seconds
google $0.009 per 15 seconds### Get Something Transcribed
$ transcribe thisOkay, transcribing using ...
Okay, job is being transcribed. Use "get" command to download it.$ transcribe this --help
Usage: transcribe this [OPTIONS] MEDIA_FILEPATH SERVICE_NAME
Sends a media file to be transcribed.
Options:
--punctuation only for Google Speech, defaults to True
--speaker-id / --no-speaker-id only for google and amazon, defaults to True
--num_speakers INTEGER only for google and amazon, defaults to 2
--model TEXT only for Google Speech, defaults to
"phone_call"
--use-enhanced only for Google Speech, defaults to True
--language-code TEXT only for google and amazon, defaults to en-
US
--help Show this message and exit.### List Transcripts
$ transcribe listService Job Name Status
------- -------- ------
amazon tester.mp3 IN_PROGRESS
amazon messed_up.mp4 FAILED
amazon done_test.mp3 COMPLETED
amazon also_done.MP3 COMPLETED
google hey_there.mp3 COMPLETED$ transcribe list
Service Job Name Status
------- -------- ------
amazon IN_PROGRESS### Get A Completed Transcript
$ transcribe get # prints to stdout{'accountId': '416321668733',
'jobName': 'a1bace5e-8b08-4ce4-b08c-834a23aafcf1',
'results': {'items': [{'alternatives': [{'confidence': '0.9774',
'content': 'Hi'}],
'end_time': '1.5',
'start_time': '1.23',
'type': 'pronunciation'},
{'alternatives': [{'confidence': '0.9429',
'content': 'is'}],
'end_time': '1.71',
'start_time': '1.5',
'type': 'pronunciation'},
...$ transcribe get --save
Saved transcript to .json
## Services Supported
- [Amazon Transcribe](https://aws.amazon.com/transcribe/)
- [Google Cloud Speech-to-Text](https://cloud.google.com/speech-to-text/)### Planned
- [Watson](https://www.ibm.com/watson/services/speech-to-text/)
- [Kaldi](https://github.com/kaldi-asr/kaldi) [ and/or things built on it ](https://github.com/lowerquality/gentle)
- [Speechmatics](https://www.speechmatics.com/)
- [Mozilla's new open-source STT thing](https://github.com/mozilla/DeepSpeech)