{"id":13825651,"url":"https://github.com/zevaverbach/tatt","last_synced_at":"2025-04-13T18:52:02.186Z","repository":{"id":57473506,"uuid":"169592697","full_name":"zevaverbach/tatt","owner":"zevaverbach","description":"Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.","archived":false,"fork":false,"pushed_at":"2019-07-18T10:22:43.000Z","size":190,"stargazers_count":11,"open_issues_count":42,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-04-04T22:07:24.159Z","etag":null,"topics":["amazon-transcribe-api","asr","automatic-speech-recognition","cli","speech-to-text","stt"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zevaverbach.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-02-07T15:21:25.000Z","updated_at":"2023-03-20T08:06:59.000Z","dependencies_parsed_at":"2022-09-26T17:40:49.089Z","dependency_job_id":null,"html_url":"https://github.com/zevaverbach/tatt","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftatt","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftatt/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftatt/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftatt/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zevaverbach","download_url":"https://codeload.github.com/zevaverbach/tatt/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248765984,"owners_count":21158296,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["amazon-transcribe-api","asr","automatic-speech-recognition","cli","speech-to-text","stt"],"created_at":"2024-08-04T09:01:24.762Z","updated_at":"2025-04-13T18:52:02.146Z","avatar_url":"https://github.com/zevaverbach.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# Transcribe All The Things™\n\ntatt is a CLI for creating and managing speech-to-text transcripts.\n\n![demo](demo.gif)\n\n## Installation\n\n    pip install tatt\n\n## Dependencies\n\n1. A recording to transcribe.\n2. a) An AWS account or b) a Google Cloud account with the speech-to-text API and\n   Cloud Storage enabled.\n3. `ffmpeg` for some operations, but `tatt` will let you know when it's needed.\n\n\n## Usage\n\n### List All Commands\n    $ transcribe --help\n\n    Usage: transcribe [OPTIONS] COMMAND [ARGS]...\n\n    Options:\n      --help  Show this message and exit.\n\n    Commands:\n      get        Downloads and/or saves completed transcript.\n      languages\n      list       Lists available STT services.\n      services   Lists available speech-to-text services.\n      status     Check the status of a transcription job.\n      this       Sends a media file to be transcribed.\n\n### List All STT Services\n    $ transcribe services\n\n    Here are all the available speech-to-text services:\n\n      amazon\t\t$0.006 per 15 seconds\n      google\t\t$0.009 per 15 seconds\n\n### Get Something Transcribed\n    $ transcribe this \u003cpath_to_media_file\u003e \u003cservice_name\u003e\n\n    Okay, transcribing \u003cpath_to_media_file\u003e using \u003cservice_name\u003e...\n    Okay, job \u003cjob_name\u003e is being transcribed.  Use \"get\" command to download it.\n\n    $ transcribe this --help\n\n    Usage: transcribe this [OPTIONS] MEDIA_FILEPATH SERVICE_NAME\n\n    Sends a media file to be transcribed.\n\n    Options:\n      --punctuation                   only for Google Speech, defaults to True\n      --speaker-id / --no-speaker-id  only for google and amazon, defaults to True\n      --num_speakers INTEGER          only for google and amazon, defaults to 2\n      --model TEXT                    only for Google Speech, defaults to\n                                      \"phone_call\"\n      --use-enhanced                  only for Google Speech, defaults to True\n      --language-code TEXT            only for google and amazon, defaults to en-\n                                      US\n      --help                          Show this message and exit.\n\n### List Transcripts\n    $ transcribe list\n\n    Service Job Name                                Status\n    ------- --------                                ------\n    amazon  tester.mp3                              IN_PROGRESS\n    amazon  messed_up.mp4                           FAILED\n    amazon  done_test.mp3                           COMPLETED\n    amazon  also_done.MP3                           COMPLETED\n    google  hey_there.mp3                           COMPLETED\n\n\n    $ transcribe list \u003cjob_name\u003e\n\n    Service Job Name                                Status\n    ------- --------                                ------\n    amazon  \u003cjob_name\u003e                              IN_PROGRESS\n\n\n### Get A Completed Transcript\n    $ transcribe get \u003cjob_name\u003e # prints to stdout\n\n    {'accountId': '416321668733',\n     'jobName': 'a1bace5e-8b08-4ce4-b08c-834a23aafcf1',\n     'results': {'items': [{'alternatives': [{'confidence': '0.9774',\n                                              'content': 'Hi'}],\n                            'end_time': '1.5',\n                            'start_time': '1.23',\n                            'type': 'pronunciation'},\n                           {'alternatives': [{'confidence': '0.9429',\n                                              'content': 'is'}],\n                            'end_time': '1.71',\n                            'start_time': '1.5',\n                            'type': 'pronunciation'},\n                           ...\n\n    $ transcribe get --save \u003cjob_name\u003e\n\n    Saved transcript to \u003cjob_name\u003e.json\n\n\n## Services Supported\n\n  - [Amazon Transcribe](https://aws.amazon.com/transcribe/)\n  - [Google Cloud Speech-to-Text](https://cloud.google.com/speech-to-text/)\n\n### Planned\n  - [Watson](https://www.ibm.com/watson/services/speech-to-text/) \n  - [Kaldi](https://github.com/kaldi-asr/kaldi) [ and/or things built on it ](https://github.com/lowerquality/gentle)\n  - [Speechmatics](https://www.speechmatics.com/)\n  - [Mozilla's new open-source STT thing](https://github.com/mozilla/DeepSpeech)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzevaverbach%2Ftatt","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzevaverbach%2Ftatt","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzevaverbach%2Ftatt/lists"}