{"id":20500081,"url":"https://github.com/zevaverbach/tpro","last_synced_at":"2026-04-24T03:33:47.975Z","repository":{"id":52702315,"uuid":"152815238","full_name":"zevaverbach/tpro","owner":"zevaverbach","description":"Transcript processing from STT services to standardized formats.","archived":false,"fork":false,"pushed_at":"2021-04-20T18:17:53.000Z","size":441,"stargazers_count":1,"open_issues_count":9,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-11-21T20:11:37.674Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zevaverbach.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-10-12T22:51:45.000Z","updated_at":"2019-07-18T15:13:26.000Z","dependencies_parsed_at":"2022-08-22T08:51:00.692Z","dependency_job_id":null,"html_url":"https://github.com/zevaverbach/tpro","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/zevaverbach/tpro","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftpro","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftpro/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftpro/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zevaverbach%2Ftpro/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zevaverbach","download_url":"https://codeload.github.com/zevaverbach/tpro/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/Git
Hub/repositories/zevaverbach%2Ftpro/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32208399,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-24T03:15:14.334Z","status":"ssl_error","status_checked_at":"2026-04-24T03:15:11.608Z","response_time":64,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-15T18:19:44.957Z","updated_at":"2026-04-24T03:33:42.968Z","avatar_url":"https://github.com/zevaverbach.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# tpro\n\nTranscript Processing! 
`tpro` takes transcripts produced by\nvarious speech-to-text services and converts them to various standardized\nformats.\n\n![demo](demo.gif)\n\n# Installation and Usage\n\n## Non-pip Requirement: Stanford NER JAR\n\n  - download and unzip [this](https://nlp.stanford.edu/software/stanford-ner-2018-10-16.zip)\n  - put these files in /usr/local/bin/:\n    - stanford-ner.jar\n    - classifiers/english.all.3class.distsim.crf.ser.gz\n  - you might have to [update Java](https://askubuntu.com/questions/508546/howto-upgrade-java-on-ubuntu-14-04-lts) on Linux\n\n## Pip\n\n    $ pip install tpro\n\n## Usage\n\n    $ tpro --help\n\n    Usage: tpro [OPTIONS] TRANSCRIPT_DATA_PATH OUTPUT_PATH\n                [amazon|gentle|speechmatics|google] [universal|vo]\n\n    Options:\n      -p, --print-output    pretty print the transcript, breaks pipeability\n      --language-code TEXT  specify language, defaults to en-US.\n      --help                Show this message and exit.\n\n# STT Services\n\n- [Speechmatics](https://www.speechmatics.com/)\n- [Amazon Transcribe](https://aws.amazon.com/transcribe/)\n- [Gentle](https://github.com/lowerquality/gentle)\n- [Google Cloud Speech-to-Text](https://cloud.google.com/speech-to-text/)\n\n## Planned\n\n- [Watson](https://www.ibm.com/watson/services/speech-to-text/)\n- [Mozilla's new open-source STT thing](https://github.com/mozilla/DeepSpeech)\n\n# Output Formats\n\n- [Universal Transcript](https://gist.github.com/zevaverbach/d2b7a19397607677878aa3268fda1002#example) (JSON)\n- [viraloverlay](https://github.com/zevaverbach/viraloverlay#json-transcript-format) (JSON)\n\n## Planned\n\n- Draft.js JSON\n- Word (`.doc`, `.docx`)\n- text files\n- SRT (subtitles)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzevaverbach%2Ftpro","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzevaverbach%2Ftpro","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzevaverbach%2Ftpro/lists"}