{"id":16444906,"url":"https://github.com/jaymon/transcribe","last_synced_at":"2025-03-21T05:30:33.304Z","repository":{"id":55527806,"uuid":"119768227","full_name":"Jaymon/transcribe","owner":"Jaymon","description":"Convert images or audio files to plain text on the command line","archived":false,"fork":false,"pushed_at":"2019-09-23T22:56:23.000Z","size":36,"stargazers_count":31,"open_issues_count":6,"forks_count":7,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-03-01T03:22:05.323Z","etag":null,"topics":["command-line-tool","ocr","python","speech-recognition","speech-to-text","transcription"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Jaymon.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-02-01T01:54:58.000Z","updated_at":"2025-02-24T13:07:47.000Z","dependencies_parsed_at":"2022-08-15T02:31:10.542Z","dependency_job_id":null,"html_url":"https://github.com/Jaymon/transcribe","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jaymon%2Ftranscribe","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jaymon%2Ftranscribe/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jaymon%2Ftranscribe/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Jaymon%2Ftranscribe/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Jaymon","download_url":"https://codeload.github.com/Jaymon/transcribe/tar.gz/refs/heads/master"
,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244114683,"owners_count":20400404,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["command-line-tool","ocr","python","speech-recognition","speech-to-text","transcription"],"created_at":"2024-10-11T09:42:33.169Z","updated_at":"2025-03-21T05:30:32.878Z","avatar_url":"https://github.com/Jaymon.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Transcribe\n\n1. _verb_ - To convert a representation of language to another representation.\n2. _noun_ - Command line application to extract plain text from images and audio files, written in Python.\n\n## OCR\n\nTo extract the text from an image:\n\n    $ transcribe ocr /path/to/image.jpg\n\n\n## Speech\n\nTo extract the text from an audio file:\n\n    $ transcribe speech /path/to/audio.mp3\n\n\n## Installation\n\n### Google Cloud setup\n\nTranscribe uses Google's cloud services to perform the text extraction, which means you have to set up a Google Cloud project. Below is roughly how I did that...\n\nI went to [the console](https://console.cloud.google.com).\n\nI selected _Project_ in the top left corner (to the right of _Google Cloud Platform_) and created a new project, then waited a bit for it to create the project and switch to it (this took like 20 seconds while I was trying to figure out what the heck was going on). Then I activated the Vision and Speech APIs (I think Storage is automatically activated), selected Credentials, and created an API key. 
[More auth info here](https://cloud.google.com/vision/docs/common/auth).\n\nTurns out I also needed to [create a service account JSON key file](https://cloud.google.com/storage/docs/authentication#generating-a-private-key). To do this, I went to [the dashboard for the project](https://console.cloud.google.com/apis/credentials?project=vision-157908), clicked __Create credentials__, and chose __Service account key__.\n\nThen add this to your `.bash_profile` or something similar:\n\n```\nexport GOOGLE_APPLICATION_CREDENTIALS=\"/path/to/keyfile.json\"\n```\n\n### ffmpeg\n\nIf you're on a Mac and you use Homebrew, you can install ffmpeg like this:\n\n    $ brew install --with-opus ffmpeg\n\nIf you're not on a Mac or don't use Homebrew, you're on your own.\n\n\n### Transcribe\n\nInstall using pip:\n\n    $ pip install transcribe\n\nOr install the latest and greatest:\n\n    $ pip install \"git+https://github.com/Jaymon/transcribe#egg=transcribe\"\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjaymon%2Ftranscribe","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjaymon%2Ftranscribe","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjaymon%2Ftranscribe/lists"}