{"id":23913393,"url":"https://github.com/seapagan/vosk-test","last_synced_at":"2025-10-16T05:25:25.705Z","repository":{"id":271042720,"uuid":"905275498","full_name":"seapagan/vosk-test","owner":"seapagan","description":"Some experiments in using 'Vosk' speech-to-text under Python including real-time from a microphone over the web.","archived":false,"fork":false,"pushed_at":"2024-12-30T14:16:12.000Z","size":24,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-09-29T09:34:36.838Z","etag":null,"topics":["python","speech-to-text","vosk","vosk-api","websocket"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/seapagan.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-12-18T13:57:52.000Z","updated_at":"2024-12-30T14:18:20.000Z","dependencies_parsed_at":"2025-01-05T06:01:52.867Z","dependency_job_id":null,"html_url":"https://github.com/seapagan/vosk-test","commit_stats":null,"previous_names":["seapagan/vosk-test"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/seapagan/vosk-test","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seapagan%2Fvosk-test","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seapagan%2Fvosk-test/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seapagan%2Fvosk-test/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seapagan%2Fvosk-test/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/seapagan","download_url":"https://codeload.github.com/seapagan/vosk-test/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/seapagan%2Fvosk-test/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279159515,"owners_count":26116488,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-16T02:00:06.019Z","response_time":53,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["python","speech-to-text","vosk","vosk-api","websocket"],"created_at":"2025-01-05T09:44:48.980Z","updated_at":"2025-10-16T05:25:25.643Z","avatar_url":"https://github.com/seapagan.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Speech To Text Experiments\n\nJust some experiments with speech to text using the `vosk` package and models.\nVosk is not as accurate as `Whisper`, but transcription is a lot faster. I'll be\nlooking at the `fast-whisper` package later too.\n\n## Installation\n\n```bash\nuv sync\nsource .venv/bin/activate\n```\n\nYou also need to download a suitable`vosk` model. The model should be extracted\nin it's own named folder in the root of the repository. You can get the models\nfrom the [vosk website](https://alphacephei.com/vosk/models).\n\n\u003e [!NOTE]\n\u003e\n\u003e It should be possible to pass the resulting transcribed result through another\n\u003e model like [recasepunc](https://github.com/benob/recasepunc) to end up with a\n\u003e properly punctuated and cased string. This is something to look into adding\n\u003e later.\n\n## Files\n\n- `backend.py`: A FastAPI backend and Jinja template that uses the `vosk`\npackage to transcribe audio files. This version will only return once the\nrecording is done.\n- `test_microphone.py`: A script that uses the `sounddevice` package to record\n  from the microphone and transcribe the audio in real-time using the `vosk`\n  package (modified from the original code from the `vosk` package).\n- `transcribe_file.py`: A script that uses the `vosk` package to transcribe an\n  audio file.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fseapagan%2Fvosk-test","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fseapagan%2Fvosk-test","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fseapagan%2Fvosk-test/lists"}