{"id":22107621,"url":"https://github.com/madh93/whisper","last_synced_at":"2026-05-08T10:34:17.968Z","repository":{"id":224101140,"uuid":"762417727","full_name":"Madh93/whisper","owner":"Madh93","description":"🎙️ My Whisper stuff ","archived":false,"fork":false,"pushed_at":"2025-07-23T13:44:31.000Z","size":435,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2026-04-07T06:36:14.705Z","etag":null,"topics":["docker","openai","speech-recognition","speech-to-text","whisper","whisper-cpp"],"latest_commit_sha":null,"homepage":"","language":"Makefile","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Madh93.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-02-23T18:31:46.000Z","updated_at":"2025-07-23T13:44:35.000Z","dependencies_parsed_at":"2024-02-27T22:29:00.261Z","dependency_job_id":"42942c2d-9dc1-4af5-9bfa-fefa0466e3e9","html_url":"https://github.com/Madh93/whisper","commit_stats":null,"previous_names":["madh93/whisper"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Madh93/whisper","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Madh93%2Fwhisper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Madh93%2Fwhisper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Madh93%2Fwhisper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Madh93%2Fwhisper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Madh93","download_url":"https://codeload.github.com/Madh93/whisper/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Madh93%2Fwhisper/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32776866,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-08T08:22:46.396Z","status":"ssl_error","status_checked_at":"2026-05-08T08:22:45.650Z","response_time":54,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["docker","openai","speech-recognition","speech-to-text","whisper","whisper-cpp"],"created_at":"2024-12-01T08:18:21.219Z","updated_at":"2026-05-08T10:34:17.928Z","avatar_url":"https://github.com/Madh93.png","language":"Makefile","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Whisper\n\n[![MIT license](https://img.shields.io/badge/License-MIT-blue.svg)](https://lbesson.mit-license.org/)\n\nPersonal Makefile that provides a set of commands to manage the transcription and conversion process of audio files using [whisper.cpp](https://github.com/ggerganov/whisper.cpp). It supports both Docker-based and native execution.\n\n## Requirements\n\n- [Make](https://www.gnu.org/software/make/)\n- [Docker](https://docs.docker.com/get-docker/)\n- [FFmpeg](https://www.ffmpeg.org/download.html)\n\n## Usage\n\nClone the repository and initialize the required dependencies:\n\n```shell\nmake setup\n```\n\n**Optionally**, if you want AMD ROCm support to use your AMD GPU* just run:\n\n```shell\nWHISPER_HIPBLAS=1 make setup\n```\n\n*If your GPU is not officially supported don't forget to set the `HSA_OVERRIDE_GFX_VERSION` environment variable. More info [here](https://github.com/ollama/ollama/blob/main/docs/gpu.md#overrides).\n\n### Download models\n\nDownloads the necessary models for transcription:\n\n```shell\nmake download\n```\n\nDownload specific model (available model [here](https://github.com/ggerganov/whisper.cpp/tree/master/models#available-models)):\n\n```shell\nmake download model=tiny\n```\n\nBy default, it uses Docker. To disable Docker:\n\n```shell\nDOCKER_ENABLED=no make download model=tiny\n```\n\n### Convert to .wav (optional)\n\nConverts an input audio file to WAV format (currently `whisper.cpp` runs only with 16-bit WAV files, so make sure to convert your input before running the tool):\n\n```shell\nmake convert-to-wav input=audios/jfk.mp3 output=audios/jfk.wav\n```\n\n### Transcribe audio\n\nTranscribes the `.wav` audio file under `audios` directory using the specified model and language:\n\n```shell\nmake transcribe model=small.en lang=en file=audios/jfk.wav\n```\n\nBy default, it utilizes Docker for transcription. To opt for native execution:\n\n```shell\nDOCKER_ENABLED=no make transcribe model=small.en lang=en file=audios/jfk.wav\n```\n\nTo run in your unsupported AMD GPU, just override the LLVM target. Example:\n\n```shell\nHSA_OVERRIDE_GFX_VERSION=10.3.0 DOCKER_ENABLED=no make transcribe model=small.en lang=en file=audios/jfk.wav\n```\n\nAll methods generate `.srt`, `.lrt` and `.txt` transcription files.\n\n### Convert to video\n\nConverts the transcribed text into a video file with subtitles:\n\n```shell\nmake convert-to-video input=audios/jfk.wav\n```\n\n## Useful Links\n\n- [whisper.cpp](https://github.com/ggerganov/whisper.cpp)\n\n## License\n\nThis project is licensed under the [MIT license](LICENSE).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmadh93%2Fwhisper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmadh93%2Fwhisper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmadh93%2Fwhisper/lists"}