{"id":19092693,"url":"https://github.com/chloelavrat/speech-to-text-app","last_synced_at":"2026-05-04T01:32:06.142Z","repository":{"id":253231002,"uuid":"636698944","full_name":"chloelavrat/speech-to-text-app","owner":"chloelavrat","description":"Speech to text web app based on Streamlit and whisper that extract script for audio or youtube video.","archived":false,"fork":false,"pushed_at":"2024-08-01T22:13:13.000Z","size":7584,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-02T23:17:16.821Z","etag":null,"topics":["audio-processing","machine-learning","machinelearning","speech-to-text","streamlit","streamlit-webapp","stt","whisper","whisper-ai"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/chloelavrat.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-05-05T12:47:26.000Z","updated_at":"2024-08-15T08:50:27.000Z","dependencies_parsed_at":"2024-08-15T11:41:44.510Z","dependency_job_id":null,"html_url":"https://github.com/chloelavrat/speech-to-text-app","commit_stats":null,"previous_names":["chloelavrat/speech-to-text-app"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/chloelavrat%2Fspeech-to-text-app","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/chloelavrat%2Fspeech-to-text-app/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/chloelavrat%2Fspeech-to-text-app/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/chloelavrat%2Fspeech-to-text-app/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/chloelavrat","download_url":"https://codeload.github.com/chloelavrat/speech-to-text-app/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240142764,"owners_count":19754633,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio-processing","machine-learning","machinelearning","speech-to-text","streamlit","streamlit-webapp","stt","whisper","whisper-ai"],"created_at":"2024-11-09T03:21:25.241Z","updated_at":"2026-05-04T01:32:01.123Z","avatar_url":"https://github.com/chloelavrat.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n\u003cdiv align=\"center\"\u003e\n  \u003cimg src=\"./assets/README-banner.png\" alt=\"Banner\" style=\"border-radius: 17px; width: 100%; max-width: 800px; height: auto;\"\u003e\n\u003c/div\u003e\n\n\u003ch3 align=\"center\"\u003e\n  \u003cb\u003e\u003ca href=\"azerty-labs.com\"\u003eOnline App\u003c/a\u003e\u003c/b\u003e\n  •\n  \u003cb\u003e\u003ca href=\"#usage\"\u003eUsage\u003c/a\u003e\u003c/b\u003e\n  •\n  \u003cb\u003e\u003ca href=\"https://github.com/azerty-labs/speech-to-text-app/blob/main/doc/capture.png\"\u003eApp Screenshot\u003c/a\u003e\u003c/b\u003e\n\u003c/h3\u003e\n\n\u003cdiv align=\"center\"\u003e\n  \u003ca href=\"https://opensource.org/licenses/MIT\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/License-MIT-blue.svg\" alt=\"License\"\u003e\n  \u003c/a\u003e\n  \u003cimg src=\"https://img.shields.io/badge/python-3.7+-blue.svg\" alt=\"Python Versions\"\u003e\n\u003c/div\u003e\n\u003c/br\u003e\n\n\u003cp align=\"center\"\u003e\n  The \u003cb\u003eSpeech to Text App\u003c/b\u003e is a Streamlit application that allows users to transcribe speech to text from either an uploaded audio file or a YouTube video. The app leverages OpenAI's \u003ca href=\"https://github.com/openai/whisper\"\u003eWhisper\u003c/a\u003e speech-to-text library to provide accurate and efficient transcriptions.\n\u003c/p\u003e\n\n## Features\n\nThe Speech to Text App offers the following features:\n\n- **Transcribe Audio Files:** Upload an audio file (e.g., MP3, WAV) to transcribe the speech into text.\n- **Transcribe YouTube Videos:** Enter a YouTube link to automatically extract and transcribe the audio content.\n\n## How it Works\n\nThe app uses OpenAI's Whisper library to perform speech-to-text transcription. Whisper is a general-purpose speech recognition model trained on a large dataset of diverse audio. It can handle various languages and accents, making it a versatile tool for transcribing different types of speech.\n\nWhen a user uploads an audio file or provides a YouTube link, the app processes the audio through Whisper, which converts the spoken content into written text. This text is then displayed in the app's interface, allowing users to copy or save it as needed.\n\n## Run app locally\nIf you want to run the **speech-to-text-app** streamlit app locally. Multiple solution are possible :\n\n1. **Run the app directly in a virtual environnement**\n    \n    clone repository:\n    ```\n    git clone https://github.com/chloelavrat/speech-to-text-app.git\n    cd speech-to-text-app\n    ```\n    Create, activate and install your environnement:\n    ```\n    python -m venv .venv\n    source .venv/bin/activate\n    pip install -r requirements.txt\n    ```\n    Run the app using streamlit:\n    ```\n    streamlit run streamlit_app.py\n    ```\n\n2. **Run the app using docker**\n\n    Build the `speech-to-text-app` docker container:\n    ```\n    docker build -t speech-to-text-app .\n    ````\n    Run the container at a specified port (here 8501) \n    ```\n    docker run -p 8501:8501 speech-to-text-app\n    ```\n    Go to your web browser and access [localhost:8501](http://localhost:8501)\n\n## Contributing\n\nThis is an open-source project, and contributions are always welcome. If you would like to contribute to the project, you can do so by submitting a pull request or by creating an issue on the project's GitHub page.\n\n## License\n\nThis project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fchloelavrat%2Fspeech-to-text-app","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fchloelavrat%2Fspeech-to-text-app","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fchloelavrat%2Fspeech-to-text-app/lists"}