Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/kiritoind/youtube_audio_transcripter
Youtube Audio transcription with WhisperAi , The script downloads audio from a YouTube video and transcribes them into text.
https://github.com/kiritoind/youtube_audio_transcripter
jupyter-notebook machine-learning speech-recognition speech-to-text whisper-ai
Last synced: 6 days ago
JSON representation
Youtube Audio transcription with WhisperAi , The script downloads audio from a YouTube video and transcribes them into text.
- Host: GitHub
- URL: https://github.com/kiritoind/youtube_audio_transcripter
- Owner: kiritoInd
- License: mit
- Created: 2024-02-16T13:09:20.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-02-19T07:17:07.000Z (9 months ago)
- Last Synced: 2024-02-19T14:37:25.819Z (9 months ago)
- Topics: jupyter-notebook, machine-learning, speech-recognition, speech-to-text, whisper-ai
- Language: Python
- Homepage:
- Size: 19.5 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# README: Transcribing Audio Segments with OpenAI's Whisper Model
---
## Steps
`Clone the Repo`
`run the Run.py`
`Input a csv file with youtube links in rows`
![Step 2](./images/st1.png)
`Generate the subtitles`
`Download the csv file after completed`
![Step 3](./images/st3.png)
`Ouput`
![Output](./images/ot.png)
---
## Introduction
This README provides an overview and instructions for using the provided Python script to transcribe audio segments using OpenAI's Whisper model. The script downloads audio segments from a YouTube video and transcribes them into text.
---
## Dependencies
Before running the script, ensure you have the following dependencies installed:
- Python 3.x
- PyTube (`pip install pytube`)
- MoviePy (`pip install moviepy`)
- Whisper (`pip install openai-whisper`)---
## Usage
1. **Clone the Repository:**
Clone the repository containing the Python script to your local machine.
2. **Run the Script:**
Open a terminal or command prompt and navigate to the directory containing the Python script.
Run the script using the following command:
3. **Follow the Instructions:**
- The script will prompt you to enter the URL of the YouTube video you want to transcribe.
- It will download the audio segments from the video and transcribe them using OpenAI's Whisper model.
- The detected language and recognized text for each audio segment will be printed in the terminal.---
## Understanding the Script
- The script leverages the PyTube library to download audio segments from the specified YouTube video.
- It utilizes the Whisper model provided by OpenAI to transcribe the audio segments.
- Audio segments are processed using log-Mel spectrograms before being transcribed.
- The script outputs the detected language and recognized text for each audio segment.---
## Notes
- Ensure you have an active internet connection to download the YouTube video and utilize the Whisper model.
- The provided script assumes that the audio segments are saved as "audio1.mp3", "audio2.mp3", and so on in the current directory.
- Make sure you have appropriate permissions to download and process the audio segments.---
## Credits
- OpenAI for providing the Whisper model.
- PyTube and MoviePy developers for creating useful libraries for working with YouTube videos and audio processing in Python.---
## Disclaimer
- This script is provided as-is and may require adjustments based on your specific requirements and environment.
- Use the script responsibly and ensure compliance with YouTube's terms of service and copyright policies.---
Feel free to reach out for further assistance or clarification. Happy transcribing!