Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/heyfoz/python-openai-whisper
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.
https://github.com/heyfoz/python-openai-whisper
ai api audio-transcription openai python speech-to-text whisper
Last synced: 14 days ago
JSON representation
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.
- Host: GitHub
- URL: https://github.com/heyfoz/python-openai-whisper
- Owner: heyfoz
- License: mit
- Created: 2024-05-24T06:42:25.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-05-30T12:15:13.000Z (6 months ago)
- Last Synced: 2024-10-10T19:20:21.872Z (about 1 month ago)
- Topics: ai, api, audio-transcription, openai, python, speech-to-text, whisper
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Python OpenAI Whisper Speech to Text Transcription
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result can be returned to the console as text or VTT (WebVTT) format.
## Installation
Before using the script, make sure to install the OpenAI Python client library. You can install it using pip:
```bash
pip install openai
```### Usage
Clone this repository:
```bash
git clone https://github.com/ffm5113/python_openai_whisper.git
```Navigate to the project directory:
```bash
cd your_repository
```Run the script:
```bash
python whisper_local.py
```Follow the prompts to enter the file path of the audio file and choose the desired response format (text or vtt).
**Note:** To access the OpenAI API, you will need an API key. File size is limited to 25 MB. Please refer to the OpenAI API documentation for instructions on how to obtain and use the API key.
### Documentation
- [OpenAI API Reference](https://platform.openai.com/docs/api-reference/audio) - Refer to the official documentation for more details on the OpenAI audio API.
- [Whisper Model Repository](https://github.com/openai/whisper) - Repository for the open source Whisper model that powers the OpenAI API.
- [Introducing Whisper](https://openai.com/index/whisper/) - Learn more about the Whisper model on the OpenAI website.### License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.