https://github.com/heyfoz/python-openai-whisper

This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.
ai api audio-transcription openai python speech-to-text whisper

Host: GitHub
URL: https://github.com/heyfoz/python-openai-whisper
Owner: heyfoz
License: mit
Created: 2024-05-24T06:42:25.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-05-30T12:15:13.000Z (about 1 year ago)
Last Synced: 2025-02-12T09:59:03.547Z (5 months ago)
Topics: ai, api, audio-transcription, openai, python, speech-to-text, whisper
Language: Python
Homepage:
Size: 12.7 KB
Stars: 1
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Python OpenAI Whisper Speech to Text Transcription

This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result can be returned to the console as text or VTT (WebVTT) format.

## Installation

Before using the script, install the OpenAI Python client library with pip:

```bash
pip install openai
```

## Usage

Clone this repository:

```bash
git clone https://github.com/heyfoz/python-openai-whisper.git
```

Navigate to the project directory:

```bash
cd python-openai-whisper
```

Run the script:

```bash
python whisper_local.py
```

Follow the prompts to enter the path of the audio file and choose the response format (`text` or `vtt`).
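The prompt step described above could be handled roughly as follows. This is a minimal sketch, not the code in `whisper_local.py`: the helper names `normalize_path`, `normalize_format`, and `prompt_user`, and the prompt wording, are assumptions made for illustration.

```python
from pathlib import Path

# The two response formats this README mentions.
ALLOWED_FORMATS = ("text", "vtt")


def normalize_path(raw: str) -> Path:
    """Trim whitespace and surrounding quotes (hypothetical helper)."""
    return Path(raw.strip().strip("'\"")).expanduser()


def normalize_format(choice: str) -> str:
    """Return the lowercase format name, or raise ValueError if unsupported."""
    cleaned = choice.strip().lower()
    if cleaned not in ALLOWED_FORMATS:
        raise ValueError(
            f"Unsupported format {choice!r}; choose one of {ALLOWED_FORMATS}"
        )
    return cleaned


def prompt_user(read=input):
    """Ask for the audio path and the response format.

    `read` defaults to the built-in input() and can be swapped out in tests.
    """
    audio_path = normalize_path(read("Audio file path: "))
    response_format = normalize_format(read("Response format (text or vtt): "))
    return audio_path, response_format
```

Calling `prompt_user()` with no arguments reads both answers from the console. Validating the format before any upload means a typo is caught locally rather than by the API.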

**Note:** To access the OpenAI API, you will need an API key. Please refer to the OpenAI API documentation for instructions on how to obtain and use one. Audio files sent to the API are limited to 25 MB.
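As a rough illustration of how the API key and the 25 MB limit come together, the sketch below checks the file size locally and then calls the transcription endpoint of the `openai` client (version 1.x). It is not taken from `whisper_local.py`. The function names `check_audio_file` and `transcribe` are hypothetical, the limit is interpreted here as 25 × 1024 × 1024 bytes, and the key is assumed to be in the `OPENAI_API_KEY` environment variable, which the client reads by default.

```python
from pathlib import Path

# Assumption: the 25 MB limit is treated as 25 * 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 25 * 1024 * 1024


def check_audio_file(path) -> int:
    """Return the file size in bytes, raising if the file is missing or too large."""
    p = Path(path)
    if not p.is_file():
        raise FileNotFoundError(f"No such audio file: {p}")
    size = p.stat().st_size
    if size > MAX_UPLOAD_BYTES:
        raise ValueError(f"{p} is {size} bytes; the limit is {MAX_UPLOAD_BYTES}")
    return size


def transcribe(path, response_format="text", client=None):
    """Send the audio file to the transcription endpoint (hypothetical wrapper)."""
    check_audio_file(path)
    if client is None:
        # Imported lazily so the size check works without the package installed.
        from openai import OpenAI

        client = OpenAI()  # picks up OPENAI_API_KEY from the environment
    with open(path, "rb") as audio:
        return client.audio.transcriptions.create(
            model="whisper-1",
            file=audio,
            response_format=response_format,  # "text" or "vtt"
        )
```

With `response_format` set to `"text"` or `"vtt"`, the result is expected to come back as a plain string that can be printed directly, for example `print(transcribe("meeting.mp3", "vtt"))`, where `meeting.mp3` is a placeholder file name.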

## Documentation

- [OpenAI API Reference](https://platform.openai.com/docs/api-reference/audio) - Refer to the official documentation for more details on the OpenAI audio API.
- [Whisper Model Repository](https://github.com/openai/whisper) - Repository for the open-source Whisper model that powers the OpenAI API.
- [Introducing Whisper](https://openai.com/index/whisper/) - Learn more about the Whisper model on the OpenAI website.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
