Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
https://github.com/ahmetoner/whisper-asr-webservice
asr automatic-speech-recognition docker openai-whisper speech speech-recognition speech-to-text
Last synced: about 1 month ago
JSON representation
OpenAI Whisper ASR Webservice API
- Host: GitHub
- URL: https://github.com/ahmetoner/whisper-asr-webservice
- Owner: ahmetoner
- License: mit
- Created: 2022-09-22T14:26:49.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-08-07T23:57:54.000Z (3 months ago)
- Last Synced: 2024-09-26T22:40:15.425Z (about 2 months ago)
- Topics: asr, automatic-speech-recognition, docker, openai-whisper, speech, speech-recognition, speech-to-text
- Language: Python
- Homepage: https://ahmetoner.github.io/whisper-asr-webservice
- Size: 1.34 MB
- Stars: 1,986
- Watchers: 30
- Forks: 354
- Open Issues: 55
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Funding: .github/FUNDING.yml
Awesome Lists containing this project
README
![Release](https://img.shields.io/github/v/release/ahmetoner/whisper-asr-webservice.svg)
![Docker Pulls](https://img.shields.io/docker/pulls/onerahmet/openai-whisper-asr-webservice.svg)
![Build](https://img.shields.io/github/actions/workflow/status/ahmetoner/whisper-asr-webservice/docker-publish.yml.svg)
![Licence](https://img.shields.io/github/license/ahmetoner/whisper-asr-webservice.svg)# Whisper ASR Webservice
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitask model that can perform multilingual speech recognition as well as speech translation and language identification. For more details: [github.com/openai/whisper](https://github.com/openai/whisper/)
## Features
Current release (v1.6.0) supports following whisper models:
- [openai/whisper](https://github.com/openai/whisper)@[v20240930](https://github.com/openai/whisper/releases/tag/v20240930)
- [SYSTRAN/faster-whisper](https://github.com/SYSTRAN/faster-whisper)@[v1.0.3](https://github.com/SYSTRAN/faster-whisper/releases/tag/1.0.3)## Quick Usage
### CPU
```sh
docker run -d -p 9000:9000 -e ASR_MODEL=base -e ASR_ENGINE=openai_whisper onerahmet/openai-whisper-asr-webservice:latest
```### GPU
```sh
docker run -d --gpus all -p 9000:9000 -e ASR_MODEL=base -e ASR_ENGINE=openai_whisper onerahmet/openai-whisper-asr-webservice:latest-gpu
```for more information:
- [Documentation/Run](https://ahmetoner.github.io/whisper-asr-webservice/run)
- [Docker Hub](https://hub.docker.com/r/onerahmet/openai-whisper-asr-webservice)## Documentation
Explore the documentation by clicking [here](https://ahmetoner.github.io/whisper-asr-webservice).
## Credits
- This software uses libraries from the [FFmpeg](http://ffmpeg.org) project under the [LGPLv2.1](http://www.gnu.org/licenses/old-licenses/lgpl-2.1.html)