Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ancs21/awesome-openai-whisper

A curated list of awesome OpenAI's Whisper
https://github.com/ancs21/awesome-openai-whisper

List: awesome-openai-whisper

artificial-intelligence awesome awesome-list awesome-whisper deep-learning machine-learning openai speech speech-recognition speech-to-text state-of-the-art whisper

Last synced: about 2 months ago
JSON representation

A curated list of awesome OpenAI's Whisper

Awesome Lists containing this project

README

        

# awesome-openai-whisper
A curated list of awesome OpenAI's Whisper

## General Resources
* [Introducing Whisper](https://openai.com/blog/whisper/)
* [Whisper Paper](https://cdn.openai.com/papers/whisper.pdf)
* [Whisper Code](https://github.com/openai/whisper)
* [Introducing ChatGPT and Whisper APIs](https://openai.com/blog/introducing-chatgpt-and-whisper-apis)

## API Ready / Playground / Demo
* [whisperx](https://replicate.com/daanelson/whisperx)
* [WHISPER+](https://www.oneai.com/speech-to-text)
* [Fine-Tuned Whisper API](https://www.assemblyai.com/)
* [openai/whisper – Run with an API on Replicate](https://replicate.com/openai/whisper)
* [Whisper - a Hugging Face Space by openai](https://huggingface.co/spaces/openai/whisper)
* [Whisper Playground](https://whisperui.monsterapi.ai/)
* [Source](https://github.com/saharmor/whisper-playground)
* [Web Whisper - 🎶 Convert any audio to text 📝](https://whisper.r3d.red)
* [Source](https://codeberg.org/pluja/web-whisper)

## Model Variants
* [whisper-timestamped - Whisper with word-level timestamps and confidence ](https://github.com/linto-ai/whisper-timestamped)
* [whisper.cpp - Port of OpenAI's Whisper model in C/C++](https://github.com/ggerganov/whisper.cpp)
* [pywhispercpp - Python bindings for whisper.cpp ](https://github.com/abdeladim-s/pywhispercpp)
* [Faster Whisper - reimplementation using CTranslate2 up to 4 times faster](https://github.com/guillaumekln/faster-whisper)
* [Whisper JAX - optimised JAX code, largely built on the hugs Hugging Face Transformers Whisper implementation, over 70x faster](https://github.com/sanchit-gandhi/whisper-jax/)
* [whisper.tflite](https://github.com/usefulsensors/openai-whisper)
* [OpenAI Whisper - CPU](https://github.com/MiscellaneousStuff/openai-whisper-cpu)
* [whisper_onnx](https://github.com/Fhrozen/whisper_onnx)
* [whisper-export - openvino version of openai/whisper](https://github.com/axinc-ai/whisper-export)
* [onnx-export](https://github.com/axinc-ai/whisper-export/tree/onnx-export)
* [Whisper OpenVINO](https://github.com/zhuzilin/whisper-openvino)
* [Whisper models on Hugging Face](https://huggingface.co/models?other=whisper)

## Applications
* [React hook for OpenAI Whisper](https://github.com/chengsokdara/use-whisper)
* [🎞️ Subtitles generation tool (Web-UI + CLI + Python package)](https://github.com/abdeladim-s/subsai)
* [Whisper as a Service (GUI and API for OpenAI Whisper)](https://github.com/schibsted/WAAS)
* [WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.](https://github.com/m-bain/whisperX)
* [stable-ts - Stabilizing Timestamps for Whisper](https://github.com/jianfch/stable-ts)
* [buzz - Buzz transcribes audio from your computer's microphones to text using OpenAI's Whisper](https://github.com/chidiwilliams/buzz)
* [whispering - Streaming transcriber with whisper](https://github.com/shirayu/whispering)
* [whisper-youtube - 🔉 Youtube Videos Transcription with OpenAI's Whisper](https://github.com/ArthurFDLR/whisper-youtube)
* [Speaker Identification - Pyannote plays and Whisper rhymes](https://github.com/Majdoddin/nlp)
* [Automatic YouTube subtitle generation](https://github.com/m1guelpf/yt-whisper)
* [Whisper Webui - WebUI for Whisper that can transcribe and translate audio](https://gitlab.com/aadnk/whisper-webui/)
* [AutoCut - generate video subtitles and edit the video by selecting subtitle clips](https://github.com/mli/autocut)
* [AutoCut Client](https://github.com/zcf0508/autocut-client)
* [Whisper Playground - Build real time speech2text web apps using OpenAI's Whisper](https://github.com/saharmor/whisper-playground)
* [Subtitle Edit - a subtitle editor supporting audio to text (speech recognition) via Whisper or Vosk/Kaldi](https://www.nikse.dk/subtitleedit)
* [WEB WHISPER - A light user interface for OpenAI's Whisper right into your browser!](https://codeberg.org/pluja/web-whisper)
* [Whisper Mic - Project that allows one to use a microphone with OpenAI whisper](https://github.com/mallorbc/whisper_mic)
* [Android Whisper ASR App](https://play.google.com/store/apps/details?id=com.whisper.android.tflitecpp)
* [Source](https://github.com/usefulsensors/openai-whisper/tree/main/android_app)
* [Apple Whisper ASR App](https://apps.apple.com/in/app/whisper-asr/id6444556326)
* [💬 ASR FastAPI](https://github.com/Wordcab/wordcab-transcribe)

## Videos
* [OpenAI Whisper - MultiLingual AI Speech Recognition Live App Tutorial](https://www.youtube.com/watch?v=ywIyc8l1K1Q)
* [Complete Tutorial Video for OpenAI's Whisper Model for Windows Users](https://www.youtube.com/watch?v=msj3wuYf3d8)
* [Open AI’s Whisper is Amazing!](https://www.youtube.com/watch?v=OCBZtgQGt1I)
* [How to Use OpenAI Whisper to Fix YouTube Search](https://www.youtube.com/watch?v=vpU_6x3jowg)

## Tutorials
* [Convert Podcasts to Text With OpenAI’s Whisper API Using Python](https://betterprogramming.pub/openais-whisper-tutorial-42140dd696ee)
* [Create your own speech to text application with Whisper from OpenAI and Flask](https://blog.paperspace.com/whisper-openai-flask-application-deployment/)
* [How to Run OpenAI’s Whisper Speech Recognition Model](https://www.assemblyai.com/blog/how-to-run-openais-whisper-speech-recognition-model/)
* [Speech-to-Text with OpenAI’s Whisper](https://towardsdatascience.com/speech-to-text-with-openais-whisper-53d5cea9005e)

## Articles
* [Whispers of A.I.’s Modular Future](https://www.newyorker.com/tech/annals-of-technology/whispers-of-ais-modular-future)
* [OpenAI open-sources Whisper, a multilingual speech recognition system](https://techcrunch.com/2022/09/21/openai-open-sources-whisper-a-multilingual-speech-recognition-system/)
* [OpenAI Releases 1.6 Billion Parameter Multilingual Speech Recognition AI Whisper](https://www.infoq.com/news/2022/10/openai-whisper-speech/)
* [OpenAI Releases Whisper: A New Open-Source Machine Learning Model For Multi-Lingual Automatic Speech Recognition](https://www.marktechpost.com/2022/09/27/openai-releases-whisper-a-new-open-source-machine-learning-model-for-multi-lingual-automatic-speech-recognition/)