Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sindresorhus/awesome-whisper
๐ Awesome list for Whisper โ an open-source AI-powered speech recognition system developed by OpenAI
https://github.com/sindresorhus/awesome-whisper
List: awesome-whisper
ai artificial-intelligence awesome awesome-list gpt openai speech-to-text transcription
Last synced: about 1 month ago
JSON representation
๐ Awesome list for Whisper โ an open-source AI-powered speech recognition system developed by OpenAI
- Host: GitHub
- URL: https://github.com/sindresorhus/awesome-whisper
- Owner: sindresorhus
- License: cc0-1.0
- Created: 2023-05-10T10:46:57.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-05T08:09:47.000Z (7 months ago)
- Last Synced: 2024-04-14T04:18:50.603Z (7 months ago)
- Topics: ai, artificial-intelligence, awesome, awesome-list, gpt, openai, speech-to-text, transcription
- Homepage:
- Size: 1.14 MB
- Stars: 974
- Watchers: 20
- Forks: 49
- Open Issues: 1
-
Metadata Files:
- Readme: readme.md
- Contributing: contributing.md
- License: license
- Code of conduct: code-of-conduct.md
Awesome Lists containing this project
- fucking-awesome - Whisper - Open-source AI-powered speech recognition system developed by OpenAI. (Miscellaneous)
- awesome - Whisper - Open-source AI-powered speech recognition system developed by OpenAI. (Miscellaneous)
- awesome - sindresorhus/awesome-whisper - ๐ Awesome list for Whisper โ an open-source AI-powered speech recognition system developed by OpenAI (Others)
- awesome-chatgpt - awesome-whisper - Whisper is an AI-powered speech recognition system. (Related lists / Go)
- ultimate-awesome - awesome-whisper - ๐ Awesome list for Whisper โ an open-source AI-powered speech recognition system developed by OpenAI. (Other Lists / PowerShell Lists)
README
## Contents
- [Official](#official)
- [Model variants](#model-variants)
- [Apps](#apps)
- [Web apps](#web-apps)
- [CLI tools](#cli-tools)
- [Playgrounds](#playgrounds)
- [Packages](#packages)
- [Articles](#articles)
- [Videos](#videos)
- [Community](#community)
- [Third-party APIs](#third-party-apis)
- [Related lists](#related-lists)## Official
- [Introduction](https://openai.com/research/whisper)
- [Source code](https://github.com/openai/whisper)
- [White paper](https://cdn.openai.com/papers/whisper.pdf)## Model variants
- [Whisper.cpp](https://github.com/ggerganov/whisper.cpp) - Port of Whisper in C++.
- [Bindings for many languages](https://github.com/ggerganov/whisper.cpp#bindings)
- [WhisperX](https://github.com/m-bain/whisperX) - Adds fast automatic speaker recognition with word-level timestamps and speaker diarization.
- [faster-whisper](https://github.com/guillaumekln/faster-whisper) - Faster reimplementation of Whisper using CTranslate2.
- [Whisper JAX](https://github.com/sanchit-gandhi/whisper-jax) - JAX implementation of Whisper for up to 70x speed-up on TPU.
- [whisper-timestamped](https://github.com/linto-ai/whisper-timestamped) - Adds word-level timestamps and confidence scores.
- [whisper-openvino](https://github.com/zhuzilin/whisper-openvino) - Whisper running on OpenVINO.
- [whisper.tflite](https://github.com/usefulsensors/openai-whisper) - Whisper running on TensorFlow Lite.
- [Whisper variants](https://huggingface.co/models?other=whisper) - Various Whisper variants on Hugging Faces.
- [Whisper-AT](https://github.com/YuanGongND/whisper-at) - Whisper that can recognize non-speech audio events in addition to speech.## Apps
- [Aiko](https://sindresorhus.com/aiko) - Audio transcription iOS and macOS app.
- [MacWhisper](https://goodsnooze.gumroad.com/l/macwhisper) - Audio transcription macOS app. (Freemium)
- [Whisper Memos](https://apps.apple.com/app/id6443658039) - Audio transcription iOS app. (Freemium)
- [FourYou](https://apps.apple.com/app/id1671616134) - Audio journal iOS app.
- [Jojo Transcribe](https://apps.apple.com/app/id1659864300) - Audio transcription macOS app.
- [Buzz](https://github.com/chidiwilliams/Buzz) - Audio transcription and translation macOS app.
- [WhisperScript](https://store.getwavery.com/l/whisperscript) - Audio transcription macOS app. (Freemium ยท Electron)
- [Audio Podium](https://apps.apple.com/app/id6449008295) - Audio/video management macOS app.
- [superwhisper](https://superwhisper.com) - Global audio transcription macOS menu bar app.
- [Speech Note](https://github.com/mkiol/dsnote) - Audio transcription Linux app.
- [FridayGPT](https://www.fridaygpt.app) - Dictation macOS app powered by OpenAI API.
- [EasyWhisper](https://easywhisper.io) - Windows and macOS app for audio transcription and speaker diarization. (Freemium)## Web apps
### Hosted
- [bigWav](https://bigwav.app) - Audio transcription and annotation tool.
- [Free Podcast Transcription](https://freepodcasttranscription.com) - Runs locally in your browser.
- [Gladia](https://www.gladia.io) - Transcription with real-time processing.### Self-hosted
- [Subs AI](https://github.com/abdeladim-s/subsai) - Subtitle generation.
- [WaaS](https://github.com/schibsted/WAAS) - GUI and API for Whisper.
- [writeout.ai](https://github.com/beyondcode/writeout.ai) - Laravel app to transcribe and translate audio files.
- [Meeper](https://github.com/pas1ko/meeper) - Transcriptions, summary and more for meetings and any browser tab. (Chrome app)## CLI tools
- [yt-whisper](https://github.com/m1guelpf/yt-whisper) - YouTube subtitle generation.
- [phonix](https://github.com/platisd/phonix) - Generate captions for videos.
- [whisper-standalone-win](https://github.com/Purfview/whisper-standalone-win) - Standalone Windows executable for Whisper and Faster Whisper.
- [whisper-ctranslate2](https://github.com/Softcatala/whisper-ctranslate2) - Whisper command-line tool based on CTranslate2, compatible with the original.
- [insanely-fast-whisper-cli](https://github.com/ochen1/insanely-fast-whisper-cli) - Achieve transcription speeds near 30x real-time with several optimizations.
- [whisper-diarization](https://github.com/MahmoudAshraf97/whisper-diarization) - Automatic speech recognition with speaker diarization.## Playgrounds
- [Hugging Faces](https://huggingface.co/spaces/openai/whisper) - Whisper demo running on Hugging Faces. ([Source](https://huggingface.co/spaces/openai/whisper/tree/main))
- [Monster API](https://whisperui.monsterapi.ai) - Whisper demo running on Monster API. ([Source](https://github.com/saharmor/whisper-playground))
- [Web Whisper](https://whisper.r3d.red) - Whisper demo by Pluja. ([Source](https://codeberg.org/pluja/web-whisper))
- [YouTube Video Transcription](https://github.com/ArthurFDLR/whisper-youtube) - Running on Colab.## Packages
### JavaScript
- [use-whisper](https://github.com/chengsokdara/use-whisper) - React hook.
## Articles
- [Whispers of A.I.'s Modular Future](https://www.newyorker.com/tech/annals-of-technology/whispers-of-ais-modular-future) - The future of machine learning lies in adaptable and accessible open-source speech-transcription programs.
- [How to Run Whisper Speech Recognition Model](https://www.assemblyai.com/blog/how-to-run-openais-whisper-speech-recognition-model/) - Explains how to install and run the model, as well as providing a performance analysis comparing Whisper to other models.
- [Create your own speech to text app using Flask](https://blog.paperspace.com/whisper-openai-flask-application-deployment/) - The tutorial demonstrates Whisper's speech-to-text model, with a demo on running it in a Gradient Notebook and a guide for setting up a Flask app with Gradient Deployments.
- [Convert Podcasts to Text](https://betterprogramming.pub/openais-whisper-tutorial-42140dd696ee) - Tutorial on the Whisper API with Python for speech-to-text transcription, showcasing GPU's faster transcription and advanced technology.## Videos
- [Open AI's Whisper is Amazing!](https://www.youtube.com/watch?v=OCBZtgQGt1I) - Introduction to Whisper.
- [How to do Free Speech-to-Text Transcription Better Than Google Premium API](https://www.youtube.com/watch?v=msj3wuYf3d8) - Tutorial.
- [Multilingual AI Speech Recognition Live App](https://www.youtube.com/watch?v=ywIyc8l1K1Q) - Tutorial.## Community
- [Discussions](https://github.com/openai/whisper/discussions)
- [Discord](https://discord.com/invite/openai)## Third-party APIs
*APIs that use Whisper.*
- [Whisper+](https://www.oneai.com/speech-to-text) - Extension of the Whisper model which adds powerful features such as speaker identification custom vocabulary, summarization, and chapter generation.
- [Replicate](https://replicate.com/openai/whisper) - Use Whisper running on Replicate.## Related lists
- [awesome-chatgpt](https://github.com/sindresorhus/awesome-chatgpt) - ChatGPT resources.