https://github.com/saurabhchalke/whisper-meta-quest
Running speech-to-text in a Meta Quest headset using OpenAI's Whisper tiny model
artificial-intelligence automatic-speech-recognition mixed-reality speech-to-text virtual-reality vr whisper
- Host: GitHub
- URL: https://github.com/saurabhchalke/whisper-meta-quest
- Owner: saurabhchalke
- License: mit
- Created: 2024-07-31T07:21:46.000Z (5 months ago)
- Default Branch: master
- Last Pushed: 2024-07-31T09:48:24.000Z (5 months ago)
- Last Synced: 2024-11-08T08:44:50.331Z (about 2 months ago)
- Topics: artificial-intelligence, automatic-speech-recognition, mixed-reality, speech-to-text, virtual-reality, vr, whisper
- Language: C#
- Homepage:
- Size: 98.1 MB
- Stars: 21
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.MD
README
# whisper.unity on Meta Quest 3
[![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://opensource.org/licenses/MIT) [![whisper.cpp](https://img.shields.io/badge/whisper.cpp-v1.5.5-green)](https://github.com/ggerganov/whisper.cpp/releases/tag/v1.5.5)

This project integrates Unity3D bindings for [whisper.cpp](https://github.com/ggerganov/whisper.cpp) to run OpenAI's Whisper ASR model locally on Meta Quest 3. Based on the original [whisper.unity](https://github.com/macoron/whisper.unity) repository by @Macoron.

## Features
- High-performance inference of the Whisper ASR model
- Supports around 60 languages
- Can translate between languages
- Runs entirely on Meta Quest 3 without an Internet connection
- Free and open source

## Project Details
The sample scene uses the Whisper model to transcribe an audio clip of JFK's famous line: "Ask not what your country can do for you – ask what you can do for your country." The transcription tests were run on the Meta Quest 3 headset to measure latency.

### Latency Comparison
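A latency figure of this kind could be collected with a small timing script. The following is a minimal sketch, not the code used for the tests in this repository. It assumes the `WhisperManager` component, its `GetTextAsync(AudioClip)` method, and the `Result` field as they appear in the upstream whisper.unity samples (names may differ between versions); `TranscriptionLatencyProbe` is a hypothetical class name.

```csharp
using System.Diagnostics;
using UnityEngine;
using Whisper; // namespace of the upstream whisper.unity package (assumed)

// Hypothetical helper: attach to a GameObject and assign both fields in the Inspector.
public class TranscriptionLatencyProbe : MonoBehaviour
{
    public WhisperManager manager; // the WhisperManager present in the scene
    public AudioClip clip;         // e.g. the JFK sample clip

    private async void Start()
    {
        var stopwatch = Stopwatch.StartNew();

        // Runs inference on the clip; a null result is treated here as a failure.
        var result = await manager.GetTextAsync(clip);
        stopwatch.Stop();

        if (result == null)
        {
            UnityEngine.Debug.LogWarning("Transcription failed.");
            return;
        }

        // Wall-clock latency, and seconds of audio processed per second of compute.
        double seconds = stopwatch.Elapsed.TotalSeconds;
        double realTimeFactor = clip.length / seconds;
        UnityEngine.Debug.Log($"Latency: {seconds * 1000.0:F0} ms ({realTimeFactor:F1}x real time)");
        UnityEngine.Debug.Log($"Text: {result.Result}");
    }
}
```

`UnityEngine.Debug` is fully qualified because `System.Diagnostics` also defines a `Debug` class. The first measurement may include model loading time, so timing a second call is likely to give a more representative number.
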
![Latency Comparison](latency_comparison.png)

## Getting Started
Clone this repository and open it as a regular Unity project. It comes with examples and a tiny multilingual model.

Alternatively, add the upstream whisper.unity package to your own project as a **Unity Package** using the following git URL:
https://github.com/Macoron/whisper.unity.git?path=/Packages/com.whisper.unity
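
The git URL can be pasted into the Unity Package Manager window ("Add package from git URL"), or added by hand to the project's `Packages/manifest.json`. The fragment below sketches the second route. The package name `com.whisper.unity` is an assumption inferred from the path in the URL, and any dependencies already listed in your manifest should be kept alongside it.

```json
{
  "dependencies": {
    "com.whisper.unity": "https://github.com/Macoron/whisper.unity.git?path=/Packages/com.whisper.unity"
  }
}
```

Unity resolves the package the next time the editor gains focus or the project is reopened.
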
### Downloading Other Model Weights
You can try different Whisper model weights to improve transcription quality. Download model weights from the [whisper.cpp model repository on Hugging Face](https://huggingface.co/ggerganov/whisper.cpp) and place them in your `StreamingAssets` folder.

## License
This project is licensed under the MIT License. It uses compiled libraries and model weights from [whisper.cpp](https://github.com/ggerganov/whisper.cpp), also licensed under MIT.