Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/saurabhchalke/whisper-meta-quest

Running speech-to-text in a Meta Quest headset using OpenAI's Whisper tiny model
https://github.com/saurabhchalke/whisper-meta-quest

artificial-intelligence automatic-speech-recognition mixed-reality speech-to-text virtual-reality vr whisper

Last synced: 4 days ago
JSON representation

Running speech-to-text in a Meta Quest headset using OpenAI's Whisper tiny model

Awesome Lists containing this project

README

        

# whisper.unity on Meta Quest 3
[![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://opensource.org/licenses/MIT) [![whisper.cpp](https://img.shields.io/badge/whisper.cpp-v1.5.5-green)](https://github.com/ggerganov/whisper.cpp/releases/tag/v1.5.5)

This project integrates Unity3D bindings for [whisper.cpp](https://github.com/ggerganov/whisper.cpp) to run OpenAI's Whisper ASR model locally on Meta Quest 3. Based on the original [whisper.unity](https://github.com/macoron/whisper.unity) repository by @Macoron.

## Features
- High-performance inference of Whisper ASR model
- Supports around 60 languages
- Can translate between languages
- Runs entirely on Meta Quest 3 without Internet connection
- Free and open source

## Project Details
This project uses the Whisper model to transcribe a sample scene with JFK's audio file containing his famous line: "Ask not what your country can do for you – ask what you can do for your country." The transcription tests were conducted to measure latency on the Meta Quest 3 headset.

### Latency Comparison
![Latency Comparison](latency_comparison.png)

## Getting Started
Clone this repository and open it as a regular Unity project. It comes with examples and a tiny multilingual model.

Alternatively, add this repository to your project as a **Unity Package** using the following git URL:

https://github.com/Macoron/whisper.unity.git?path=/Packages/com.whisper.unity

### Downloading Other Model Weights
You can try different Whisper model weights to improve transcription quality. Download model weights from [here](https://huggingface.co/ggerganov/whisper.cpp) and place them in your `StreamingAssets` folder.

## License
This project is licensed under the MIT License. It uses compiled libraries and model weights from [whisper.cpp](https://github.com/ggerganov/whisper.cpp), also licensed under MIT.