https://github.com/jech/galene-stt

# Speech-to-text support for Galene

Galene-stt is an experiment in real-time speech-to-text (automatic
subtitling) for the [Galene][1] videoconferencing server. Currently,
Galene-stt simply dumps a transcript of the videoconference into the chat;
if the experiment is successful, we will extend the Galene protocol with
the ability to display subtitles.

Galene-stt connects to a Galene server using the same protocol as ordinary
clients. Since speech recognition requires a fair amount of CPU, this
allows running Galene-stt on a powerful local machine without the risk of
overloading the server.

## Installation

Build and install whisper.cpp:

```
git clone https://github.com/ggerganov/whisper.cpp
cd whisper.cpp
cmake -Bbuild
cd build
make -j
sudo make install
cd ..
```

Download your favourite model:
```
./models/download-ggml-model.sh small
cd ..
```

Build galene-stt:
```
git clone https://github.com/jech/galene-stt
cd galene-stt
go build
```

Put the models where galene-stt will find them:
```
ln -s ../whisper.cpp/models .
```
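
Before starting galene-stt, it can be useful to confirm that the model file
is reachable through the `models` symlink. The following is a minimal
sketch, not part of galene-stt: `check_model` is a hypothetical helper, and
it assumes the `ggml-NAME.bin` file naming used by whisper.cpp's download
script.

```sh
# Hypothetical helper, not shipped with galene-stt.
# Reports whether models/ggml-NAME.bin exists relative to the current
# directory (the file naming is an assumption based on whisper.cpp's
# download script).
check_model() {
    model="models/ggml-$1.bin"
    if [ -f "$model" ]; then
        echo "found $model"
    else
        echo "missing $model" >&2
        return 1
    fi
}
```

Run from the galene-stt directory, `check_model small` should succeed once
the model has been downloaded and the symlink created.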

## Usage

```
./galene-stt https://galene.org:8443/group/public/stt
```

Galene-stt defaults to English; for other languages, use the `-lang` flag:
```
./galene-stt -lang fr https://galene.org:8443/group/public/stt
```

If galene-stt reports dropped audio, then your machine is not fast enough
for the selected model. Specify a smaller, faster model using the `-model`
command-line option:

```
./galene-stt -model models/ggml-tiny.bin \
https://galene.org:8443/group/public/stt
```
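
When switching between models and languages often, the command line can be
assembled by a small wrapper. This is only a sketch: `stt_command` is a
hypothetical function, it assumes that the `-model` and `-lang` flags can be
combined, and it assumes the `models/ggml-NAME.bin` file naming. It prints
the command rather than running it, so the result can be inspected first.

```sh
# Hypothetical wrapper, not part of galene-stt.
# Usage: stt_command URL MODEL [LANG]
# Prints a galene-stt command line using only the -model and -lang flags;
# combining both flags in one invocation is an assumption.
stt_command() {
    url=$1
    model=$2
    lang=$3
    cmd="./galene-stt -model models/ggml-$model.bin"
    # Omit -lang when no language is given, so the default applies.
    if [ -n "$lang" ]; then
        cmd="$cmd -lang $lang"
    fi
    echo "$cmd $url"
}
```

For example, `stt_command https://galene.org:8443/group/public/stt tiny fr`
prints a command that selects the tiny model and French transcription.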

— Juliusz Chroboczek

[1]: https://galene.org