https://github.com/meganetaaan/simple-stt-server

A simple speech-to-text server that uses VOSK to recognize speech and send the recognized text over WebSocket
nodejs speech-recognition speech-to-text vosk

Host: GitHub
URL: https://github.com/meganetaaan/simple-stt-server
Owner: meganetaaan
Created: 2022-05-13T15:29:29.000Z (over 2 years ago)
Default Branch: master
Last Pushed: 2024-06-05T09:24:14.000Z (7 months ago)
Last Synced: 2024-10-11T13:42:30.284Z (3 months ago)
Topics: nodejs, speech-recognition, speech-to-text, vosk
Language: HTML
Homepage:
Size: 42 KB
Stars: 2
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# Simple VOSK WebSocket Server

[日本語](./README_ja.md) | English

This project is a Node.js server that uses VOSK to recognize speech and send the recognized text over WebSocket.

## Installation

1. Clone this repository.
2. Navigate to the project root directory.
3. Run `npm install` to install dependencies.
4. Download and extract [VOSK model files](https://alphacephei.com/vosk/models) and put them in the `model` directory. The directory structure should look like the following:

```
[sskw]$ ls -l model
total 24
-rw-r--r-- 1 sskw sskw  898 Jul  9  2022 README
drwxr-xr-x 2 sskw sskw 4096 Jul  9  2022 am
drwxr-xr-x 2 sskw sskw 4096 Jul  9  2022 conf
drwxr-xr-x 3 sskw sskw 4096 Jul  9  2022 graph
drwxr-xr-x 2 sskw sskw 4096 Jul  9  2022 ivector
drwxr-xr-x 2 sskw sskw 4096 Jul  9  2022 rescore
```
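
To confirm that the model was extracted into the right place, a quick check along the following lines may help. This is a sketch, not part of the project: the helper name `missingModelDirs` is hypothetical, and the expected sub-directories are simply copied from the listing above, so other VOSK models may legitimately ship a different set.

```javascript
const fs = require("fs");
const path = require("path");

// Sub-directories taken from the example listing; other models may differ.
const EXPECTED_DIRS = ["am", "conf", "graph", "ivector", "rescore"];

// Return the expected sub-directories that are absent from modelDir.
function missingModelDirs(modelDir, expected = EXPECTED_DIRS) {
  return expected.filter((name) => {
    const full = path.join(modelDir, name);
    return !(fs.existsSync(full) && fs.statSync(full).isDirectory());
  });
}

// Example: run from the project root.
// console.log(missingModelDirs("model"));
```

An empty array means every directory from the listing is present.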

## Launch

1. Navigate to the project root directory.
2. Run `npm start` to start the server.

## Usage

Once started, the server listens for WebSocket connections on port 8080. When a client connects, the server sends it the recognized text.
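
A client for the behavior described above could look roughly like this. It is a minimal sketch under two assumptions that the README does not confirm: the server is reachable at `ws://localhost:8080`, and each message carries the recognized text as plain text. The names `payloadToText` and `listen` are illustrative only.

```javascript
// Assumed address: the server running locally on the documented port.
const SERVER_URL = "ws://localhost:8080";

// Normalize a WebSocket payload (string, Buffer, typed array, ArrayBuffer) to text.
function payloadToText(data) {
  if (typeof data === "string") return data;
  if (data instanceof ArrayBuffer || ArrayBuffer.isView(data)) {
    return new TextDecoder().decode(data);
  }
  return String(data);
}

// Connect to url and pass each received text to onText.
// WebSocketImpl is injectable: the browser WebSocket, Node's global
// WebSocket (available by default in recent Node versions), or any
// class with a compatible onmessage interface.
function listen(url, onText, WebSocketImpl = globalThis.WebSocket) {
  const socket = new WebSocketImpl(url);
  socket.onmessage = (event) => onText(payloadToText(event.data));
  socket.onerror = (event) => console.error("WebSocket error", event);
  return socket;
}

// Example:
// listen(SERVER_URL, (text) => console.log("recognized:", text));
```

If the server turns out to send JSON rather than plain text, parse the string inside the `onText` callback.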

To pause sending recognition results, press the space key. Press it again to resume.
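
For readers curious how a space-key toggle like this can be built in Node.js, here is one possible sketch. It is not taken from the project's source: `createGate` and `bindSpaceKey` are hypothetical names, and the real server may implement pausing differently.

```javascript
const readline = require("readline");

// A small gate: toggle() flips the paused flag and returns the new state;
// send() forwards text only while the gate is not paused.
function createGate(forward) {
  let paused = false;
  return {
    toggle() {
      paused = !paused;
      return paused;
    },
    send(text) {
      if (!paused) forward(text);
    },
  };
}

// Toggle the gate whenever the space key is pressed in the terminal.
function bindSpaceKey(gate, input = process.stdin) {
  readline.emitKeypressEvents(input);
  if (input.isTTY) input.setRawMode(true); // deliver keys without waiting for Enter
  input.on("keypress", (str, key) => {
    if (key && key.name === "space") gate.toggle();
    // Raw mode disables the default Ctrl+C handling, so exit explicitly.
    if (key && key.ctrl && key.name === "c") process.exit();
  });
}

// Example:
// const gate = createGate((text) => console.log(text));
// bindSpaceKey(gate);
```

Keeping the paused flag in a gate between the recognizer and the socket means recognition can keep running while results are withheld.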

## License

MIT License