https://github.com/mdtanrikulu/lora-voice
NCLM (Neural Codec Language Model) based voice communication over LoRa
- Host: GitHub
- URL: https://github.com/mdtanrikulu/lora-voice
- Owner: mdtanrikulu
- Created: 2024-01-07T17:04:27.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-02-03T18:50:04.000Z (9 months ago)
- Last Synced: 2024-04-14T10:52:41.893Z (7 months ago)
- Language: Python
- Size: 824 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# NCLM (Neural Codec Language Model) based voice communication over LoRa
This application enables voice communication over the LoRa protocol in three steps: it converts speech to text with the [Faster-Whisper](https://github.com/SYSTRAN/faster-whisper) STT, transmits the text via LoRa, and then reconstructs the voice on the recipient's end with the VALL-E X NCLM. Because only text crosses the radio link, the payload stays small enough for LoRa's low data rate, which makes clear voice transmission practical over long distances.
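Since the transcript is what travels over the radio, a long utterance may not fit into a single LoRa packet. The following is a minimal sketch of one way to split a transcript into small packets and rebuild it on the receiving side. It is not taken from this repository: the function names, the 3-byte header layout, and the 200-byte payload budget are all assumptions made for illustration, so check the actual limit of your radio and firmware.

```python
# Hypothetical sketch: chunking a transcript for small radio payloads.
# Assumption: a conservative per-packet budget in bytes (not a value from the project).
MAX_PAYLOAD = 200
# Assumption: a 3-byte header of message id, chunk index, chunk count.
HEADER_SIZE = 3


def split_utf8(text: str, limit: int) -> list[bytes]:
    """Split text into UTF-8 chunks of at most `limit` bytes without cutting a character."""
    if limit < 4:
        raise ValueError("limit must hold at least one 4-byte UTF-8 character")
    chunks: list[bytes] = []
    current = b""
    for ch in text:
        encoded = ch.encode("utf-8")
        # Start a new chunk when the next character would exceed the limit.
        if current and len(current) + len(encoded) > limit:
            chunks.append(current)
            current = b""
        current += encoded
    if current:
        chunks.append(current)
    return chunks


def packetize(text: str, msg_id: int, max_payload: int = MAX_PAYLOAD) -> list[bytes]:
    """Prefix each chunk with the header so the receiver can reorder the chunks."""
    body = split_utf8(text, max_payload - HEADER_SIZE)
    if len(body) > 255:
        raise ValueError("transcript too long for a one-byte chunk count")
    return [bytes([msg_id % 256, i, len(body)]) + part for i, part in enumerate(body)]


def reassemble(packets: list[bytes]) -> str:
    """Rebuild the transcript from packets that may arrive out of order."""
    if not packets:
        return ""
    total = packets[0][2]
    parts = {p[1]: p[HEADER_SIZE:] for p in packets}
    if len(parts) != total:
        raise ValueError("missing chunks")
    return b"".join(parts[i] for i in range(total)).decode("utf-8")
```

Splitting on character boundaries rather than raw byte offsets keeps every packet decodable on its own, which matters for transcripts in languages that use multi-byte UTF-8 characters.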
## Installation
To install the dependencies, run:
```
pip install -r requirements.txt
```
Note: under `venv`, meshtastic may need extra PATH configuration.
## Usage
To start real-time transcription (translation is coming soon), run the `main.py` script with the necessary arguments. For command usage and argument options, refer to the example provided within the file.
The application provides several options for tailoring the transcription process. You can select the model, set the target language for translation, and choose microphone settings, among other configurations. To explore and adjust these options, review and modify the arguments in the `main.py` script.
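As a rough illustration of how options like these are commonly exposed in a Python script, here is a minimal `argparse` sketch. The flag names and defaults below are hypothetical and are not taken from the repository's `main.py`; consult the file itself for the real arguments.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build an illustrative parser; every flag here is a hypothetical example."""
    parser = argparse.ArgumentParser(
        description="Speech-to-text sender (illustrative sketch)"
    )
    # Hypothetical flag: which speech-to-text model to load.
    parser.add_argument("--model", default="small", help="model size or path")
    # Hypothetical flag: target language code for translation, if any.
    parser.add_argument("--language", default=None, help="target language code")
    # Hypothetical flag: microphone device index; None means the system default.
    parser.add_argument("--mic-index", type=int, default=None, help="microphone device index")
    return parser
```

With a parser like this, `build_parser().parse_args(["--model", "medium", "--mic-index", "2"])` would yield a namespace with `model="medium"`, `mic_index=2`, and `language=None`.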