https://github.com/toLSC/tolsc-speech-to-text

Speech to text service for toLSC app implemented with OpenAI Whisper model
https://github.com/toLSC/tolsc-speech-to-text

fastapi python speech-recognition speech-to-text tts whisper

Last synced: over 1 year ago
JSON representation

Speech to text service for toLSC app implemented with OpenAI Whisper model

Host: GitHub
URL: https://github.com/toLSC/tolsc-speech-to-text
Owner: toLSC
Created: 2023-04-22T18:44:30.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2023-05-19T03:51:53.000Z (about 3 years ago)
Last Synced: 2024-10-24T10:07:45.465Z (over 1 year ago)
Topics: fastapi, python, speech-recognition, speech-to-text, tts, whisper
Language: Python
Homepage:
Size: 5.86 KB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# toLSC Speech To Text API 🎙️📝

Este repositorio contiene la aplicación de Speech to Text desarrollada en FastAPI con el modelo Whisper. La aplicación convierte el habla en texto utilizando técnicas de procesamiento de voz.

## Instrucciones de Uso

1. Construye la imagen de Docker ejecutando el siguiente comando:

```sh
docker build -t tolsc-s2t
```

2. Inicia el contenedor de Docker con la imagen creada:

```sh
docker run -dp8000:8000 tolsc-s2t
```

3. Realiza las peticiones a través de la API utilizando multipart/formdata con método POST con la llave "audio_file" de la siguiente manera:

```sh
http://your.deploy.ip:8000/speech-to-text
```

También puedes realizar la petición utilizando el siguiente comando de cURL:

```sh
curl -X POST -F "audio_file=@audio.m4a" http://your.deploy.ip:8000/speech-to-text
```

Ten en cuenta que la API solo acepta archivos de audio en formato `.m4a`.

## Colaboradores

- Santiago Fernández (sa.fernandez@javeriana.edu.co)
- Fabian Olarte (olarte_fabian@javeriana.edu.co)
- Mateo Rosero (roseroq-j@javeriana.edu.co)
- Andrés Vásquez (af.vasquezr@javeriana.edu.co)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/toLSC/tolsc-speech-to-text

Awesome Lists containing this project

README