An open API service indexing awesome lists of open source software.

https://github.com/skit-ai/speech-recognition

SDKs and docs for Skit's speech to text service
https://github.com/skit-ai/speech-recognition

asr multilingual-speech-recognition speech-recognition speech-recognition-api speech-to-text

Last synced: about 1 month ago
JSON representation

SDKs and docs for Skit's speech to text service

Awesome Lists containing this project

README

        

# Speech-to-Text API
Converts audio to text

We support these ten indian languages ([language codes](https://github.com/Vernacular-ai/speech-recognition/blob/master/docs/types/RecognitionConfig.md#languagesupport)).
- Hindi
- English
- Marathi
- Kannada
- Malayalam
- Bengali
- Gujarati
- Punjabi
- Telugu
- Tamil

## Authentication
~~To get access to our APIs reach out to us at [email protected]~~
We do not provide public access token for the APIs anymore.

## Ways to use the Service
- Transcribing short audios [audios upto 1 min]
- Transcribing long audios [more than 1 min]
- Transcribing audio from streaming input

We recommend that you call this service using Vernacular provided client libraries. If your application needs to call this service using your own libraries, you should use the HTTP Endpoints.

**Supported SDKs**: [Python](https://github.com/Vernacular-ai/speech-recognition/tree/master/python)

## REST Reference

**ServiceHost:** https://asr.vernacular.ai

### Speech Recognition
| Name | Description |
|--|--|
| [recognize](docs/api_reference/Recognize.md) | Performs synchronous speech recognition: receive results after all audio has been sent and processed. |
| [longrunningrecognize](docs/api_reference/LongRunningRecognize.md) | Performs asynchronous speech recognition. Generally used for long audios |

## RPC Reference

### Speech Recognition
| Methods | Description |
|--|--|
|[Recognize](docs/rpc_reference/Recognize.md) | Performs synchronous speech recognition: receive results after all audio has been sent and processed.|
|[LongRunningRecognize](docs/rpc_reference/LongRunningRecognize.md) | Performs asynchronous speech recognition: receive results via the longrunning.Operations interface.|
|[StreamingRecognize](docs/rpc_reference/StreamingRecognize.md) |Performs streaming speech recognition: receive results while sending audio. Supports both unidirectional and bidirectional streaming.|