https://github.com/skit-ai/speech-recognition

SDKs and docs for Skit's speech to text service
https://github.com/skit-ai/speech-recognition

asr multilingual-speech-recognition speech-recognition speech-recognition-api speech-to-text

Last synced: about 1 month ago
JSON representation

SDKs and docs for Skit's speech to text service

Host: GitHub
URL: https://github.com/skit-ai/speech-recognition
Owner: skit-ai
License: apache-2.0
Created: 2020-02-07T07:32:57.000Z (over 5 years ago)
Default Branch: master
Last Pushed: 2023-07-05T21:00:23.000Z (almost 2 years ago)
Last Synced: 2025-03-25T09:17:57.325Z (about 2 months ago)
Topics: asr, multilingual-speech-recognition, speech-recognition, speech-recognition-api, speech-to-text
Language: Python
Homepage:
Size: 447 KB
Stars: 21
Watchers: 4
Forks: 5
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # Speech-to-Text API

Converts audio to text

We support these ten indian languages ([language codes](https://github.com/Vernacular-ai/speech-recognition/blob/master/docs/types/RecognitionConfig.md#languagesupport)).

- Hindi

- English

- Marathi

- Kannada

- Malayalam

- Bengali

- Gujarati

- Punjabi

- Telugu

- Tamil

## Authentication

~~To get access to our APIs reach out to us at [email protected]~~

We do not provide public access token for the APIs anymore.

## Ways to use the Service

- Transcribing short audios [audios upto 1 min]

- Transcribing long audios [more than 1 min]

- Transcribing audio from streaming input

We recommend that you call this service using Vernacular provided client libraries. If your application needs to call this service using your own libraries, you should use the HTTP Endpoints.

**Supported SDKs**: [Python](https://github.com/Vernacular-ai/speech-recognition/tree/master/python)

## REST Reference

**ServiceHost:** https://asr.vernacular.ai

### Speech Recognition

| Name | Description |

|--|--|

| [recognize](docs/api_reference/Recognize.md) | Performs synchronous speech recognition: receive results after all audio has been sent and processed. |

| [longrunningrecognize](docs/api_reference/LongRunningRecognize.md) | Performs asynchronous speech recognition. Generally used for long audios |

## RPC Reference

### Speech Recognition

| Methods | Description |

|--|--|

|[Recognize](docs/rpc_reference/Recognize.md) | Performs synchronous speech recognition: receive results after all audio has been sent and processed.|

|[LongRunningRecognize](docs/rpc_reference/LongRunningRecognize.md) | Performs asynchronous speech recognition: receive results via the longrunning.Operations interface.|

|[StreamingRecognize](docs/rpc_reference/StreamingRecognize.md)	|Performs streaming speech recognition: receive results while sending audio. Supports both unidirectional and bidirectional streaming.|

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/skit-ai/speech-recognition

Awesome Lists containing this project

README