https://github.com/skit-ai/speech-recognition
SDKs and docs for Skit's speech to text service
https://github.com/skit-ai/speech-recognition
asr multilingual-speech-recognition speech-recognition speech-recognition-api speech-to-text
Last synced: about 1 month ago
JSON representation
SDKs and docs for Skit's speech to text service
- Host: GitHub
- URL: https://github.com/skit-ai/speech-recognition
- Owner: skit-ai
- License: apache-2.0
- Created: 2020-02-07T07:32:57.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2023-07-05T21:00:23.000Z (almost 2 years ago)
- Last Synced: 2025-03-25T09:17:57.325Z (about 2 months ago)
- Topics: asr, multilingual-speech-recognition, speech-recognition, speech-recognition-api, speech-to-text
- Language: Python
- Homepage:
- Size: 447 KB
- Stars: 21
- Watchers: 4
- Forks: 5
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Speech-to-Text API
Converts audio to textWe support these ten indian languages ([language codes](https://github.com/Vernacular-ai/speech-recognition/blob/master/docs/types/RecognitionConfig.md#languagesupport)).
- Hindi
- English
- Marathi
- Kannada
- Malayalam
- Bengali
- Gujarati
- Punjabi
- Telugu
- Tamil## Authentication
~~To get access to our APIs reach out to us at [email protected]~~
We do not provide public access token for the APIs anymore.## Ways to use the Service
- Transcribing short audios [audios upto 1 min]
- Transcribing long audios [more than 1 min]
- Transcribing audio from streaming inputWe recommend that you call this service using Vernacular provided client libraries. If your application needs to call this service using your own libraries, you should use the HTTP Endpoints.
**Supported SDKs**: [Python](https://github.com/Vernacular-ai/speech-recognition/tree/master/python)
## REST Reference
**ServiceHost:** https://asr.vernacular.ai
### Speech Recognition
| Name | Description |
|--|--|
| [recognize](docs/api_reference/Recognize.md) | Performs synchronous speech recognition: receive results after all audio has been sent and processed. |
| [longrunningrecognize](docs/api_reference/LongRunningRecognize.md) | Performs asynchronous speech recognition. Generally used for long audios |## RPC Reference
### Speech Recognition
| Methods | Description |
|--|--|
|[Recognize](docs/rpc_reference/Recognize.md) | Performs synchronous speech recognition: receive results after all audio has been sent and processed.|
|[LongRunningRecognize](docs/rpc_reference/LongRunningRecognize.md) | Performs asynchronous speech recognition: receive results via the longrunning.Operations interface.|
|[StreamingRecognize](docs/rpc_reference/StreamingRecognize.md) |Performs streaming speech recognition: receive results while sending audio. Supports both unidirectional and bidirectional streaming.|