Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/n0th1ng-else/voice-to-text-bot
Telegram bot that converts Voice messages into text
https://github.com/n0th1ng-else/voice-to-text-bot
bot nodejs speech-recognition speech-to-text telegram telegram-api telegram-bot typescript
Last synced: 23 days ago
JSON representation
Telegram bot that converts Voice messages into text
- Host: GitHub
- URL: https://github.com/n0th1ng-else/voice-to-text-bot
- Owner: n0th1ng-else
- License: mit
- Created: 2020-05-19T14:07:29.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2024-11-21T22:16:08.000Z (about 2 months ago)
- Last Synced: 2024-11-21T23:20:40.477Z (about 2 months ago)
- Topics: bot, nodejs, speech-recognition, speech-to-text, telegram, telegram-api, telegram-bot, typescript
- Language: TypeScript
- Homepage: https://t.me/AudioMessBot
- Size: 7.66 MB
- Stars: 7
- Watchers: 3
- Forks: 3
- Open Issues: 15
-
Metadata Files:
- Readme: README.md
- Changelog: HISTORY.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# Telegram Bot Converts Voice Messages Into Text
This is the simple bot that converts voice into text.
I tried to review available public speech recognition services
and the results you can see below in the table.# Tech Stack
- [Typescript](https://www.typescriptlang.org/)
- [Fastify](https://fastify.dev/)
- [Axios](https://axios-http.com/)
- [PostgreSQL](https://node-postgres.com/)
- Google Analytics
- [Amplitude](https://amplitude.com/)# Service model
```mermaid
flowchart BT
subgraph tg[Telegram]
voice[Voice message]
audio[Audio]
video[Video note]
text[Text message]
bot[AudioMessBot API]
end
subgraph cluster[Replicas]
r1{{Replica 1}}
ar{{Active replica}}
r2{{Replica N}}
end
voice-->bot
audio-->bot
video-->bot
bot-->text
bot---ar
ar---db[(PSQL\nDatabase)]
ar---cloud((Cloud API provider))
```# Services overview
| Service provider | Russian lang | Synchronous API | Duration limitation | File upload | Speed |
| ----------------------------------------------------------------------------------------------- | ------------ | --------------- | ------------------------ | --------------- | ----------------------- |
| [IBM Watson](https://www.ibm.com/cloud/watson-speech-to-text) | no | no | N/A | Unknown | Unknown |
| [Microsoft Azure](https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text) | no | no | N/A | Unknown | Unknown |
| [Amazon AWS](https://aws.amazon.com/transcribe) | **yes** | no | Unlimited | S3 | Minutes |
| [Google Cloud](https://cloud.google.com/speech-to-text) | **yes** | **yes** | 1 minute\*[1] | Direct / GDrive | Instant\*[2] |
| [Wit.ai](https://wit.ai) | **yes** | **yes** | 5 minutes | Direct | Instant |- For direct upload
1 Unlimited for asynchronous upload via Google Drive
2 Takes a while for asynchronous upload via Google Drive