https://github.com/n0th1ng-else/voice-to-text-bot

Telegram bot that converts Voice messages into text
https://github.com/n0th1ng-else/voice-to-text-bot

bot nodejs speech-recognition speech-to-text telegram telegram-api telegram-bot typescript

Last synced: 3 months ago
JSON representation

Telegram bot that converts Voice messages into text

Host: GitHub
URL: https://github.com/n0th1ng-else/voice-to-text-bot
Owner: n0th1ng-else
License: mit
Created: 2020-05-19T14:07:29.000Z (about 5 years ago)
Default Branch: master
Last Pushed: 2025-04-06T17:36:08.000Z (3 months ago)
Last Synced: 2025-04-06T18:31:23.132Z (3 months ago)
Topics: bot, nodejs, speech-recognition, speech-to-text, telegram, telegram-api, telegram-bot, typescript
Language: TypeScript
Homepage: https://t.me/AudioMessBot
Size: 7.79 MB
Stars: 7
Watchers: 2
Forks: 3
Open Issues: 11
Metadata Files:
- Readme: README.md
- Changelog: HISTORY.md
- Funding: .github/FUNDING.yml
- License: LICENSE

Awesome Lists containing this project

README

        


    

        

    





    

        

             

        

    

   

        

             

        

    

    

        

             

        

    

    

        

             

        

    

    

        

             

        

    

    

        

            

        

    



# Telegram Bot Converts Voice Messages Into Text



  



This is the simple bot that converts voice into text.

I tried to review available public speech recognition services

and the results you can see below in the table.

# Tech Stack

- [Typescript](https://www.typescriptlang.org/)

- [Fastify](https://fastify.dev/)

- [Axios](https://axios-http.com/)

- [PostgreSQL](https://node-postgres.com/)

- Google Analytics

- [Amplitude](https://amplitude.com/)

# Service model

```mermaid

flowchart BT

  subgraph tg[Telegram]

    voice[Voice message]

    audio[Audio]

    video[Video note]

    text[Text message]

    bot[AudioMessBot API]

  end

  subgraph cluster[Replicas]

    r1{{Replica 1}}

    ar{{Active replica}}

    r2{{Replica N}}

  end

  voice-->bot

  audio-->bot

  video-->bot

  bot-->text

  bot---ar

  ar---db[(PSQL\nDatabase)]

  ar---cloud((Cloud API provider))

```

# Services overview

| Service provider                                                                                | Russian lang | Synchronous API | Duration limitation      | File upload     | Speed                   |

| ----------------------------------------------------------------------------------------------- | ------------ | --------------- | ------------------------ | --------------- | ----------------------- |

| [IBM Watson](https://www.ibm.com/cloud/watson-speech-to-text)                                   | no           | no              | N/A                      | Unknown         | Unknown                 |

| [Microsoft Azure](https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text) | no           | no              | N/A                      | Unknown         | Unknown                 |

| [Amazon AWS](https://aws.amazon.com/transcribe)                                                 | **yes**      | no              | Unlimited                | S3              | Minutes                 |

| [Google Cloud](https://cloud.google.com/speech-to-text)                                         | **yes**      | **yes**         | 1 minute^\*[1] | Direct / GDrive | Instant^\*[2] |

| [Wit.ai](https://wit.ai)                                                                        | **yes**      | **yes**         | 5 minutes                | Direct          | Instant                 |

- For direct upload

  


  ¹ Unlimited for asynchronous upload via Google Drive

  


  ² Takes a while for asynchronous upload via Google Drive

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/n0th1ng-else/voice-to-text-bot

Awesome Lists containing this project

README