Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/t0mer/tts-stt

Small pyhon flask container allowing us to convert Text to Speech and Speech to Text
https://github.com/t0mer/tts-stt

docker flask python speech speech-recognition speech-to-text tts

Last synced: 3 months ago
JSON representation

Small pyhon flask container allowing us to convert Text to Speech and Speech to Text

Host: GitHub
URL: https://github.com/t0mer/tts-stt
Owner: t0mer
License: apache-2.0
Created: 2021-03-26T15:23:10.000Z (almost 4 years ago)
Default Branch: main
Last Pushed: 2022-10-19T06:11:43.000Z (over 2 years ago)
Last Synced: 2024-10-02T08:06:59.658Z (4 months ago)
Topics: docker, flask, python, speech, speech-recognition, speech-to-text, tts
Language: Python
Homepage:
Size: 1.07 MB
Stars: 9
Watchers: 1
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # TTS-STT

## Text To Speech & Speech To Text

TTS-STT is a Python & [Flask](https://flask.palletsprojects.com/en/1.1.x/) powerd, easy to use system that hepls you to convert Text to Speech or Speech to Text using small web app.

## Features

- Detecting the language of the text.

- Selecting the TTS Voice.

- Converting Speech to Text (Up to 1 minute long)

## Components and Frameworks used in TTS-STT

* [Flask](https://flask.palletsprojects.com/en/1.1.x/)

* [Loguru](https://pypi.org/project/loguru/)

* [Google cloud speech](https://pypi.org/project/google-cloud-speech/)

* [Wavinfo ](https://pypi.org/project/wavinfo/)

* [Soundfile](https://pypi.org/project/SoundFile/)

* [Wave](https://pypi.org/project/Wave/)

* [Pydub](https://pypi.org/project/pydub/)

* [PyYAML](https://pypi.org/project/PyYAML/)

* [Google trans new](https://pypi.org/project/google-trans-new/)

* [Numpy](https://pypi.org/project/numpy/)

* [Yuval Mejahez](https://github.com/rt400) for creating [pyttsreverso](https://github.com/rt400/pyttsreverso). Be sure you are running version 0.3

 

The TTS (Text to Speech) feature is free thanks to [Reverso Translations](https://www.reverso.net),

But the Speech To Text feature requires active google api cloud account with enabled billing account (pricing table can be found [here](https://cloud.google.com/speech-to-text/pricing)).

## Installation

As i mentioned, in order to use Google Speech Recognition, we need to create Google Application and enable the API. Here are the steps you need to follow to integrate your program with the Google Speech-To-Text API.

### Step 1) Create a Google Application

The first thing you need to access Google APIs is a Google account and create a Google application. You can create a google application using the google console: [Go to google console](https://console.cloud.google.com/).

Once you open the google console, click on the dropdown at the top. This dropdown is displaying your existing google application. After clicking, a pop up will appear, then click on “New Project.”

[![Google Application](https://github.com/t0mer/tts-stt/blob/main/screenshots/google%20applications%20dashboard.png?raw=true "Google Application")](https://github.com/t0mer/tts-stt/blob/main/screenshots/google%20applications%20dashboard.png?raw=true "Google Application")

[![New Application](https://github.com/t0mer/tts-stt/blob/main/screenshots/new%20project.png?raw=true "New Application")](https://github.com/t0mer/tts-stt/blob/main/screenshots/new%20project.png?raw=true "New Application")

Then enter your application name and click on Create.

### Step 2) Enable Cloud Speech-To-Text API

Once you have created your google application, you need to grant your application access to the “Google Cloud Speech-To-Text” API. To do so, go to the application dashboard and from there, go to the APIs overview. See below how to access:

[![APIs overview](https://github.com/t0mer/tts-stt/blob/main/screenshots/apis%20overview.png?raw=true "APIs overview")](https://github.com/t0mer/tts-stt/blob/main/screenshots/apis%20overview.png?raw=true "APIs overview")

Click on “Enable Apis and Service,” and then search by “speech,” then all Google APIs to do with text will be listed.

[![Enable Apis and Service](https://github.com/t0mer/tts-stt/blob/main/screenshots/enable%20api%20and%20services.png?raw=true "Enable Apis and Service")](https://github.com/t0mer/tts-stt/blob/main/screenshots/enable%20api%20and%20services.png?raw=true "Enable Apis and Service")

[![Enable STT](https://github.com/t0mer/tts-stt/blob/main/screenshots/enable%20stt%20service.png?raw=true "Enable STT")](https://github.com/t0mer/tts-stt/blob/main/screenshots/enable%20stt%20service.png?raw=true "Enable STT")

And then click “Enable.” Once enabled, you will grant permissions to your application to access the “Google Cloud Speech to Text API.”

### Step 3) Download Google Credentials

The next step is Downloading your Google credentials. The credentials are necessary so Google can authenticate your application, and therefore Google knows that their API is being accessed by you. This way, they can measure how much you are using their APIs and charge you if the consumption passes the free threshold.

Here are the steps to download the google credentials. First, from the home dashboard, got to “Go to APIs overview,” just like before, and on the left-hand side menu, click on credentials.

[![Credentials](https://github.com/t0mer/tts-stt/blob/main/screenshots/credentials.png?raw=true "Credentials")](https://github.com/t0mer/tts-stt/blob/main/screenshots/credentials.png?raw=true "Credentials")

Then click on “Create Credentials” and create a “Service Account.”

[![Service Account](https://github.com/t0mer/tts-stt/blob/main/screenshots/Service%20Account.png?raw=true "Service Account")](https://github.com/t0mer/tts-stt/blob/main/screenshots/Service%20Account.png?raw=true "Service Account")

Enter any service account name you like, and click Create.

Optional, you can grant service account access to the project, and click Done.

[![Grant Access](https://github.com/t0mer/tts-stt/blob/main/screenshots/Grant%20Access.png?raw=true "Grant Access")](https://github.com/t0mer/tts-stt/blob/main/screenshots/Grant%20Access.png?raw=true "Grant Access")

Now click on the service account you just created. The last click will take you to the service account details.

[![service account details](https://github.com/t0mer/tts-stt/blob/main/screenshots/Service%20Accounts.png?raw=true "service account details")](https://github.com/t0mer/tts-stt/blob/main/screenshots/Service%20Accounts.png?raw=true "service account details")

Go to the “Keys” section and click on “Add Key” and “Create New Key,” which will create a new key. This key is associated with your application through the service account.

[![Add Key](https://github.com/t0mer/tts-stt/blob/main/screenshots/add%20key.png?raw=true "Add Key")](https://github.com/t0mer/tts-stt/blob/main/screenshots/add%20key.png?raw=true "Add Key")

In the pop-up, select JSON and click on Create, which will download a JSON file containing the key to your machine. Please make a note of where you save this file since you will need it next.

[![Json File](https://github.com/t0mer/tts-stt/blob/main/screenshots/Key%20type.png?raw=true "Json File")](https://github.com/t0mer/tts-stt/blob/main/screenshots/Key%20type.png?raw=true "Json File")

### Step 4) Installing the container0

#### docker-compose from hub

```yaml

version: "3.7"

services:

  tts-stt:

    image: techblog/tts-stt:latest

    ports:

      - "8080:8080"

    container_name: tts-stt

    labels:

      - "com.ouroboros.enable=true"

    networks:

      - default

    volumes:

      - ./ttstt/keys/key-file.json:/opt/ttstt/keys/key-file.json

      - /etc/localtime:/etc/localtime:ro

    restart: unless-stopped

```

"key-file.json" name is mandatory (you can't change it), this is the key file you have created and downloaded in step 3.

Now, run ```docker-copmose up -d``` to pull and run your container.

Open your browser and nevigate to your container ip address wieh port 8080, you should see the following screen.

[![TTS](https://github.com/t0mer/tts-stt/blob/main/screenshots/tts-stt.PNG?raw=true "TTS")](https://github.com/t0mer/tts-stt/blob/main/screenshots/tts-stt.PNG?raw=true "TTS")