https://github.com/saharmor/realtime-transcription-playground
A real-time transcription project using React and socketio
- Host: GitHub
- URL: https://github.com/saharmor/realtime-transcription-playground
- Owner: saharmor
- License: mit
- Created: 2021-07-01T19:25:06.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-04-14T21:34:49.000Z (over 1 year ago)
- Last Synced: 2023-11-07T17:19:39.095Z (about 1 year ago)
- Topics: google-speech-recognition, microphone, react, speech2text, transcription
- Language: JavaScript
- Homepage:
- Size: 232 KB
- Stars: 136
- Watchers: 6
- Forks: 16
- Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Real-time Transcription Playground
A real-time transcription project using React and a Socket.IO Python server. The goal of this project is to let developers build web demos and speech-to-text prototypes with just a few lines of code. Examples include medical dictation apps, a note-taking CRM for entrepreneurs, etc.
*Currently only supports real-time transcription using Google Cloud Speech*
# Demo
https://user-images.githubusercontent.com/6180201/124362454-370e6600-dc35-11eb-8374-77da5aec25b2.mp4

# Installation
* Python 3 [instructions](https://realpython.com/installing-python/)
* `yarn` [instructions](https://classic.yarnpkg.com/en/docs/install/#mac-stable)

## Google Speech API
The code expects an environment variable `GOOGLE_SERVICE_JSON_FILE` that points to a valid GCP service account file.

If you need to get a service account:
- Within your Google Cloud console, create or select a project
- Enable the Cloud Speech API for that project
- Create a service account
- Download a private key as JSON

More info in Google Cloud's docs [here](https://cloud.google.com/speech-to-text/docs/quickstart-client-libraries#before-you-begin) and [here](https://codelabs.developers.google.com/codelabs/cloud-speech-text-python3#0).
Then, set the environment variable `GOOGLE_SERVICE_JSON_FILE` to the path of the JSON file containing your service account key, e.g. `/users/sahar/documents/sample-project-3c1a5892b00e.json`. Further details can be found in this [Medium article](https://medium.com/geekculture/how-to-build-a-full-stack-transcription-app-with-google-cloud-react-and-python-2dfdcb5e556f).
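Before starting the backend, it can help to confirm that the variable is set and points to something that looks like a service account key. The following is a minimal sketch, not part of this repository: the function name `check_service_account_file` and the list of required keys are assumptions made for illustration, based on the fields a downloaded service account key file normally contains.

```python
import json
import os

# Fields normally present in a downloaded service account key file.
# This set is an illustrative assumption, not something this project checks.
REQUIRED_KEYS = {"type", "project_id", "private_key", "client_email"}


def check_service_account_file(env_var="GOOGLE_SERVICE_JSON_FILE"):
    """Return the key file path if it looks usable, otherwise raise RuntimeError."""
    path = os.environ.get(env_var)
    if not path:
        raise RuntimeError(f"{env_var} is not set")
    if not os.path.isfile(path):
        raise RuntimeError(f"{env_var} points to a missing file: {path}")

    with open(path, encoding="utf-8") as f:
        try:
            data = json.load(f)
        except json.JSONDecodeError as exc:
            raise RuntimeError(f"{path} is not valid JSON") from exc

    if not isinstance(data, dict):
        raise RuntimeError(f"{path} does not contain a JSON object")

    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise RuntimeError(f"{path} is missing keys: {sorted(missing)}")
    if data["type"] != "service_account":
        raise RuntimeError(f"{path} is not a service account key")
    return path
```

Calling `check_service_account_file()` from a Python shell inside the virtual environment either returns the path or raises an error that names the problem. It only inspects the file locally and does not contact Google Cloud, so it cannot tell whether the key is still active or whether the Cloud Speech API is enabled for the project.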
# Setup
1. Clone or fork this repository
2. Create a virtual environment in the root directory: `python -m venv $ENV_NAME`
3. Activate the virtual environment: `source $ENV_NAME/bin/activate` (for macOS, Unix, or Linux users) or `.\ENV_NAME\Scripts\activate` (for Windows users, replacing `ENV_NAME` with the name of your environment)
4. Install requirements: `pip install -r backend/requirements.txt`
5. Set your environment variable `GOOGLE_SERVICE_JSON_FILE` to point to your file path
6. Run `yarn install` in the root directory
7. Run `yarn start` to start the frontend and `yarn start-backend` to run the backend