Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/zackees/pydeepspeech
Easy setup for Mozillas Deepspeech transcriber
https://github.com/zackees/pydeepspeech
Last synced: about 1 month ago
JSON representation
Easy setup for Mozillas Deepspeech transcriber
- Host: GitHub
- URL: https://github.com/zackees/pydeepspeech
- Owner: zackees
- License: mit
- Created: 2021-05-08T01:44:30.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-02-06T17:09:49.000Z (11 months ago)
- Last Synced: 2024-08-09T13:13:40.458Z (5 months ago)
- Language: Python
- Size: 7.4 MB
- Stars: 6
- Watchers: 1
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[![Actions Status](https://github.com/zackees/pydeepspeech/workflows/MacOS_Tests/badge.svg)](https://github.com/zackees/pydeepspeech/actions/workflows/push_macos.yml)
[![Actions Status](https://github.com/zackees/pydeepspeech/workflows/Win_Tests/badge.svg)](https://github.com/zackees/pydeepspeech/actions/workflows/push_win.yml)
[![Actions Status](https://github.com/zackees/pydeepspeech/workflows/Ubuntu_Tests/badge.svg)](https://github.com/zackees/pydeepspeech/actions/workflows/push_ubuntu.yml)# pydeepspeech
* The simpliest way to use AI to generate transcriptions from a wav file.
* This project uses the Mozilla DeepSpeech engine built from the included demo:
* https://github.com/mozilla/DeepSpeech-examples/tree/r0.9/vad_transcriber## Why you need this
Mozilla's deep speech can't process long voice samples. `pydeepspeech` fixes this by "chunking" the input sound into seperate wav files that are then individualy processed. Wav files are cut along periods of detected silence, controlled by the `aggressive` parameter.
Besides this, `pydeepspeech` is probably better to use anyway because it's *much* simpler to install than Mozilla's Deepspeech because the required data models needed for `pydeepspeech` are automatically downloaded and installed on first use.
# Quick start
Console api:
```
$ pip install pydeepspeech
$ pydeepspeech --wav_file --aggressive 1 --out_file
```-or-
```
$ pip install pydeepspeech
$ pydeepspeech --wav_file --out_file --model_dir
```-or-
```
$ pip install pydeepspeech
$ pydeepspeech_installmodels --pbmm --scorer
$ pydeepspeech --wav_file --out_file
```Or in python
```
from pydeepspeech.transcribe import transcribe
transcribe(...)
```## Optional: Create a virtual python package
Download and install virtual env:
```
# Download
curl -X GET https://raw.githubusercontent.com/zackees/make_venv/main/make_venv.py -o make_env.py
python make_env.py # Make the environment
source activate.sh # Enter environment
$ pip install pydeepspeech
```
To get back into the environment execute `source activate.sh` (if windows, you must be using git-bash)# Testing
Testing and linting is very simple. Just run `tox` ([link](https://tox.wiki/en/latest/install.html)).
```
$ pip install tox
$ tox
```