An open API service indexing awesome lists of open source software.

https://github.com/luser/auto-caption

Produce captions for videos using PocketSphinx speech recognition
https://github.com/luser/auto-caption

Last synced: about 1 year ago
JSON representation

Produce captions for videos using PocketSphinx speech recognition

Awesome Lists containing this project

README

          

This project contains two scripts that use [PocketSphinx](https://cmusphinx.sourceforge.net/) to produce captions
from a video file using speech recognition. The dependencies are a bit tricky,
a Dockerfile is provided to produce a working environment. Specifically,
the script currently relies on [an unlanded patch](https://github.com/luser/pocketsphinx/commit/18f6755caafd04726e76569aa7daa7c6211ea05e) to the PocketSphinx
Gstreamer plugin.

`caption.py` takes a media file and generates a caption file. You can test
this script with the pre-built docker image `luser/auto-caption:0.2`, for example:
```
docker run -t luser/auto-caption:0.1 ./run.sh https://people.mozilla.org/~tmielczarek/test-long.wav
```

Will produce captions on stdout.

`adapt-from-captions.py` takes a media file, a manually corrected captions
file, and a PocketSphinx acoustic model, and adapts the model by feeding it
the matched input audio and corrected text. It will output
`updated-model.tar.gz` in the working directory if it succeeds.

Any copyright is dedicated to the Public Domain.
http://creativecommons.org/publicdomain/zero/1.0/