Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/westonruter/esv-text-audio-aligner
ESV Text/Audio Aligner to programmatically obtain the timings for each word in the corresponding audio
https://github.com/westonruter/esv-text-audio-aligner
Last synced: 29 days ago
JSON representation
ESV Text/Audio Aligner to programmatically obtain the timings for each word in the corresponding audio
- Host: GitHub
- URL: https://github.com/westonruter/esv-text-audio-aligner
- Owner: westonruter
- Archived: true
- Created: 2011-08-31T21:51:38.000Z (over 13 years ago)
- Default Branch: master
- Last Pushed: 2013-03-29T19:16:53.000Z (over 11 years ago)
- Last Synced: 2024-08-03T04:08:03.817Z (4 months ago)
- Language: JavaScript
- Homepage:
- Size: 262 KB
- Stars: 91
- Watchers: 12
- Forks: 20
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- low-resource-languages - esv-text-audio-aligner - ESV Text/Audio Aligner to programmatically obtain the timings for each word in the corresponding audio. (Software / Utilities)
README
ESV Text/Audio Aligner
======================Pulls down the ESV text and audio from the ESV API, then aligns the text and
audio chapter-by-chapter by means of [CMU Sphinx](http://cmusphinx.sourceforge.net/)
and generates `{book}.{chapter}.timings.json` timing files in the `data/`
directory (see [output for John 3](https://github.com/westonruter/esv-text-audio-aligner/blob/master/data/John.3.timings.json)).
The audio and text for each chapter is also saved in this directory.
Previously-aligned chapters are skipped unless `--force`d. There is an
ongoing [discussion](http://sourceforge.net/projects/cmusphinx/forums/forum/382337/topic/4503550)
on the CMU Sphinx forums on the quality of the alignment data it provides.
__Author__: Weston Ruter (@westonruter)
__GitHub__: https://github.com/westonruter/esv-text-audio-aligner
__Dependencies__: Python 2.7, java, ant, [sox](http://sox.sourceforge.net/), svnThe ESV Text and MP3 data downloaded by this script is subject to [copyright](http://www.crossway.org/rights-permissions/esv/):
> The Holy Bible, English Standard Version copyright (c)2001 by Crossway Bibles, a
> publishing ministry of Good News Publishers. All rights reserved.ESV API usage terms (see [full conditions](http://www.esvapi.org/#conditions)):
> You can access the ESV text using the key "IP" (without the quotes). This
> key limits you to 5,000 queries per day from a single IP address. You are
> bound by the below conditions of use, including the non-commercial
> aspects.Background
----------This project was birthed out of a desire to realize the read-along app, where
the currently-spoken word in the audio is higlighted in the text on the screen.
I made an [HTML5 Audio Read-Along prototype](https://github.com/westonruter/html5-audio-read-along#readme)
(includes writeup) featuring a passage from Luke 2 of the ESV, but the actual audio timing
data for the Luke 2 sample I had to obtain manually (read: painstakingly). This
project aims to generate the audio word timings automatically so that the entire
Bible can be used in a read-along app.Usage
-----$ python align.py [-f|--force] [osisBook [[chapter], [chapter]...]]...
Examples
--------Align Genesis 1:
$ python align.py Gen 1
Align John 3, overridding any existing timings:
$ python align.py -f John 3
Align Genesis 1-3, Psalm 1, and all of Colossians:
$ python align.py Gen 1 2 3 Ps 1 Col
License
-------
Dual licensed under the MIT or GPL Version 2 licenses.
MIT License: http://creativecommons.org/licenses/MIT/
GPL 2.0 license: http://creativecommons.org/licenses/GPL/2.0/