AVSpeech downloader
https://github.com/changil/avspeech-downloader

Host: GitHub
URL: https://github.com/changil/avspeech-downloader
Owner: changil
License: mit
Created: 2018-08-31T20:52:55.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2019-01-30T05:10:16.000Z (over 5 years ago)
Last Synced: 2024-05-22T05:34:35.361Z (4 months ago)
Language: Shell
Homepage:
Size: 6.84 KB
Stars: 65
Watchers: 1
Forks: 16
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# AVSpeech Downloader
A bash script to download the [AVSpeech](https://looking-to-listen.github.io/avspeech/) dataset.

## Usage
```bash
$ download.sh path-to-csv-file output-directory
```
You can stop the downloader at any time by pressing `Ctrl-C`. Subsequent runs detect files that have already been downloaded and do not download them again.
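
The resume behavior described above can be pictured as a simple "skip if the file already exists" check. The following is only a minimal sketch: the function names `clip_path` and `needs_download`, and the `<id>_<start>_<end>.mp4` naming scheme, are assumptions made for illustration and are not taken from `download.sh`.

```bash
#!/usr/bin/env bash
# Hypothetical resume check. The file naming scheme below is an assumption
# for illustration, not necessarily what download.sh uses.

# Build an output path from a video ID and segment bounds (assumed layout).
clip_path() {
  local outdir="$1" id="$2" start="$3" end="$4"
  printf '%s/%s_%s_%s.mp4\n' "$outdir" "$id" "$start" "$end"
}

# Succeed (exit status 0) if the clip is missing or empty,
# i.e. it still needs to be downloaded.
needs_download() {
  [ ! -s "$(clip_path "$@")" ]
}

# Example use inside a loop over CSV rows:
#   if needs_download data/train "$id" "$start" "$end"; then
#     ...download and cut the clip...
#   fi
```

Testing for a non-empty file (`-s`) rather than mere existence is a design choice in this sketch, so that a zero-byte file left behind by an interrupted run is retried.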

## Options
`download.sh` recognizes the following environment variables.
* `dryrun` set to any nonzero value makes the downloader print commands to the console without downloading any videos.
* `njobs` sets the number of concurrent downloader processes to run and defaults to the number of CPU cores.
* `faster` set to `1` makes the downloader try to download 720p or 360p videos instead of the highest-quality videos. Set to `2`, it additionally forces _FFmpeg_ to copy streams, which runs much faster because no transcoding is carried out. See the known issue below.
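
For readers unfamiliar with environment-variable options in shell scripts, the sketch below shows one common way such variables are read with defaults. It is an illustration under stated assumptions, not the contents of `download.sh`: the helper names `default_njobs` and `describe_mode` are hypothetical, and the defaults of `0` for `dryrun` and `faster` are assumed.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of reading the documented variables with defaults.
# This is not the actual implementation in download.sh.

# Number of CPU cores: nproc (GNU coreutils), with getconf as a fallback.
default_njobs() {
  nproc 2>/dev/null || getconf _NPROCESSORS_ONLN
}

dryrun="${dryrun:-0}"                   # assumed default: real downloads
njobs="${njobs:-$(default_njobs)}"      # documented default: CPU core count
faster="${faster:-0}"                   # assumed default: highest quality

# Summarize what each documented value of `faster` means.
describe_mode() {
  case "$1" in
    0) echo "highest quality, transcode" ;;
    1) echo "720p or 360p, transcode" ;;
    2) echo "720p or 360p, stream copy" ;;
    *) echo "unknown" ;;
  esac
}

echo "dryrun=$dryrun njobs=$njobs mode: $(describe_mode "$faster")"
```

Because the values come from the environment, they can be set for a single invocation by prefixing the command, as in `dryrun=1 download.sh path-to-csv-file output-directory`.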

## Required software packages
The following packages are required to run the downloader. It is important to have the latest versions of both _youtube-dl_ and _FFmpeg_.
* [youtube-dl](https://rg3.github.io/youtube-dl/)
* [FFmpeg](https://www.ffmpeg.org/)
* [GNU Parallel](https://www.gnu.org/software/parallel/)

## Examples
```bash
$ download.sh avspeech_train.csv data/train
$ njobs=12 faster=1 download.sh avspeech_train.csv data/train
```

## Known issue
_FFmpeg_ uses keyframe seeking when copying streams, which is what happens with `faster=2`. When a cut does not start on a keyframe, which is the case most of the time, _FFmpeg_ cuts the video at the closest preceding keyframe and sets a negative start time to compensate. Any downstream tool that takes the cut video clips as input should therefore take the start time into account. Most video players do, but if you process the clips programmatically, you will likely need to handle it yourself and discard the leading part of both the audio and video streams accordingly.
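
As a rough illustration of taking the start time into account, the sketch below reads a clip's start time with `ffprobe` and converts a negative value into the number of seconds to discard. It assumes the offset is visible as the container-level `start_time` that `ffprobe` reports, which is worth checking on a sample clip; the function names `clip_start_time` and `discard_seconds` are hypothetical.

```bash
#!/usr/bin/env bash
# Sketch: work out how much leading material to discard from a clip
# produced with faster=2. Function names are illustrative only.

# Print the container-level start time (in seconds) reported by ffprobe.
clip_start_time() {
  ffprobe -v error -show_entries format=start_time \
    -of default=noprint_wrappers=1:nokey=1 "$1"
}

# Given a start time, print the seconds to discard from the beginning:
# the absolute value if the start time is negative, otherwise 0.000.
discard_seconds() {
  awk -v t="$1" 'BEGIN {
    if (t + 0 < 0) printf "%.3f\n", -t
    else print "0.000"
  }'
}

# Example use (requires ffprobe and an existing clip):
#   start="$(clip_start_time clip.mp4)"
#   skip="$(discard_seconds "$start")"
```

The resulting value can then be passed to whatever trimming or frame-dropping step your own pipeline uses, applied to both the audio and the video stream.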

## Contact
Please contact [Changil Kim](mailto:[email protected]) if you have questions or comments.