Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/waikato-ufdl/wai-annotations-commonvoice
wai.annotations module for managing common voice audio annotations.
https://github.com/waikato-ufdl/wai-annotations-commonvoice
Last synced: about 2 months ago
JSON representation
wai.annotations module for managing common voice audio annotations.
- Host: GitHub
- URL: https://github.com/waikato-ufdl/wai-annotations-commonvoice
- Owner: waikato-ufdl
- License: apache-2.0
- Created: 2021-05-12T21:32:15.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-09-05T03:59:08.000Z (over 2 years ago)
- Last Synced: 2023-03-04T15:28:02.415Z (almost 2 years ago)
- Language: Python
- Homepage: https://ufdl.cms.waikato.ac.nz/wai-annotations-manual/
- Size: 15.6 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGES.rst
- License: LICENSE
Awesome Lists containing this project
README
# wai-annotations-commonvoice
wai.annotations module for managing common voice audio annotations.The manual is available here:
https://ufdl.cms.waikato.ac.nz/wai-annotations-manual/
## Plugins
### FROM-COMMON-VOICE-SP
Reads speech transcriptions in the Mozilla Common-Voice TSV-format#### Domain(s):
- **Speech Domain**#### Options:
```
usage: from-common-voice-sp [-I FILENAME] [-i FILENAME] [-N FILENAME] [-n FILENAME] [--seed SEED] [--rel-path REL_PATH]optional arguments:
-I FILENAME, --inputs-file FILENAME
Files containing lists of input files (can use glob syntax)
-i FILENAME, --input FILENAME
Input files (can use glob syntax)
-N FILENAME, --negatives-file FILENAME
Files containing lists of negative files (can use glob syntax)
-n FILENAME, --negative FILENAME
Files that have no annotations (can use glob syntax)
--seed SEED the seed to use for randomisation
--rel-path REL_PATH the relative path from the annotations file to the audio files
```### TO-COMMON-VOICE-SP
Writes speech transcriptions in the Mozilla Common-Voice TSV-format#### Domain(s):
- **Speech Domain**#### Options:
```
usage: to-common-voice-sp [--annotations-only] -o PATH [--split-names SPLIT NAME [SPLIT NAME ...]] [--split-ratios RATIO [RATIO ...]]optional arguments:
--annotations-only skip the writing of data files, outputting only the annotation files
-o PATH, --output PATH
the filename of the TSV file to write the annotations into
--split-names SPLIT NAME [SPLIT NAME ...]
the names to use for the splits
--split-ratios RATIO [RATIO ...]
the ratios to use for the splits
```