https://github.com/idiap/iss-dicts
ISS scripts for handling pronunciation dictionaries
https://github.com/idiap/iss-dicts
Last synced: about 1 year ago
JSON representation
ISS scripts for handling pronunciation dictionaries
- Host: GitHub
- URL: https://github.com/idiap/iss-dicts
- Owner: idiap
- License: bsd-3-clause
- Created: 2013-07-11T14:04:00.000Z (almost 13 years ago)
- Default Branch: master
- Last Pushed: 2017-07-04T12:40:51.000Z (almost 9 years ago)
- Last Synced: 2025-03-23T01:03:24.985Z (about 1 year ago)
- Language: Python
- Size: 23.4 KB
- Stars: 6
- Watchers: 8
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: COPYING
Awesome Lists containing this project
README
ISS scripts for dictionary maintenance
======================================
These scripts are sufficient to convert the distributed forms of
dictionaries into forms useful for our tools (notably HTK and ISS).
Once a dictionary is in a standard form, the generic tools in ISS can
be used to manipulate it further.
Instructions:
1. Run `./CreateLinks.sh`
- This will link the media directories in each directory
Then for each dictionary that you need, `cd ` and:
2. Run `./CreateDicts.sh`
- Converts the native dictionary format to something more standard
3. Run `./CreatePSaurus.sh`
- This will run phonitisaurus to generate an FST for the dictionary
Note that phonetisaurus FST creation (rather, the alignment stage) can
use up a lot of memory, so it may be necessary to run it in the grid.
[Phil Garner](http://www.idiap.ch/~pgarner), March 2013