https://github.com/rhasspy/phonetisaurus-pypi

Python wrapper for phonetisaurus grapheme to phoneme tool
https://github.com/rhasspy/phonetisaurus-pypi

Last synced: 3 months ago
JSON representation

Python wrapper for phonetisaurus grapheme to phoneme tool

Host: GitHub
URL: https://github.com/rhasspy/phonetisaurus-pypi
Owner: rhasspy
License: mit
Created: 2020-10-06T18:19:39.000Z (almost 5 years ago)
Default Branch: master
Last Pushed: 2021-03-11T15:40:41.000Z (over 4 years ago)
Last Synced: 2025-03-28T08:11:55.923Z (3 months ago)
Language: Python
Size: 37.1 MB
Stars: 12
Watchers: 2
Forks: 2
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Phonetisaurus for Python

Python wrapper for the excellent [phonetisaurus](https://github.com/AdolfVonKleist/Phonetisaurus) grapheme to phoneme tool ([license](https://github.com/AdolfVonKleist/Phonetisaurus/blob/master/LICENSE)).

Includes pre-built binaries for:

* `x86_64` - desktop/laptop/server (64-bit)
* `armv6l` - Raspberry Pi 0/1
* `armv7l` - Raspberry Pi 2/3/4 (32-bit)
* `aarch64` - Raspberry Pi 3/4 (64-bit)

## Requirements

* Python 3.7+
* Linux
* Tested with Debian Buster

## Installing

For `x86_64` systems:

```sh
$ pip install phonetisaurus
```

For Raspberry Pi, see [Releases](https://github.com/rhasspy/phonetisaurus-pypi/releases) for compatible wheels:

* Raspberry Pi 0/1
* `phonetisaurus--py3-none-linux_armv6l.whl`
* Raspberry Pi 2/3/4 (32-bit)
* `phonetisaurus--py3-none-linux_armv7l.whl`
* Raspberry Pi 3/4 (64-bit)
* `phonetisaurus--py3-none-linux_aarch64.whl`

## Training

Assuming you have a lexicon formatted like [the CMU pronouncing dictionary](https://github.com/cmusphinx/cmudict):

```
word1 phoneme1 phoneme2 ...
word2 phoneme1 phoneme2 phoneme3 ...
```

saved to `lexicon.dict` run:

```sh
$ phonetisaurus train --model /path/to/write/g2p.fst /path/to/lexicon.dict
```

You may supply more than one lexicon.

See `phonetisaurus train --help` for more options.

## Predicting

```sh
$ phonetisaurus predict --model /path/to/g2p.fst word1 word2 ...
```

If no words are provided on the command line, they will be read line-by-line from standard in.

You may optionally supply one or more `--lexicon /path/to/lexicon.dict` arguments to avoid guessing pronunciations for known words.

See `phonetisaurus predict --help` for more options.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/rhasspy/phonetisaurus-pypi

Awesome Lists containing this project

README