https://github.com/paralogical/glish
map all words to single-syllable version
https://github.com/paralogical/glish
Last synced: 20 days ago
JSON representation
map all words to single-syllable version
- Host: GitHub
- URL: https://github.com/paralogical/glish
- Owner: paralogical
- License: mit
- Created: 2023-12-14T21:01:39.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-03T03:22:42.000Z (6 months ago)
- Last Synced: 2024-11-03T04:19:17.544Z (6 months ago)
- Language: TypeScript
- Size: 35.2 MB
- Stars: 263
- Watchers: 11
- Forks: 18
- Open Issues: 20
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- starred - paralogical/glish - map all words to single-syllable version (TypeScript)
README
# Glish
Watch the video about this project: https://youtu.be/sRbcw2sGkJw
Interactive Demo Translator tool here: https://paralogical.dev/glish/
**Goal: Make a version of English where every word is only one syllable**
Inputs:
- words by frequency (optimize monosyllabification for more common words)
inputs/word_frequency.txt
- words with pronunciations and split by syllables (CMU Dict syllablized)
Note: multiple valid pronunciations for any given word,
but all American englishStages:
- `syllablize.ts` → convert CMU dict to JSON mapping of word → IPA split by syllables
- `main.ts` → load IPA syllables and generate new monosyllabic version of all words
- `sonorityGraph.ts` → data structure that helps generate new syllables following sonority sequencing.
- `respellIPA.ts` → convert IPA back into "readable" latin alphabet.To run code to generate Glish language mapping,
- `ts-node syllablize.ts` to generate outputs/syllablizedIPA.json + syllableGraph + big list of randomly generated syllables
- `ts-node main.ts` to generate outputs/monosyllabic.json & other monosyllabic resultsTo run UI,
- `cd ui`
- `npm run dev`