https://github.com/behdad/use-syllables
Use Universal Shaping Engine code from HarfBuzz to segment text into syllables
https://github.com/behdad/use-syllables
Last synced: 3 months ago
JSON representation
Use Universal Shaping Engine code from HarfBuzz to segment text into syllables
- Host: GitHub
- URL: https://github.com/behdad/use-syllables
- Owner: behdad
- Archived: true
- Created: 2018-06-26T22:24:42.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2024-03-29T16:51:20.000Z (about 1 year ago)
- Last Synced: 2025-01-24T18:12:24.800Z (4 months ago)
- Language: C++
- Size: 30.3 KB
- Stars: 2
- Watchers: 4
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# USE Syllables
This piece of code extracted from HarfBuzz, uses the Universal Shaping
Engine implementation there to segment a list of Unicode codepoints into
syllables.## Build
Build it by just running:
$ make
Run it like:
```
$ ./main 10a00 10a10 10a01 10a10 10a01 10a01
syllable 0..1 standard_cluster
syllable 1..3 standard_cluster
syllable 3..6 standard_cluster
```## Caveats
* HarfBuzz's USE implementation might have more relaxed grammar than the
spec.* HarfBuzz's USE implementation currently does not support main Indic
scripts. While that's consistent with the spec, it might come as a
surprise. Apple already supports Indic scripts in USE. We want to
do as well. There's an issue in HarfBuzz github for that.