An open API service indexing awesome lists of open source software.

https://github.com/behdad/use-syllables

Use Universal Shaping Engine code from HarfBuzz to segment text into syllables
https://github.com/behdad/use-syllables

Last synced: 3 months ago
JSON representation

Use Universal Shaping Engine code from HarfBuzz to segment text into syllables

Awesome Lists containing this project

README

        

# USE Syllables

This piece of code extracted from HarfBuzz, uses the Universal Shaping
Engine implementation there to segment a list of Unicode codepoints into
syllables.

## Build

Build it by just running:

$ make

Run it like:
```
$ ./main 10a00 10a10 10a01 10a10 10a01 10a01
syllable 0..1 standard_cluster
syllable 1..3 standard_cluster
syllable 3..6 standard_cluster
```

## Caveats

* HarfBuzz's USE implementation might have more relaxed grammar than the
spec.

* HarfBuzz's USE implementation currently does not support main Indic
scripts. While that's consistent with the spec, it might come as a
surprise. Apple already supports Indic scripts in USE. We want to
do as well. There's an issue in HarfBuzz github for that.