https://github.com/adasek/czech-syllable-splitter
Splitting Czech words into syllables.
https://github.com/adasek/czech-syllable-splitter
count counter czech czech-language language natural-language-processing nlp python syllable syllables
Last synced: about 2 months ago
JSON representation
Splitting Czech words into syllables.
- Host: GitHub
- URL: https://github.com/adasek/czech-syllable-splitter
- Owner: adasek
- License: mit
- Created: 2024-06-14T21:47:31.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-06-14T22:03:25.000Z (11 months ago)
- Last Synced: 2025-02-13T03:03:12.001Z (2 months ago)
- Topics: count, counter, czech, czech-language, language, natural-language-processing, nlp, python, syllable, syllables
- Language: Python
- Homepage:
- Size: 8.79 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## Czech Syllable Splitter
Alogirthm for splitting Czech words into syllables.
Inspired by a syllable counting algorithm from David Lukeš counting the vowels.With Klára Bendová we put together rules to expand the vowels into syllables,
empirically finding some common letter groups to stay intact.This is not a perfect solution, but it is a good start for Czech language processing.
Measuring the accuracy of this algorithm is a to-do, as well as adding more rules if needed.### Installation
```bash
pip install czech-syllable-splitter
```
or using Poetry package manager
```bash
poetry add czech-syllable-splitter
```### Usage
```python
from czech_syllable_splitter import count_syllables, split_to_syllables, split_to_charactersprint(split_to_syllables("příliš"))
print(split_to_characters("přesný"))
print(count_syllables("přísný"))
```### Lint & Test
```bash
poetry run python3 -m pytest
poetry run mypy czech_syllable_splitter
poetry run pylint czech_syllable_splitter```