https://github.com/Sovichea/khmer_segmenter
A zero-dependency, high-performance Khmer word segmenter using the Viterbi algorithm. Optimized for dictionary accuracy, ultra-low memory footprint, and edge deployment.
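As a rough illustration of what dictionary-based Viterbi segmentation means, here is a minimal sketch. It is not the repository's code: the function name `segment`, the flat per-word costs, the `unknown_cost` fallback, and the `max_word_len` limit are all assumptions made for this example. It works on raw code points and ignores Khmer cluster rules that a real segmenter would need to respect.

```python
import math


def segment(text, word_costs, unknown_cost=10.0, max_word_len=20):
    """Split text into the lowest-cost sequence of dictionary words.

    word_costs maps each known word to a cost (lower is better).
    Any single character not covered by the dictionary is allowed
    as a fallback token at unknown_cost (hypothetical parameter).
    """
    n = len(text)
    best = [math.inf] * (n + 1)  # best[i]: minimal cost to segment text[:i]
    back = [0] * (n + 1)         # back[i]: start index of the last token ending at i
    best[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            piece = text[start:end]
            if piece in word_costs:
                cost = word_costs[piece]
            elif end - start == 1:
                cost = unknown_cost  # single-character fallback
            else:
                continue
            if best[start] + cost < best[end]:
                best[end] = best[start] + cost
                back[end] = start

    # Walk the back-pointers from the end of the text to recover the tokens.
    words = []
    end = n
    while end > 0:
        start = back[end]
        words.append(text[start:end])
        end = start
    words.reverse()
    return words


if __name__ == "__main__":
    # Toy dictionary with uniform costs; a real one would use frequency-derived costs.
    vocab = ["ខ្ញុំ", "ស្រឡាញ់", "ភាសា", "ខ្មែរ"]
    costs = {w: 1.0 for w in vocab}
    print(segment("".join(vocab), costs))
```

With the toy dictionary above, the unspaced sentence is split back into its four dictionary words, because any path that falls back to single characters costs more.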
- Host: GitHub
- URL: https://github.com/Sovichea/khmer_segmenter
- Owner: Sovichea
- License: MIT
- Created: 2026-01-02T06:59:43.000Z (about 1 month ago)
- Default Branch: main
- Last Pushed: 2026-01-07T17:17:44.000Z (about 1 month ago)
- Last Synced: 2026-01-08T03:28:33.049Z (about 1 month ago)
- Topics: c-language, dictionary-based, khmer, khmer-language, khmer-nlp, lightweight, nlp, portable, python, tokenization, viterbi-algorithm, word-segmentation, zero-dependency, zig-build-system
- Language: Python
- Homepage:
- Size: 45.6 MB
- Stars: 23
- Watchers: 0
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-khmer-language - Sovichea/khmer_segmenter - A zero-dependency, high-performance Khmer word segmenter using the Viterbi algorithm. Optimized for dictionary accuracy, ultra-low memory footprint, and edge deployment. (Awesome Khmer Language / 2. Toolkit)