An open API service indexing awesome lists of open source software.

https://github.com/canclid/rime-cantonese-upstream

rime-cantonese 上游詞表倉庫
https://github.com/canclid/rime-cantonese-upstream

Last synced: 7 months ago
JSON representation

rime-cantonese 上游詞表倉庫

Awesome Lists containing this project

README

          

[粵語](README.md)

# rime-cantonese Upstream Word List

This repo serves as the upstream data storage for [rime-cantonese](https://github.com/rime/rime-cantonese). The rime-cantonese repo regularly pulls data from this repo and compile the lexicon.

## Structure

This repo contains the following files:

1. `char.csv`: Characters
1. `word.csv`: Common words
1. `phrase_fragment.csv`: Short phrases, input fragments and combos, ngrams
1. `trending.csv`: Uncategorized newly added words.

## Data sources

Source of single character entries

- LSHK 電腦用漢字粵語拼音表 https://github.com/lshk-org/jyutping-table

Consultant resources for single character entries

- [Unihan 12.0 kCantonese](https://www.unicode.org/charts/unihan.html)
- [粵語審音配詞字庫](https://humanum.arts.cuhk.edu.hk/Lexis/lexi-can/)
- [《廣州話正音字典》](https://github.com/jyutnet/cantonese-books-data/tree/master/2004_%E5%BB%A3%E5%B7%9E%E8%A9%B1%E6%AD%A3%E9%9F%B3%E5%AD%97%E5%85%B8)

Source of word entries

- [粵典](https://words.hk/faiman/analysis/wordslist/)
- [冚唪唥粵文](https://hambaanglaang.hk/)
- [《實用廣州話分類詞典》](https://github.com/rime/rime-cantonese/blob/build/lexicons/%E3%80%8A%E5%AF%A6%E7%94%A8%E5%BB%A3%E5%B7%9E%E8%A9%B1%E5%88%86%E9%A1%9E%E8%A9%9E%E5%85%B8%E3%80%8B.tsv)
- A Dictionary of Cantonese Slang
- 《廣州話詞典》
- 《地道廣州話用語》

## Credits

- laubonghaudoi
- Ayaka
- Leimaau
- Chaak
- Bing Cheung
- Cherry
- Lili Ou
- Philip Wong
- Henry Chan
- Alex Man