Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/JackHCC/Chinese-Tokenization
利用传统方法(N-gram,HMM等)、神经网络方法(CNN,LSTM等)和预训练方法(Bert等)的中文分词任务实现【The word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre training methods (Bert, etc.)】
https://github.com/JackHCC/Chinese-Tokenization
bert-crf bilstm-crf hmm-viterbi-algorithm ngram nlp tokenization
Last synced: 3 months ago
JSON representation
利用传统方法(N-gram,HMM等)、神经网络方法(CNN,LSTM等)和预训练方法(Bert等)的中文分词任务实现【The word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre training methods (Bert, etc.)】
- Host: GitHub
- URL: https://github.com/JackHCC/Chinese-Tokenization
- Owner: JackHCC
- Created: 2022-04-05T13:29:47.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2022-06-15T15:13:11.000Z (over 2 years ago)
- Last Synced: 2024-07-30T22:12:34.701Z (3 months ago)
- Topics: bert-crf, bilstm-crf, hmm-viterbi-algorithm, ngram, nlp, tokenization
- Language: Python
- Homepage:
- Size: 45.4 MB
- Stars: 32
- Watchers: 1
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md