Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bluelovers/node-segment-dict
novel-segment 分詞字典
https://github.com/bluelovers/node-segment-dict
Last synced: 3 days ago
JSON representation
novel-segment 分詞字典
- Host: GitHub
- URL: https://github.com/bluelovers/node-segment-dict
- Owner: bluelovers
- Created: 2018-03-15T16:49:36.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2024-11-09T08:28:31.000Z (4 days ago)
- Last Synced: 2024-11-09T09:26:34.563Z (4 days ago)
- Language: TypeScript
- Homepage: https://bluelovers.github.io/node-segment-dict/
- Size: 44.5 MB
- Stars: 3
- Watchers: 3
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Funding: .github/FUNDING.yml
Awesome Lists containing this project
README
# segment-dict
## live demo
> https://segment-api.bluelovers.now.sh/demo.html
### api
```
https://segment-api.bluelovers.now.sh/conv?input=从时间上来说是过了数秒。视线的焦点总算稳定后,那是在顶棚略低的运动跑车的右边驾驶席上。稳稳坐在车内驾驶席上的要,朝着映在车内镜的脸庞望去。柔顺的黑发加上好战的锐利目光,穿在身上的松垮衬衫被卷起了袖子,系在脖子上的红色领带显得极为松垮。把视线落到双手上,带着手表的手腕下是不怎么厚的蓝色裤子,腿上还挂着黑色的随身包。一瞬间脑子还有点没转过来,之后总算理解了情况。
``````
https://segment-api.bluelovers.now.sh/?input=从时间上来说是过了数秒。视线的焦点总算稳定后,那是在顶棚略低的运动跑车的右边驾驶席上。稳稳坐在车内驾驶席上的要,朝着映在车内镜的脸庞望去。柔顺的黑发加上好战的锐利目光,穿在身上的松垮衬衫被卷起了袖子,系在脖子上的红色领带显得极为松垮。把视线落到双手上,带着手表的手腕下是不怎么厚的蓝色裤子,腿上还挂着黑色的随身包。一瞬间脑子还有点没转过来,之后总算理解了情况。
```## dict
* [dict](https://github.com/bluelovers/ws-segment/tree/master/packages/segment-dict/dict)
### Breaking Changes
請注意 從 2.0.0 版之後 開始 更改了 synonym 字典的格式
* 原版為一對一 => 錯字,正字
* 這裡為一對多 並且順序與原版相反 => 正字,錯字,...以,分隔更多字## demo
```ts
import { requireLoader, requireLoaderModule, getDictPath } from 'segment-dict';requireLoader('jieba')('../dict/nodejieba/user.dict.utf8')
.then(function (dict)
{
console.log(dict);
})
;requireLoaderModule('segment').load(getDictPath('segment', 'dict.txt'))
.then(function (dict)
{
console.log(dict);
})
;
```## links
* [novel-segment](https://github.com/bluelovers/ws-segment/tree/master/packages/novel-segment)
* [線上測試 by RunKit](https://npm.runkit.com/novel-segment)
---
* [中文詞彙的意義與詞意](http://cwn.ling.sinica.edu.tw/query1.htm)
* http://cdict.info/
* http://www.kwuntung.net/synonym/
* [萌典](https://www.moedict.tw)
* [教育部重編國語辭典修訂本](http://dict.revised.moe.edu.tw/cgi-bin/cbdic/gsweb.cgi?ccd=bKKilu&o=e0&sec=sec1&index=1)---
* http://opencc.byvoid.com/
---
* [search:jieba](https://www.npmjs.com/search?q=jieba)
* [search:nlp chinese](https://www.npmjs.com/search?q=nlp%20chinese)
* [keywords:中文分词](https://www.npmjs.com/search?q=keywords:%E4%B8%AD%E6%96%87%E5%88%86%E8%AF%8D)
* [keywords:分词](https://www.npmjs.com/search?q=keywords:%E5%88%86%E8%AF%8D)### zh
* [segmentit](https://www.npmjs.com/package/segmentit)
* [node-opencc](https://github.com/compulim/node-opencc)
* [OpenCC](https://github.com/BYVoid/OpenCC)
* [hanzi](https://www.npmjs.com/package/hanzi)
* [jieba-js](https://github.com/bluelovers/jieba-js)
* [nodescws](https://github.com/dotSlashLu/nodescws)
* [node-analyzer](https://www.npmjs.com/package/node-analyzer)
* [nanalyzer](https://www.npmjs.com/package/nanalyzer)
* [THULAC:一个高效的中文词法分析工具包](http://thulac.thunlp.org/)
* [HanLP](https://github.com/hankcs/HanLP)
* [node-hanlp](https://github.com/beyai/node-hanlp)#### java
* [ansj_seg](https://github.com/NLPchina/ansj_seg)
### jp
* [rakutenma](https://www.npmjs.com/package/rakutenma)
* [tiny-segmenter](https://github.com/leungwensen/tiny-segmenter)## trie
* [aho-corasick](https://github.com/xudejian/aho-corasick)
* [aho-corasick-node](https://github.com/guofei/aho-corasick-node)## other
* [moji](https://github.com/niwaringo/moji) 半角全角変換ライブラリ
## post
* [Aho Corasick自动机结合DoubleArrayTrie极速多模式匹配](http://www.hankcs.com/program/algorithm/aho-corasick-double-array-trie.html)
* [DoubleArrayTrie和AhoCorasickDoubleArrayTrie的实用性对比](http://www.hankcs.com/program/algorithm/double-array-trie-vs-aho-corasick-double-array-trie.html)
* [层叠隐马模型下的音译人名和日本人名识别](http://www.hankcs.com/nlp/name-transliteration-cascaded-hidden-markov-model-and-japanese-personal-names-recognition.html)
* [简单有效的多标准中文分词](http://www.hankcs.com/nlp/segment/multi-criteria-cws.html)
* [词性标注](http://www.hankcs.com/nlp/part-of-speech-tagging.html)
* [形容詞和副詞的用法說明](http://210.240.55.2/~t311/moe/engb5/diagnose/adj_adv/adjandadv_e.htm)
* [形容詞 - 實用基礎文法](http://www.taiwantestcentral.com/Grammar/Title.aspx?ID=4)
* [什麼是複合形容詞?教你有力又到位的形容人事物!](https://tw.blog.voicetube.com/archives/11889 "Permalink to 【實用】什麼是複合形容詞?教你有力又到位的形容人事物!")
* [HanLP在线演示](http://hanlp.hankcs.com/)
* [全国名字ランキング](https://myoji-yurai.net/prefectureRanking.htm)
* [跳跃表,字典树(单词查找树,Trie树),后缀树,KMP算法,AC 自动机相关算法原理详细汇总](https://blog.csdn.net/zhongwen7710/article/details/39274349)
* [現代漢語常用簡繁一對多字義辨析表](http://ytenx.org/byohlyuk/KienxPyan)
* [ACG/Vocaloid同人粵語中文歌詞wiki](http://canto.acgvlyric.org/doku.php/w:%E7%94%A8%E5%AD%97%E6%A8%99%E6%BA%96?redirect=1)
* [如何分辨動詞、名詞、形容詞、副詞?](https://tw.answers.yahoo.com/question/index?qid=20130404000010KK03888)
* [簡正轉換別字表](http://founder.acgvlyric.org/iu/doku.php/w:%E7%B0%A1%E6%AD%A3%E8%BD%89%E6%8F%9B%E5%88%A5%E5%AD%97%E8%A1%A8?fbclid=IwAR2SOQvnvierjEc6goeHLGuOUz-awHlxTbOSN7IeYeI_vOmk3HBDXrLdT2U)
* http://www.kmuh.org.tw/www/drgga/book3/K10/K102.htm
* [文字上你可能常犯的錯](https://www.facebook.com/wordscooker/)
* [繁化姬 - 字典測試](https://gist.github.com/n6333373/f06a3aa27fcde0ba31c5955cfc33ca85)
* [CNS11643 中文全字庫](https://www.cns11643.gov.tw/index.jsp)
* [漢字辞典 - Jigen.net](https://jigen.net/)
* [教育部異體字字典](https://dict.variants.moe.edu.tw/variants/rbt/home.do)
*