Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with jieba
A curated list of projects in awesome lists tagged with jieba .
https://github.com/anderscui/jieba.NET
jieba中文分词的.NET版本(支持.NET Framework与.NET Core)
Last synced: 19 Nov 2024
https://github.com/anderscui/jieba.net
jieba中文分词的.NET版本(支持.NET Framework与.NET Core)
Last synced: 15 Dec 2024
https://github.com/messense/jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
chinese-word-segmentation jieba jieba-chinese nlp wasm
Last synced: 17 Dec 2024
https://github.com/deepcs233/jieba_fast
Use C Api and Swig to Speed up jieba 高效的中文分词库
dag jieba python swig viterbi-hmm
Last synced: 15 Dec 2024
https://github.com/sing1ee/elasticsearch-jieba-plugin
jieba analysis plugin for elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0,5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1
dict elasticsearch elasticsearch-jieba-plugin jieba stopwords
Last synced: 07 Dec 2024
https://github.com/lining0806/textmining
Python文本挖掘系统 Research of Text Mining System
jieba sklearn stopwords text-mining tf-idf user-dict
Last synced: 17 Dec 2024
https://github.com/GaoQ1/rasa_nlu_gq
turn natural language into structured data(支持中文,自定义了N种模型,支持不同的场景和任务)
bert bilstm-idcnn jieba natural-language nlp nlu rasa rasa-nlu rasa-nlu-gao tensorflow
Last synced: 02 Nov 2024
https://github.com/fendouai/Chinese-Text-Classification
Chinese-Text-Classification,Tensorflow CNN(卷积神经网络)实现的中文文本分类。QQ群:522785813,微信群二维码:http://www.tensorflownews.com/
chinese cnn cnn-text-classification jieba tensorflow text-classification
Last synced: 31 Oct 2024
https://github.com/fendouai/chinese-text-classification
Chinese-Text-Classification,Tensorflow CNN(卷积神经网络)实现的中文文本分类。QQ群:522785813,微信群二维码:http://www.tensorflownews.com/
chinese cnn cnn-text-classification jieba tensorflow text-classification
Last synced: 18 Dec 2024
https://github.com/lb2281075105/python-wechat-itchat
微信机器人,基于Python itchat接口功能实例展示:01-itchat获取微信好友或者微信群分享文章、02-itchat获取微信公众号文章、03-itchat监听微信公众号发送的文章、04 itchat监听微信群或好友撤回的消息、05 itchat获得微信好友信息以及表图对比、06 python打印出微信被删除好友、07 itchat自动回复好友、08 itchat微信好友个性签名词云图、09 itchat微信好友性别比例、10 微信群或微信好友撤回消息拦截、11 itchat微信群或好友之间转发消息
beautifulsoup4 bs4 echarts itchat jieba matplotlib matplotlib-live numpy os pandas pillow system time uuid wechat
Last synced: 17 Nov 2024
https://github.com/houbb/segment
The jieba-analysis tool for java.(基于结巴分词词库实现的更加灵活优雅易用,高性能的 java 分词实现。支持词性标注。)
benchmark chinese dfa hmm java jieba jieba-analysis jieba-chinese nlp segment segmentation trie trie-tree
Last synced: 17 Dec 2024
https://github.com/hongzhaohua/jstarcraft-nlp
专注于解决自然语言处理领域的几个核心问题:词法分析,句法分析,语义分析,语种检测,信息抽取,文本聚类和文本分类. 为相关领域的研发人员提供完整的通用设计与参考实现. 涵盖了多种自然语言处理算法,适配了多个自然语言处理框架. 兼容Lucene/Solr/ElasticSearch插件.
ansj corenlp elasticsearch hanlp ik java jcseg jieba language-detection lucene mmseg mynlp nlp solr thulac word
Last synced: 16 Nov 2024
https://github.com/hockyy/miteiru
Miteiru is an open source Electron video player to learn Chinese, Cantonese, and Japanese. It can play all Youtube and HTML 5 supported format (.mkv, .mp4, .mov, and many more) videos, and lots of supports on other subtitle formats (.srt, .ass, .vtt, and many more)
anime cantonese chinese electron hanzi hiragana japanese jieba jmdict jyutping kanji katakana kuromoji mecab player subtitle video video-player
Last synced: 17 Dec 2024
https://github.com/fengkx/jieba-wasm
WASM binding to jieba-rs
chinese chinese-segmenter jieba wasm
Last synced: 15 Dec 2024
https://github.com/qiwihui/smsfilters
基于机器学习的 iOS 中文垃圾短信过滤 App
ios-app ios-swift jieba machine-learning message-filtering swift swift4
Last synced: 22 Oct 2024
https://github.com/messense/cjieba-py
Python cffi binding to CppJieba
cffi chinese-word-segmentation jieba jieba-chinese python-bindings word-segmentation
Last synced: 08 Nov 2024
https://github.com/JackHCC/Word-Counting
利用jieba库对中文小说进行词频统计并进行简单的正则匹配,同时验证Zipf-Law(Use the jieba library to perform word frequency statistics on Chinese novels and perform simple regular matching, and verify Zipf-Law)
Last synced: 09 Nov 2024
https://github.com/xiaoandx/reptile
“Python四川疫情爬虫可视化统计”(以下简称《四川疫情可视化爬虫》)。2020年随着冬季的到来,新冠疫情病毒可谓是越发猖狂。近段时间我国的上海、新疆、内蒙古、黑龙江、四川等地相继出现了疫情反弹的现象,所幸及时得到控制,并没有让病毒向外扩散。这些病毒都是本土新增,并不是境外输入。四川新增新型冠状病毒肺炎确诊病例又本地与输入病例组成,全省其它市州无新增无症状感染者。为了更好统计四川的疫情每日数据,对比省内各市州的疫情情况。通过大数据分析十五天内疫情期间最热门的词汇。
Last synced: 08 Nov 2024
https://github.com/fumiama/jieba
Jiebago 的性能优化版, 支持从 io.Reader 加载字典
chinese chinese-characters chinese-language chinese-text-segmentation chinese-word-segmentation golang golang-library golang-package jieba jieba-analysis jieba-chinese
Last synced: 30 Oct 2024
https://github.com/moyuweiqing/cnki-analysis
使用python,从知网上爬取相关的数据,并进行数据分析,涉及到pycharm和jupyter notebook
jieba jupyter-notebook matplotlib networkx plotly pyecharts python
Last synced: 11 Nov 2024
https://github.com/vcaesar/gse-bind
Go efficient text segmentation ; Go 语言高性能分词, binding other language.
binding go golang gse javascript jieba node pyhton
Last synced: 13 Oct 2024
https://github.com/ailln/simple-jieba
✂️用 100 行实现简单版本的 jieba 分词
chinese-word-segmentation jieba jieba-chinese word-segmentation
Last synced: 18 Nov 2024
https://github.com/cleoold/markov_cn_node
Backend creating random Chinese sentences based on chat history / 根据消息记录生成随机句子
Last synced: 13 Oct 2024
https://github.com/luozijun/rust-jieba
Rust jieba
hamming-distance hidden-markov-model hmm jaccard jieba minhash mmseg simhash similarity
Last synced: 20 Dec 2024
https://github.com/beanwei/wms
WMS(货物管理系统),学习Flask全文搜索的练手Demo,基于Jieba和flask_whooshalchemyplus来做分词和搜索
flask flask-sqlalchemy jieba restful-api searchable whoosh wms
Last synced: 07 Nov 2024
https://github.com/william-zhan-bot/ptt_commet_temperature
以python情感分析,計算台灣ptt論壇政治板文章的評論風向
comments jieba nlp politics python sentiment-analysis snownlp web-crawler
Last synced: 19 Nov 2024
https://github.com/hsiehbocheng/segmentation-and-pos-tagging
Compare Jieba and Droidtown ArticutAPI word segmentation and post tagging, and use the self-introduction of each company in the three industries as data to analyze the use of nouns and verbs in each industry
droidtown jieba nlp-machine-learning tableau
Last synced: 14 Nov 2024
https://github.com/centre-for-humanities-computing/chinese-tokenizer
A Rusty way of tokenizing Chinese texts
Last synced: 09 Nov 2024