Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with jieba

A curated list of projects in awesome lists tagged with jieba.

https://github.com/go-ego/gse

Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese, and other languages.

chinese english go gse hmm hmm-viterbi-algorithm japanese jieba nlp segment trie

Last synced: 16 Dec 2024

https://github.com/napi-rs/node-rs

Node.js bindings ❤️ Rust crates

bcrypt crc32c eslint hash jieba napi-rs node-api nodejs

Last synced: 17 Dec 2024

https://github.com/anderscui/jieba.NET

.NET version of the jieba Chinese word segmenter (supports .NET Framework and .NET Core)

jieba lucene

Last synced: 19 Nov 2024

https://github.com/anderscui/jieba.net

.NET version of the jieba Chinese word segmenter (supports .NET Framework and .NET Core)

jieba lucene

Last synced: 15 Dec 2024

https://github.com/messense/jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

chinese-word-segmentation jieba jieba-chinese nlp wasm

Last synced: 17 Dec 2024

https://github.com/deepcs233/jieba_fast

Uses the C API and SWIG to speed up jieba: an efficient Chinese word segmentation library

dag jieba python swig viterbi-hmm

Last synced: 15 Dec 2024

https://github.com/sing1ee/elasticsearch-jieba-plugin

jieba analysis plugin for Elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0, 5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1

dict elasticsearch elasticsearch-jieba-plugin jieba stopwords

Last synced: 07 Dec 2024

https://github.com/fuqiuai/wordcloud

Segment text and generate word clouds with Python

jieba python wordcloud

Last synced: 17 Dec 2024

https://github.com/fuqiuai/wordCloud

Segment text and generate word clouds with Python

jieba python wordcloud

Last synced: 08 Nov 2024

https://github.com/lining0806/textmining

A text mining system in Python (Research of Text Mining System)

jieba sklearn stopwords text-mining tf-idf user-dict

Last synced: 17 Dec 2024

https://github.com/GaoQ1/rasa_nlu_gq

Turn natural language into structured data (supports Chinese; includes a number of custom models for different scenarios and tasks)

bert bilstm-idcnn jieba natural-language nlp nlu rasa rasa-nlu rasa-nlu-gao tensorflow

Last synced: 02 Nov 2024

https://github.com/fendouai/Chinese-Text-Classification

Chinese-Text-Classification: Chinese text classification implemented with a TensorFlow CNN (convolutional neural network). QQ group: 522785813; WeChat group QR code: http://www.tensorflownews.com/

chinese cnn cnn-text-classification jieba tensorflow text-classification

Last synced: 31 Oct 2024

https://github.com/fendouai/chinese-text-classification

Chinese-Text-Classification: Chinese text classification implemented with a TensorFlow CNN (convolutional neural network). QQ group: 522785813; WeChat group QR code: http://www.tensorflownews.com/

chinese cnn cnn-text-classification jieba tensorflow text-classification

Last synced: 18 Dec 2024

https://github.com/lb2281075105/python-wechat-itchat

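WeChat bot: worked examples of the Python itchat interface. 01: itchat fetches articles shared by WeChat friends or groups; 02: itchat fetches articles from WeChat official accounts; 03: itchat monitors articles sent by WeChat official accounts; 04: itchat monitors messages recalled by WeChat groups or friends; 05: itchat retrieves WeChat friend information with table and chart comparisons; 06: Python prints the list of deleted WeChat friends; 07: itchat auto-replies to friends; 08: itchat word cloud of WeChat friends' personal signatures; 09: itchat gender ratio of WeChat friends; 10: intercepting messages recalled by WeChat groups or friends; 11: itchat forwards messages between WeChat groups or friends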

beautifulsoup4 bs4 echarts itchat jieba matplotlib matplotlib-live numpy os pandas pillow system time uuid wechat

Last synced: 17 Nov 2024

https://github.com/snailclimb/python

A collection of examples for learning third-party Python libraries

itchat jieba wordcloud wxpy

Last synced: 20 Dec 2024

https://github.com/ixqbar/phpjieba

PHP extension for jieba Chinese word segmentation, for PHP 5 and PHP 7

jieba php php7

Last synced: 07 Nov 2024

https://github.com/moyuweiqing/bilibili-barrage-analysis

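Analysis of bilibili bullet comments (danmaku), including a crawler, word cloud analysis, word frequency analysis, sentiment analysis, derived metrics, and visualization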

jieba pandas pyecharts python requests selenium snownlp wordcloud

Last synced: 11 Nov 2024

https://github.com/houbb/segment

The jieba-analysis tool for Java. (A more flexible, elegant, easy-to-use, high-performance Java word segmenter built on the jieba dictionary. Supports part-of-speech tagging.)

benchmark chinese dfa hmm java jieba jieba-analysis jieba-chinese nlp segment segmentation trie trie-tree

Last synced: 17 Dec 2024

https://github.com/hongzhaohua/jstarcraft-nlp

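Focuses on several core problems in natural language processing: lexical analysis, syntactic analysis, semantic analysis, language detection, information extraction, text clustering, and text classification. Provides developers in related fields with a complete general-purpose design and reference implementation. Covers a variety of NLP algorithms and adapts to multiple NLP frameworks. Compatible with Lucene/Solr/ElasticSearch plugins.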

ansj corenlp elasticsearch hanlp ik java jcseg jieba language-detection lucene mmseg mynlp nlp solr thulac word

Last synced: 16 Nov 2024

https://github.com/hockyy/miteiru

Miteiru is an open source Electron video player for learning Chinese, Cantonese, and Japanese. It plays YouTube videos and all HTML5-supported video formats (.mkv, .mp4, .mov, and many more), and supports many subtitle formats (.srt, .ass, .vtt, and many more).

anime cantonese chinese electron hanzi hiragana japanese jieba jmdict jyutping kanji katakana kuromoji mecab player subtitle video video-player

Last synced: 17 Dec 2024

https://github.com/fengkx/jieba-wasm

WASM binding to jieba-rs

chinese chinese-segmenter jieba wasm

Last synced: 15 Dec 2024

https://github.com/qiwihui/smsfilters

A machine-learning-based iOS app for filtering Chinese spam SMS messages

ios-app ios-swift jieba machine-learning message-filtering swift swift4

Last synced: 22 Oct 2024

https://github.com/messense/rjieba-py

jieba-rs Python binding

jieba jieba-rs

Last synced: 18 Dec 2024

https://github.com/JackHCC/Word-Counting

Uses the jieba library to compute word frequency statistics for Chinese novels, perform simple regular expression matching, and verify Zipf's law

jieba mini-program python

Last synced: 09 Nov 2024

https://github.com/xiaoandx/reptile

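"Python Sichuan Epidemic Crawler Visualization Statistics" (hereafter "Sichuan Epidemic Visualization Crawler"). With the arrival of winter in 2020, the novel coronavirus became increasingly rampant. Shanghai, Xinjiang, Inner Mongolia, Heilongjiang, Sichuan, and other parts of China recently saw outbreaks rebound one after another; fortunately, they were brought under control in time and the virus did not spread further. These cases were all locally acquired rather than imported from abroad. Sichuan's newly confirmed COVID-19 cases consist of local and imported cases, and the other cities and prefectures in the province reported no new asymptomatic infections. The project aims to better track Sichuan's daily epidemic data, compare the situation across the province's cities and prefectures, and use big data analysis to find the most popular terms over a fifteen-day period during the epidemic.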

jieba pyecharts python3

Last synced: 08 Nov 2024

https://github.com/moyuweiqing/cnki-analysis

Uses Python to crawl data from CNKI and analyze it; involves PyCharm and Jupyter Notebook

jieba jupyter-notebook matplotlib networkx plotly pyecharts python

Last synced: 11 Nov 2024

https://github.com/vcaesar/gse-bind

Efficient text segmentation in Go; high-performance word segmentation with bindings for other languages.

binding go golang gse javascript jieba node pyhton

Last synced: 13 Oct 2024

https://github.com/SunDoge/jieba-rs

Rust version of jieba Chinese word segmentation (unfinished)

jieba rust

Last synced: 09 Nov 2024

https://github.com/29dch/word_cloud

A project for making word clouds with Python

crawler jieba wordcloud

Last synced: 11 Nov 2024

https://github.com/qiwihui/swiftjiebademo

Demo of the iOS Swift version of the "jieba" Chinese word segmenter

ios-swift jieba swift

Last synced: 24 Oct 2024

https://github.com/ailln/simple-jieba

✂️ A simple version of jieba word segmentation implemented in 100 lines

chinese-word-segmentation jieba jieba-chinese word-segmentation

Last synced: 18 Nov 2024

https://github.com/cleoold/markov_cn_node

Backend that generates random Chinese sentences based on chat history

chinese jieba markov

Last synced: 13 Oct 2024

https://github.com/beanwei/wms

WMS (goods management system): a practice demo for learning full-text search in Flask, using jieba and flask_whooshalchemyplus for word segmentation and search

flask flask-sqlalchemy jieba restful-api searchable whoosh wms

Last synced: 07 Nov 2024

https://github.com/cathaysia/jieba_nvim

Uses jieba to segment sentences in nvim so the cursor can move by word

jieba lua nvim

Last synced: 09 Oct 2024

https://github.com/william-zhan-bot/ptt_commet_temperature

Uses Python sentiment analysis to gauge the direction of comments on articles in the politics board of Taiwan's PTT forum

comments jieba nlp politics python sentiment-analysis snownlp web-crawler

Last synced: 19 Nov 2024

https://github.com/hsiehbocheng/segmentation-and-pos-tagging

Compares Jieba and Droidtown ArticutAPI word segmentation and POS tagging, using company self-introductions from three industries as data to analyze how nouns and verbs are used in each industry

droidtown jieba nlp-machine-learning tableau

Last synced: 14 Nov 2024

https://github.com/centre-for-humanities-computing/chinese-tokenizer

A Rusty way of tokenizing Chinese texts

jieba rust tokenizer

Last synced: 09 Nov 2024

https://github.com/freakwill/nlplearning

💬 Notes for learning NLP tools such as NLTK and jieba

gensim jieba nlp nltk word2vec

Last synced: 28 Nov 2024