Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nlpchina/nlp-lang
这个项目是一个基本包.封装了大多数nlp项目中常用工具
https://github.com/nlpchina/nlp-lang
java nlp nlp-lang tire
Last synced: 29 days ago
JSON representation
这个项目是一个基本包.封装了大多数nlp项目中常用工具
- Host: GitHub
- URL: https://github.com/nlpchina/nlp-lang
- Owner: NLPchina
- License: apache-2.0
- Created: 2014-03-30T13:38:45.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2024-04-18T02:16:29.000Z (7 months ago)
- Last Synced: 2024-10-14T17:43:24.898Z (29 days ago)
- Topics: java, nlp, nlp-lang, tire
- Language: Java
- Homepage:
- Size: 26.4 MB
- Stars: 1,492
- Watchers: 150
- Forks: 501
- Open Issues: 15
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
nlp-lang
========[![1.X Build Status](https://travis-ci.org/NLPchina/nlp-lang.svg?branch=master)](https://travis-ci.org/NLPchina/nlp-lang)
[![sourcegraph](https://sourcegraph.com/github.com/NLPchina/nlp-lang/-/badge.svg)](https://sourcegraph.com/github.com/NLPchina/nlp-lang?badge)##MAVEN
org.nlpcn
nlp-lang
1.7.6
#### 这个项目是一个基本包.封装了大多数nlp项目中常用工具
> 工具
* √ 词语标准化
* √ tire树结构
* √ 双数组tire树
* √ 文本断句
* √ html标签清理
* √ Viterbi算法增加> 组件
* √ 汉字转拼音
* √ 简繁体转换
* √ bloomfilter
* √ 指纹去重
* √ SimHash文章相似度计算
* √ 词共现统计
* √ 基于内存的搜索提示
* √ WordWeight词频统计,词idf统计,词类别相关度统计