Chinese NLP corpus datasets
https://github.com/howl-anderson/corpus_dataset_for_chinese_nlp


# corpus_dataset_for_Chinese_NLP
## Provided by academic institutions
### Fudan Natural Language Processing Group
URL: http://nlp.fudan.edu.cn/
* [Chinese Word Segmentation and POS Tagging for Micro-Blog Texts](http://nlp.fudan.edu.cn/data/)
* [Multi-task Learning for Text Classification](http://nlp.fudan.edu.cn/data/)
* [Neural Sentence Ordering](http://nlp.fudan.edu.cn/data/)

### NLP and Big Data Research Group in the ISTD pillar at the Singapore University of Technology and Design
URL: http://www.statnlp.org/software/dataset
* Multilingual Geoquery
* MalwareTextDB
* Multilingual ATIS
* NP-annotated SMS dataset

### THUOCL: Tsinghua University Open Chinese Lexicon
URL: http://thuocl.thunlp.org/

### XuetangX Course Corpus for Chinese Word Segmentation and POS Tagging
URL: http://nlp.csai.tsinghua.edu.cn/site2/index.php/en/resources/195-xuetangxccorpus1-0