Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-nlp-note
A curated list of resources dedicated to NLP (papers, blogs, notes, etc.)
https://github.com/eagle705/awesome-nlp-note
-
Libraries
-
Annotation Tools
- LightTag - Hosted and managed text annotation tool for teams (paid)
- Label Studio - Open source, configurable data annotation tool. Its purpose is to enable you to label different types of data using the most convenient interface with a standardized output format.
- LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019) - LIDA is an open source dialogue annotation system which supports the full pipeline of dialogue annotation from dialogue / turn segmentation from raw text
- Anafora - Web-based raw text annotation tool
- brat - brat rapid annotation tool is an online environment for collaborative text annotation
- GATE - General Architecture for Text Engineering; 15+ years old, free and open source
- tagtog
- prodigy
- rstWeb - open source local or online tool for discourse tree annotations
- GitDox - open source server annotation tool with GitHub version control and validation for XML data and collaborative spreadsheet grids
- doccano - doccano is free, open-source, and provides annotation features for text classification, sequence labeling and sequence to sequence
-
Python Libraries
- spaCy - Industrial strength NLP with Python and Cython :+1:
- textacy - Higher level NLP built on spaCy
- scattertext - Python library to produce d3 visualizations of how language differs between corpora
- GluonNLP - A deep learning toolkit for NLP, built on MXNet/Gluon, for research prototyping and industrial deployment of state-of-the-art models on a wide range of NLP tasks.
- AllenNLP - An NLP research library, built on PyTorch, for developing state-of-the-art deep learning models on a wide variety of linguistic tasks.
- PyTorch-NLP - NLP research toolkit designed to support rapid prototyping with better data loaders, word vector loaders, neural network layer representations, common NLP metrics such as BLEU
- Rosetta - Text processing tools and wrappers (e.g. Vowpal Wabbit)
- PyNLPl - Python Natural Language Processing Library. General purpose NLP library for Python. Also contains some specific modules for parsing common NLP formats, most notably for [FoLiA](https://proycon.github.io/folia/), but also ARPA language models, Moses phrasetables, GIZA++ alignments.
- jPTDP - A toolkit for joint part-of-speech (POS) tagging and dependency parsing. jPTDP provides pre-trained models for 40+ languages.
- BigARTM - a fast library for topic modelling
- Snips NLU - A production ready library for intent parsing
- Chazutsu - A library for downloading and parsing standard NLP research datasets
- Word Forms - Word forms can accurately generate all possible forms of an English word
- Multilingual Latent Dirichlet Allocation (LDA) - A multilingual and extensible document clustering pipeline
- Kashgari - Simple, Keras-powered multilingual NLP framework, allows you to build your models in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS) and text classification tasks. Includes BERT and word2vec embedding.
- gensim - Python library to conduct unsupervised semantic modelling from plain text :+1:
- NLP Architect - A library for exploring the state-of-the-art deep learning topologies and techniques for NLP and NLU
-
-
Blogs & Youtube
-
GitHub
- Chatbot convai2 (with retrieval via elastic)
- DL dev to production
- NL to SQL by BERT
- Jejueo (Jeju language) translation and speech synthesis (by 박규병)
- pypapago nmt lib
- makcedward/nlpaug (NLP & signal augmentation)
- lovit's Fast Campus course, Machine Learning for Natural Language Processing (GitHub)
- Korean document -> sentence classifier (important)
- API-based chatbot example
- NLP RedditSota
- Yandex lectures
- Hangul jamo decomposition toolkit
- Python open-source chatbot RasaHQ
- Customized KoNLPy
- PyTorch Transformer (by 용래)
- Korean NER Dataset Github
- Korean Chitchat Dataset with Sentiment (by 송영숙)
- Chatbot API open source example
- Awesome Python
- Yunjey's PyTorch Tutorial
- Developer technical interview notes
- NER_TensorFlow_2017_HCLT
- GitHub blog source (by 이기창)
- GitHub blog source I currently use
- Pycon 2019 Tutorial GluonNLP tutorial
- NLP tutorial by lyeoni
- tmux settings
- CRF!!! harvardnlp/pytorch-struct
- RL Chatbot1
- RL Chatbot2
- Evaluation Sentence Embedding (SentEval)
- python-mecab-ko
- beam search + nlp_made_easy (by 박규병)
- PyTorch Wrapper, pytorch-lightning
- Chatspace, a word-spacing model built by Pingpong
- matplotlib + Hangul
-
NLP in Korean
-
Libraries
- KoalaNLP - Scala library for Korean Natural Language Processing.
- Mecab (Korean) - C++ library for Korean NLP
-
Datasets
- NER dataset from the Korea Maritime and Ocean University Natural Language Processing Lab
- conversational-AI-datasets (English dialogue datasets)
- Korean WordNet
- KAIST Corpus - A corpus from the Korea Advanced Institute of Science and Technology in Korean.
- Chosun Ilbo archive - dataset in Korean from one of the major newspapers in South Korea, the Chosun Ilbo.
- PAWS and PAWS-X: Two New Datasets to Improve Natural Language Understanding Models (Paraphrase Adversaries from Word Scrambling)
-
-
Tutorials
-
Videos and Online Courses
- Deep Natural Language Processing - Lectures series from Oxford
- Intro to Artificial Intelligence - Udacity course which touches upon NLP as well
- Deep Learning for Natural Language Processing (CS224n) - Richard Socher and Christopher Manning's Stanford course
- Neural Networks for NLP - Course from the Carnegie Mellon Language Technologies Institute
-
-
Research Summaries and Trends
- NLP-Overview - Up-to-date overview of deep learning techniques applied to NLP, including theory, implementations, applications, and state-of-the-art results. This is a great Deep NLP introduction for researchers.
- NLP-Progress - Tracks the state of the art for the most common NLP tasks
- NLP's ImageNet moment has arrived
- Four deep learning trends from ACL 2017. Part One: Linguistic Structure and Word Embeddings
- Four deep learning trends from ACL 2017. Part Two: Interpretability and Attention
- Survey of the State of the Art in Natural Language Generation
- ACL 2018 Highlights: Understanding Representation and Evaluation in More Challenging Settings
-
Keywords
- nlp (18)
- machine-learning (15)
- natural-language-processing (15)
- deep-learning (14)
- python (9)
- chatbot (6)
- pytorch (5)
- text-classification (4)
- named-entity-recognition (4)
- nlu (4)
- dataset (4)
- ai (4)
- neural-network (3)
- data-science (3)
- ner (3)
- datasets (3)
- korean (3)
- spacy (3)
- bert (3)
- artificial-intelligence (3)
- mxnet (2)
- image-classification (2)
- gluonnlp (2)
- image-labeling (2)
- image-labelling-tool (2)
- label-studio (2)
- labeling (2)
- ml (2)
- labeling-tool (2)
- nlp-library (2)
- mlops (2)
- deeplearning (2)
- lstm (2)
- bot (2)
- chatbots (2)
- neural-networks (2)
- conversational-ai (2)
- machine-learning-library (2)
- semantic-segmentation (2)
- text-annotation (2)
- yolo (2)
- text-mining (2)
- topic-modeling (2)
- machine-translation (2)
- image-annotation (2)
- data-labeling (2)
- annotation (2)
- computer-vision (2)
- boundingbox (2)
- tensorflow (2)