Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-nlp

https://github.com/supertopdev/awesome-nlp

Last synced: 1 day ago

Techniques
- Text Summarization
  - Example blogpost - Summarization-with-Amazon-Reviews
  - TextRank: Bringing Order into Texts
  - Modelling Compression with Discourse Constraints
  - Deep Recurrent Generative Decoder model for Abstractive Text Summarization - A sequence-to-sequence oriented encoder-decoder model equipped with a deep recurrent generative decoder.
  - A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification - An encoder-decoder model for text summarization.
  - TextSum
- Named Entity Recognition
- Text Embeddings
  - Efficient Estimation of Word Representations in Vector Space
  - Word2Vec Resources on Github
  - GloVe: Global vectors for word representation
  - Pre-trained Vectors
  - arXiv: Enriching Word Vectors with Subword Information
  - HLBL language model
  - Improving Word Representations Via Global Context And Multiple Word Prototypes
  - Dependency based word embeddings
  - sense2vec - on word sense disambiguation
  - Infinite Dimensional Word Embeddings - new
  - Skip Thought Vectors - word representation method
  - Adaptive skip-gram - similar approach, with adaptive properties
  - Improving distributional similarity with lessons learned from word embeddings
  - Deep Contextualized Word Representations - [PyTorch](https://github.com/allenai/allennlp/blob/master/tutorials/how_to/elmo.md) - [TF Implementation](https://github.com/allenai/bilm-tf)
  - Deep Learning, NLP, and Representations
- Thought Vectors
  - Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
  - Distributed Representations of Sentences and Documents
  - Le - technologies.com/doc2vec-tutorial/
  - Deep Recursive Neural Networks for Compositionality in Language
  - Semi-supervised Sequence Learning
- Machine Translation
  - seq2seq tensorflow tutorial
  - arXiv: Sequence to Sequence Learning with Neural Networks
  - arXiv: Neural Machine Translation by jointly learning to align and translate
  - arXiv: A Convolutional encoder model for neural machine translation
  - Convolutional Sequence to Sequence learning
  - Convolutional over Recurrent Encoder for neural machine translation
  - blog post - encoder-decoder architecture with seq2seq models. [TensorFlow code here](https://github.com/tensorflow/nmt)
- Dialogs and Conversational
  - A Neural Network Approach to Context-Sensitive Generation of Conversational Responses
  - Recurrent Neural Network Language Model (RLM) architecture of (Mikolov et al., 2010).
  - Implementing RNN Language Models by Denny Britz
  - Neural Responding Machine for Short-Text Conversation
  - arXiv: A Neural Conversation Model - 2015. Uses LSTM RNNs to generate conversational responses
- Memory and Attention Models
- Natural Language Understanding
- Question Answering and Knowledge Extraction
- Text Classification
  - Convolutional Neural Networks for Sentence Classification
  - Using a CNN for text classification in TensorFlow - text-classification-tf
  - Character-level Convolutional Networks for Text Classification
Tutorials
- Reading Content
- Videos and Online Courses
  - Udacity's Intro to Artificial Intelligence
  - Lecture Slides and Reading Material here
Libraries
- Books
  - Twitter-text - A JavaScript implementation of Twitter's text processing library
  - gensim - Python library to conduct unsupervised semantic modelling from plain text
  - CRFsuite - CRFsuite is an implementation of Conditional Random Fields (CRFs) for labeling sequential data.
  - Practical Natural Language Processing done in Ruby
- Services
  - Amazon Comprehend - NLP and ML suite covers most common tasks like NER, tagging, and sentiment analysis
  - ParallelDots - State of the art Text Analysis API Service ranging from Sentiment Analysis to Intent Analysis
  - Microsoft Cognitive Service
  - TextRazor
NLP in Korean
- Libraries
  - Mecab (Korean) - C++ library for Korean NLP
  - KoalaNLP - Scala library for Korean Natural Language Processing.
  - KoNLP - R package for Korean Natural language processing
- Blogs and Tutorials
  - dsindex's blog
  - Kangwon University's NLP course in Korean
- Datasets
  - KAIST Corpus - A corpus from the Korea Advanced Institute of Science and Technology in Korean.
  - Chosun Ilbo archive - dataset in Korean from one of the major newspapers in South Korea, the Chosun Ilbo.
NLP in Indic languages
- Corpora and Treebanks
  - Hindi Dependency Treebank - A multi-representational multi-layered treebank for Hindi and Urdu
  - Universal Dependencies Treebank in Hindi
NLP in Thai
- Corpora
  - Inter-BEST - A text corpus with 5 million words with word segmentation
  - Prime Minister 29 - Dataset containing speeches of the current Prime Minister of Thailand
- Other Languages
  - arXiv: BKTreeBank
  - ICU Tokenizer
Credits
- Other Languages
NLP in Spanish
- Corpora
  - Spanish Billion words corpus with Word2Vec embeddings
