text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
https://github.com/stepthom/text_mining_resources
Last synced: 3 days ago
JSON representation
-
Blog Articles, Papers, Case Studies
-
Dimensionality Reduction
-
Document Classification
- Naive Bayes and Text Classification - depth overview of both the Naive Bayes algorithm and how it can be used in the document classification process.
- Bag of Tricks for Efficient Text Classification
- Text Classifier Algorithms in Machine Learning
- Classifying Documents in the Reuters-21578 R8 Dataset
- Multi-Class Text Classification with Scikit-Learn - class problems, such as classifying consumer complaints into one of 12 categories.
- Machine Learning with Text in scikit-learn (PyCon 2016) - learn in the document classification process.
- Towards Explainable NLP: A Generative Explanation Framework for Text Classification
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Text Classification in Python with scikit-learn and nltk - learn.
- Learning Document Embeddings by Predicting N-grams for Sentiment Classification of Long Movie Reviews - paper with code on Github
- Towards Explainable NLP: A Generative Explanation Framework for Text Classification
- Introducing state of the art text classification with universal language models
-
Document Clustering and Document Similarity
- Text Clustering: Get quick insights from Unstructured Data
- Document Clustering
- Document Clustering: A Detailed Review
- Text mining and sentiment analysis on video game user reviews using SAS® Enterprise Miner
- Who wrote the anti-Trump New York Times op-ed? Using tidytext to find document similarity
- Text Clustering: Get quick insights from Unstructured Data
- Document Clustering with Python
- Who wrote the anti-Trump New York Times op-ed? Using tidytext to find document similarity
- Document Clustering
- Text mining and sentiment analysis on video game user reviews using SAS® Enterprise Miner
-
Entity and Information Extraction
- Entity Extraction and Network Analysis
- Natural Language Processing for Information Extraction
- NLP Techniques for Extracting Information - depth exploration of the seven steps framework of NLP data mining tools and techniques.
- NLP Techniques for Extracting Information - depth exploration of the seven steps framework of NLP data mining tools and techniques.
- Entity Extraction and Network Analysis
-
Fuzzy Matching, Probabilistic Matching, Record Linkage, Etc. <a id="fuzzy-matching"></a>
- agrep method in R
- fuzzywuzzy package in R
- Fuzzy String Matching – a survival skill to tackle unstructured information
- Fuzzy merge in R
- Learning Text Similarity with Siamese Recurrent Networks
- R package fastLink: Fast Probabilistic Record Linkage
- Learning Text Similarity with Siamese Recurrent Networks
- Dedupe - resolution.
- recordlinkage
-
General <a id="general-articles"></a>
- NLP in healthcare
- AI Harvard Business Review
- Why Accuracy in Natural Language Processing is Crucial to the Future of AI in Retail
- Natural Language Processing is Fun! How computers understand Human Language
- WEF Live Campaign - Twitter fed Global News Topics & Sentiment Tracker - Live Jan 2019
- From Natural Language to Calendar Entries, with Clojure
- Ask HN: How Can I Get into NLP (Natural Language Processing)?
- Ask HN: What are the best tools for analyzing large bodies of text?
- Quora: How do I learn Natural Language Processing?
- Quora Topic: Natural Language Processing
- The Definitive Guide to Natural Language Processing
- Futures of text
- R or Python on Text Mining
- Where to start in Text Mining
- Text Mining in R and Python: 8 Tips To Get Started
- Mining Twitter Data with Python (Part 1: Collecting Data)
- Why Text Mining May Be The Next Big Thing
- SAS CEO offers analytics over BI, reveals use cases for text analytics
- Value and benefits of text mining
- Text Mining South Park - A Text Mining blog which covers on a variety of topics.
- Natural Language Processing Tutorial
- An Introduction to Text Mining using Twitter Streaming API and Python
- How To Get Into Natural Language Processing
- Comparison of the Most Useful Text Processing APIs
- Natural language based financial forecasting: a survey
- 5 Heroic Tools for Natural Language Processing
- Natural Language Processing unlocks hidden data to transform healthcare efficiency, quality and cost
- Extracting medical problems from electronic clinical documents
- How to Write a Spelling Corrector - by Peter Norvig
- Using AI to unleash the power of unstructured government data - to-comprehend primer and background on NLP, and the various applications NLP could be used on unstructured Government text data. The article includes many US Government examples on how NLP is currently deployed across different domains (e.g. to help analyze public feedback/sentiment analysis/topic modelling, to improve forensic investigations, to aid in Government policy-making and regulatory compliance). The key point is to apply different NLP techniques to explore and uncover key Government intelligence insights.
- Extracting Features of Entertainment Products: A Guided Latent Dirichlet Allocation Approach Informed by the Psychology of Media Consumption - level consumption.” This academic article provides both a framework and managerial implications that suggest the application of LDA and NLP for feature extraction in entertainment products that can aid in traditional content-based consumer behavior models, and relevant marketing models applied to the media and entertainment industry.
- Lessons learned building natural language processing systems in health care
- How Algorithms Know What You’ll Type Next
- Natural Language Processing: An Introduction
- NLP in healthcare
- From Natural Language to Calendar Entries, with Clojure
- Quora: How do I learn Natural Language Processing?
- Natural Language Processing: An Introduction
- Betty: a friendly English-like interface for your command line.
- 100 Must-Read NLP Papers
- Crowdsourcing Ground Truth for Medical Relation Extraction
- Using AI to unleash the power of unstructured government data - to-comprehend primer and background on NLP, and the various applications NLP could be used on unstructured Government text data. The article includes many US Government examples on how NLP is currently deployed across different domains (e.g. to help analyze public feedback/sentiment analysis/topic modelling, to improve forensic investigations, to aid in Government policy-making and regulatory compliance). The key point is to apply different NLP techniques to explore and uncover key Government intelligence insights.
- SAS CEO offers analytics over BI, reveals use cases for text analytics
-
Knowledge Graphs
-
Machine Translation
- Blog Post: Found in translation: More accurate, fluent sentences in Google Translate
- NYTimes: The Great A.I. Awakening
- Machine Learning Translation and the Google Translate Algorithm
- The Annotated Transformer - by-line implementation of "Attention Is All You Need".
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding - research/bert). [Pytorch port.]( https://github.com/codertimo/BERT-pytorch)
- Phrase-Based & Neural Unsupervised Machine Translation - based model. Awarded as the Best Paper Award at EMNLP 2018. [Implementation code](https://github.com/facebookresearch/UnsupervisedMT).
- Paper Dissected: “Attention is All You Need” Explained
- Neural Machine Translation (seq2seq) Tutorial
-
Q&A Systems, Chatbots <a id="qa-systems"></a>
- Meet Lucy: Creating a Chatbot Prototype
- Microsoft Bot Framework
- Training Millions of Personalized Dialogue Agents
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- Building a Simple Chatbot from Scratch in Python (Using NLTK)- September 2018
- A Survey on Dialogue Systems: Recent Advances and New Frontiers
- Examining the Impact of an Automated Translation Chatbot on Online Collaborative Dialog for Incidental L2 Learning
- Generative Model Chatbots- May 2017
- A Guide to Building a Multi-Featured Slackbot with Python- March 2017
- The Road to a Conversational Banking Future-February 2019
- Chatbots - Designing intents and entities for NLP Models
- Task-oriented Dialogue System for Automatic Diagnosis
- Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
- NLP — Building a Question Answering model
- Ultimate Guide to Leveraging NLP & Machine Learning for your Chatbot
- The Road to a Conversational Banking Future-February 2019
-
Programming Languages
Categories
Sub Categories
Knowledge Graphs
256
Sentiment Analysis
110
Q&A Systems, Chatbots <a id="qa-systems"></a>
107
Concept Analysis/Topic Modeling <a id="concept-analysis"></a>
61
Transformers and Language Models
57
Lexicons for Sentiment Analysis
55
General <a id="general-articles"></a>
43
Word and Document Embeddings
33
Document Classification
31
Cleaning
29
Scraping
24
Deep Learning
22
Document Clustering and Document Similarity
10
Fuzzy Matching, Probabilistic Matching, Record Linkage, Etc. <a id="fuzzy-matching"></a>
9
Machine Translation
8
Stemming
6
Biases in NLP
6
Dimensionality Reduction
6
Sarcasm Detection
5
Entity and Information Extraction
5
Text Summarization
4
Stop Words
2
Keywords
natural-language-processing
10
nlp
9
machine-learning
8
python
6
text-mining
4
pdf
3
word-embeddings
3
pytorch
3
deep-learning
3
tensorflow
3
text-visualization
2
topic-modeling
2
computational-social-science
2
transformer
2
record-linkage
2
python-library
2
twitter
2
dataset
2
entity-resolution
2
dedupe
2
r
1
tidy-data
1
tidyverse
1
dedupe-library
1
data-extraction
1
extract
1
java
1
layout
1
pdfbox
1
text
1
attention
1
bert
1
paper
1
tutorial
1
bot
1
trading
1
trump
1
leaderboard
1
visual-analysis
1
bots
1
chatbot
1
chatgpt
1
chatgpt-api
1
language
1
artificial-intelligence-algorithms
1
artificial-neural-networks
1
bayesian-statistics
1
computer-vision
1
deep-neural-networks
1
deep-reinforcement-learning
1