text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
https://github.com/stepthom/text_mining_resources
-
Blog Articles, Papers, Case Studies
-
Sentiment Analysis
- CACM: Techniques and Applications for Sentiment Analysis
- Current State of Text Sentiment Analysis from Opinion to Emotion Mining
- Sentiment analysis, Concept analysis and Applications
- On the negativity of negation
- Does sentiment analysis work? A tidy analysis of Yelp reviews
- Sentdex: Quantifying the Qualitative
- Trump2Cash: A stock trading bot powered by Trump tweets about publicly traded companies. A [related blog article](https://medium.com/@maxbraun/this-machine-turns-trump-tweets-into-planned-parenthood-donations-4ece8301e722#.3232hx7gx) describes a bot that turns Trump's tweets into Planned Parenthood donations.
- Sentiment Analysis of 2.2 million tweets from Super Bowl 51
- Lexicon-Based Methods for Sentiment Analysis - Introduces SO-CAL (Semantic Orientation CALculator), a measure of subjectivity and opinion for sentiment analysis.
- Sentiment Analysis Tools Overview, Part 1. Positive and Negative Words Databases
- Data Science 101: Sentiment Analysis in R Tutorial
- That Sentimental Feeling
- Forbes: How Quant Traders Use Sentiment To Get An Edge On The Market
- Breakthrough Research Papers and Models for Sentiment Analysis
- VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text - A rule-based model of sentiment analysis.
- Twitter sentiment analysis using combined LSTM-CNN models
- how-to-perform-sentiment-analysis-using-python-tutorial/
- Unsupervised Sentiment Analysis with Signed Social Networks
- A comparison of Lexicon-based approaches for Sentiment Analysis of microblog posts - A lexicon-based approach for sentiment analysis of Twitter posts, based on lexical resources such as SentiWordNet.
- From tweets to polls: Linking text sentiment to public opinion time series
- Lost at Sea: How Social Media is Helping Cruise Lines Attract Millennials
- Streaming Analytics Tutorial on Azure
- How to Analyze sentiment in Azure
- Twitter Sentiment Analysis Overview - A step-by-step walkthrough on how to perform sentiment analysis using TextBlob.
- ELMO embeddings in Keras using Tensorflow Hub
- Twitter Sentiment Analysis in Python using TextBlob
-
Stemming
-
Stop Words
-
Text Summarization
-
Transformers and Language Models
- Understanding Large Language Models
- A Primer in BERTology: What we know about how BERT works
- The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning)
- Machines Beat Humans on a Reading Test. But Do They Understand?
- the transformer … “explained”?
- The Illustrated Transformer
- OpenAI: Better Language Models and Their Implications - A pre-trained Transformer-based unsupervised language model that achieves state-of-the-art results on many language benchmarks, with a focus on text generation. Controversial limited release. February 14, 2019.
- ChatGPT User Experience: Implications for Education
- New Modes of Learning Enabled by AI Chatbots: Three Methods and Assignments
- Educators Battle Plagiarism As 89% Of Students Admit To Using OpenAI’s ChatGPT For Homework
- ChatGPT: Educational friend or foe? - Hirsh-Pasek and Blinkoff (Temple University). January 2023.
- Don’t Ban ChatGPT in Schools. Teach With It.
- ChatGPT and the Future of Business Education
- Udemy course (January 2023). ChatGPT for Teachers in Education.
- A review of BERT based models
- BERT Explained - State of the art language model for NLP
- ChatGPT launch blog
- Awesome ChatGPT Prompts
- What Every NLP Engineer Needs to Know About Pre-Trained Language Models
- Hugging Face's course on Transformer Models
-
Word and Document Embeddings
- The Current Best of Universal Word Embeddings and Sentence Embeddings
- An Intuitive Understanding of Word Embeddings: From Count Vectors to Word2Vec
- An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation
- Document Embedding with Paragraph Vectors
- GloVe Word Embeddings Demo
- Text Classification With Word2Vec
- Document Embedding
- From Word Embeddings To Document Distances
- Word Embeddings, Bias in ML, Why You Don't Like Math, & Why AI Needs You
- Word Vectors in Natural Language Processing: Global Vectors (GloVe)
- Doc2Vec Tutorial on the Lee Dataset
- Word Embeddings in Python with SpaCy and Gensim
- Deep Contextualized Word Representations
- Universal Language Model Fine-tuning for Text Classification
- Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
- Learned in Translation: Contextualized Word Vectors
- sense2vec
- Skip Thought Vectors
- The Amazing Power of Word Vectors
- Contextual String Embeddings for Sequence Labeling
- A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks - A hierarchical multi-task learning approach for a set of interrelated NLP tasks. Presented at the AAAI conference in January 2019. [Implementation code](https://github.com/huggingface/hmtl).
- An Idiot’s Guide to Word2vec Natural Language Processing
- Word2vec: fish + music = bass
- Universal Sentence Encoder Visually Explained
- NLP's ImageNet moment has arrived - On pre-trained NLP language models, drawing parallels to ImageNet's contributions to computer vision.
- Get Busy with Word Embeddings- An Introduction (February 2018)
- Sequence to Sequence Learning with Neural Networks
- Distributed Representations of Sentences and Documents
-
-
Blogs
-
Books
- Mastering Text Mining with R
- Text Mining in Practice with R
- Natural Language Processing with Transformers, Revised Edition
- Getting Started with Natural Language Processing
- Blueprints for Text Analytics Using Python: Machine Learning-Based Solutions for Common Real World (NLP) Applications
- Practical Natural Language Processing
- Natural Language Processing with PyTorch
- Python Natural Language Processing
- Natural Language Processing: Python and NLTK
- Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning
- Applied Natural Language Processing With Python
- Taming Text: How to Find, Organize, and Manipulate It - A hands-on guide to innovative tools and techniques for finding, organizing, and manipulating unstructured text.
- Speech and Language Processing
- Foundations of Statistical Natural Language Processing
- Language Processing with Perl and Prolog: Theories, Implementation, and Application (Cognitive Technologies)
- Handbook of Natural Language Processing
- Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications
- Fundamentals of Predictive Text Mining
- Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More
- Neural Network Methods for Natural Language Processing
- Text Mining: A Guidebook for the Social Sciences
- Practical Text Analytics: Interpreting Text and Unstructured Data for Business Intelligence
- Neural Network Methods in Natural Language Processing
- Machine Learning for Text (2018)
- Natural Language Processing in Spanish
- Foundations of Computational Linguistics Human-Computer Communication in Natural Language
- Statistical Methods for Speech Recognition
- How To Label Data
- An Introduction to Information Retrieval
- Mastering Natural Language Processing with Python
- Text Mining with R
- Natural Language Processing with Python
- Deep Learning with Text
-
Datasets
-
Knowledge Graphs
- data.world's Text Datasets
- Insight Resources Datasets
- Consumer Complaint Database
- Sentiment Labelled Sentences Data Set
- Amazon product data
- Data is Plural
- FiveThirtyEight's datasets
- r/datasets
- R's `datasets` package
- 200,000 Russian Troll Tweets - Released by Congress from Twitter suspended accounts and removed from public view.
-