text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
https://github.com/stepthom/text_mining_resources
Last synced: 8 days ago
JSON representation
-
Misc
-
Lexicons for Sentiment Analysis
- A Complete Exploratory Data Analysis and Visualization for Text Data
- Detecting Gang-Involved Escalation on Social Media Using Context
- Reasoning about Actions and State Changes by Injecting Commonsense Knowledge - scale corp
- Using Natural Language Processing for Automatic Detection of Plagiarism
- Probabilistic Graphical Models: Lagrangian Relaxation Algorithms for Natural Language Processing
- Human Emotion
- A Complete Exploratory Data Analysis and Visualization for Text Data
-
-
Online courses
-
Knowledge Graphs
- Udemy: Deep Learning and NLP A-Z™: How to create a ChatBot
- Udemy: Natural Language Processing with Deep Learning in Python
- Udemy: NLP - Natural Language Processing with Python
- Udemy: Deep Learning: Advanced NLP and RNNs
- Udemy: Natural Language Processing and Text Mining Without Coding
- Stanford CS 224N / Ling 284
- Coursera: Applied Text Mining in Python
- Coursera: Nartual Language Processing
- Coursera: Sequence Models for Time Series and Natural Language Processing
- Coursera: Coursera: Clinical Natural Language Processing
- DataCamp: Natural Language Processing Fundamentals in Python
- DataCamp: Sentiment Analysis in R: The Tidy Way
- DataCamp: Text Mining: Bag of Words
- DataCamp: Building Chatbots in Python
- DataCamp: Advanced NLP with spaCy
- Natural Language Processing | Dan Jurafsky, Christopher Manning
- CMU CS 11-747: Neural Network for NLP
- UT CS 388: Natural Language Processing
- Columbia: COMS W4705: Natural Language Processing
- Columbia: COMS E6998: Machine Learning for Natural Language Processing (Spring 2012)
- Machine Translation: Spring 2016
- Big Data University: Advanced Text Analytics – Getting Results with SystemT
- Udacity: Natural Language Processing Nanodegree
- Courses for "natural language processing" on Coursera
- Deep Learning Drizzle
- Natural Language Processing | Dan Jurafsky, Christopher Manning
- Deep Learning for NLP
- CMU CS 11-747: Neural Network for NLP
- YSDA NLP course
- CMU Language and Statistics II: (More) Empirical Methods in Natural Language Processing
- Columbia: COMS W4705: Natural Language Processing
- Columbia: COMS E6998: Machine Learning for Natural Language Processing (Spring 2012)
- Machine Translation: Spring 2016
- DataCamp: Sentiment Analysis in R: The Tidy Way
- DataCamp: Natural Language Processing Fundamentals in Python
- DataCamp: Text Mining: Bag of Words
-
-
Online Demos and Tools
-
Knowledge Graphs
- Stanford Parser
- Stanford CoreNLP
- word2vec demo
- sense2vec: Semantic Analysis of the Reddit Hivemind
- RegexPal
- Cognitive Computation Group - Part of Speech Tagging Demo - of-speech tagging, information extraction tasks etc.
- Cognitive Computation Group - Part of Speech Tagging Demo - of-speech tagging, information extraction tasks etc.
- Another word2vec demo
- Another word2vec demo
- AllenNLP Demo
- sense2vec: Semantic Analysis of the Reddit Hivemind
- Cognitive Computation Group - Part of Speech Tagging Demo - of-speech tagging, information extraction tasks etc.
-
-
Other Curated Lists
-
Lexicons for Sentiment Analysis
- awesome-machine-learning
- Chinese NLP Tools
- Over 150 of the Best Machine Learning, NLP, and Python Tutorials I’ve Found
- Awesome Deep Learning for Natural Language Processing (NLP)
- Association for Computational Linguistics Papers Anthology
- Over 150 of the Best Machine Learning, NLP, and Python Tutorials I’ve Found
-
-
Products
-
Knowledge Graphs
- Systran - Enterprise Translation Products
- SAS Sentiment Analysis
- STATISTICA
- Text Mining (Big Data, Unstructured Data)
- Gate
- Video: How IBM Watson learns (3 minutes)
- Video: IBM Watson on Jeapardy! (10 minutes)
- Video: IBM Watson: The Science Behind an Answer (7 minutes)
- Stocktwits
- Meltwater
- Lexalytics Sematria
- Alchemy API
- brat
- Ask Data by Tableau Software Inc. - on to help assist existing Tableau platform users with retrieving quick and easy data visualizations to drive business intelligence insights. Similar to a search engine user interface, Tableau’s Ask Data feature interface applies NLP from user text input to extract key words to find data analytics and business insights quickly on the Tableau Platform.
- Microsoft Azure Text Analytics
- Amazon Lex
- Amazon Comprehend
- Apache PDFBox
- SO: How to extract text from a PDF?
- Tools for Extracting Data and Text from PDFs - A Review
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- Lyrebird.ai - Realistic Voice Cloning and Text-to-Speech” recognition platform. This Canadian start-up has created a product/platform that syncs both voice cloning with text-to-speech. Lyrebird recognizes the intonations and voice patterns from audio recordings, and overlays text data input to recreate a text-to-speech audio file output from the selected voice pattern audio recording.
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- Dialogflow
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- How I used NLP (SpaCy) to screen Data Science Resumes
- LightTag Annotation Tool
- Anafora - based raw text annotation tool
- PDFLayoutTextStripper: Converts a pdf file into a text file while keeping the layout of the original pdf.
- pdftabextract: A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
- Tools for Extracting Data and Text from PDFs - A Review
- PyPDF2
- Ask Data by Tableau Software Inc. - on to help assist existing Tableau platform users with retrieving quick and easy data visualizations to drive business intelligence insights. Similar to a search engine user interface, Tableau’s Ask Data feature interface applies NLP from user text input to extract key words to find data analytics and business insights quickly on the Tableau Platform.
- SAS Sentiment Analysis
- RapidMiner
- Annotation Lab - Free End-to-End No-Code platform for text annotation and DL model training/tuning. Out-of-the-box support for Named Entity Recognition, Classification, Relation extraction and Assertion Status Spark NLP models. Unlimited support for users, teams, projects, documents.
- Stocktwits
- Meltwater
- CrowdFlower
- Alchemy API
- Monkey Learn
- Google's Colab - to-go Notebook environment that makes it easy to get up and running.
- Ask Data by Tableau Software Inc. - on to help assist existing Tableau platform users with retrieving quick and easy data visualizations to drive business intelligence insights. Similar to a search engine user interface, Tableau’s Ask Data feature interface applies NLP from user text input to extract key words to find data analytics and business insights quickly on the Tableau Platform.
- Microsoft Azure Text Analytics
- Tabula: A tool for liberating data tables locked inside PDF files.
- SO: How to extract text from a PDF?
- How I used NLP (SpaCy) to screen Data Science Resumes
-
Programming Languages
Categories
Sub Categories
Knowledge Graphs
252
Sentiment Analysis
110
Q&A Systems, Chatbots <a id="qa-systems"></a>
107
Concept Analysis/Topic Modeling <a id="concept-analysis"></a>
61
Transformers and Language Models
57
Lexicons for Sentiment Analysis
53
General <a id="general-articles"></a>
43
Word and Document Embeddings
33
Document Classification
31
Cleaning
29
Scraping
24
Deep Learning
22
Document Clustering and Document Similarity
10
Fuzzy Matching, Probabilistic Matching, Record Linkage, Etc. <a id="fuzzy-matching"></a>
9
Machine Translation
8
Stemming
6
Biases in NLP
6
Dimensionality Reduction
6
Sarcasm Detection
5
Entity and Information Extraction
5
Text Summarization
4
Stop Words
2
Keywords
natural-language-processing
10
nlp
9
machine-learning
8
python
6
text-mining
4
pdf
3
word-embeddings
3
pytorch
3
deep-learning
3
tensorflow
3
text-visualization
2
topic-modeling
2
computational-social-science
2
transformer
2
record-linkage
2
python-library
2
twitter
2
dataset
2
entity-resolution
2
dedupe
2
r
1
tidy-data
1
tidyverse
1
dedupe-library
1
data-extraction
1
extract
1
java
1
layout
1
pdfbox
1
text
1
attention
1
bert
1
paper
1
tutorial
1
bot
1
trading
1
trump
1
leaderboard
1
visual-analysis
1
bots
1
chatbot
1
chatgpt
1
chatgpt-api
1
language
1
artificial-intelligence-algorithms
1
artificial-neural-networks
1
bayesian-statistics
1
computer-vision
1
deep-neural-networks
1
deep-reinforcement-learning
1