https://github.com/keivanipchihagh/fun-text-mining-with-simpsons
Exploratory data analysis for approximately 600 Simpsons episodes and scripts, topic modeling and text generation.
https://github.com/keivanipchihagh/fun-text-mining-with-simpsons
bag-of-words sentiment-analysis simpsons tf-idf topic-modeling word2vec wordcloud
Last synced: about 1 month ago
JSON representation
Exploratory data analysis for approximately 600 Simpsons episodes and scripts, topic modeling and text generation.
- Host: GitHub
- URL: https://github.com/keivanipchihagh/fun-text-mining-with-simpsons
- Owner: keivanipchihagh
- License: apache-2.0
- Created: 2021-06-03T07:32:32.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-07-13T18:44:54.000Z (about 4 years ago)
- Last Synced: 2025-04-07T08:37:34.440Z (6 months ago)
- Topics: bag-of-words, sentiment-analysis, simpsons, tf-idf, topic-modeling, word2vec, wordcloud
- Language: Jupyter Notebook
- Homepage:
- Size: 25.3 MB
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Fun Text Mining with Simpsons
Exploratory data analysis for approximately 600 Simpsons episodes and scripts, topic modeling and text generation.## Table of Content
- Identify most on-screen characters
- Group the most common words for each character (WordClouds)
- Vectorizing
- Bag-Of-Words
- TF-IDF
- Word-Vector (Google's Word2Vec)
- Sentiment Analysis
- Topic Modeling
- Best & worst episode## Credits, Sources & Links
- [Kaggle Notebook](https://www.kaggle.com/ambarish/fun-in-text-mining-with-simpsons)