Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/snoop2head/instagram_hashtag_analysis
๐ท Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec & scikit-learn TF-IDF
https://github.com/snoop2head/instagram_hashtag_analysis
adjective gensim gensim-word2vec instagram-hashtag-analysis konlpy natural-language-processing noun scikit-learn scikitlearn tf-idf word2vec
Last synced: about 1 month ago
JSON representation
๐ท Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec & scikit-learn TF-IDF
- Host: GitHub
- URL: https://github.com/snoop2head/instagram_hashtag_analysis
- Owner: snoop2head
- License: mit
- Created: 2020-01-30T04:36:20.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2020-07-05T04:22:15.000Z (about 4 years ago)
- Last Synced: 2024-07-26T23:49:30.777Z (about 1 month ago)
- Topics: adjective, gensim, gensim-word2vec, instagram-hashtag-analysis, konlpy, natural-language-processing, noun, scikit-learn, scikitlearn, tf-idf, word2vec
- Language: Jupyter Notebook
- Homepage: https://gaemin.tistory.com/category/Project%20Based%20Learning/%EC%9A%B4%EB%8F%99%20%EC%B6%94%EC%B2%9C%20%EC%9B%B9%EC%84%9C%EB%B9%84%EC%8A%A4%20-%20FitCuration
- Size: 59.6 KB
- Stars: 12
- Watchers: 2
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# instagram_hashtag_analysis
Crawl and Analyze Instagram Hashtag Data## Header Numbers for files
* 0: Crawl Instagram posts according to search result of #keyword
* 1: Create and wrangle dataset with pandas
* 2: KoNLPy tagging for Koran nouns, Korean action words
* 3: Extract similar documents and make word2Vec models with gensim
* 4: TF-IDF code without using scikit-learn library
* 5: Extracting similar documents using scikit-learn library's tfidfvectorizer## ๋ฌธ์ ์์ ์๋ ๋ฒํธ๋ ๋ค์์ ์๋ฏธํจ
* 0: #keyword ๊ฒ์, ํด์ํ๊ทธ ๊ธฐ๋ฐ ์ธ์คํ๊ทธ๋จ ํฌ๋กค๋ง* 1: ์ธ์คํ๊ทธ๋จ ๋ฐ์ดํฐ ํตํฉ ๋ฐ ์กฐ์ - Pandas ๋ชจ๋ ์ด์ฉ
* 2: KoNLPy ํํ์๋ถ์ -> ์ต๋ ๋น๋ ์ฒด์ธ(๋ช ์ฌ), ์์ ์ด(๋์ฌ, ํ์ฉ์ฌ) ๋์ถ
* 3: Gensim์ ์ด์ฉํ Word2Vec ๋ชจ๋ธ ๋์ถ ๋ฐ ์ ์ฌ ๋ฌธ์ ์ถ์ถ
* 4: scikitlearn ๋ชจ๋์ ์ฌ์ฉํ์ง ์์, Vanilla๋ก ์์ฑํ TF-IDF ์์
* 5: scikitlearn ๋ชจ๋์ TF-IDF Vectorizer์ ์ด์ฉํ ์ ์ฌ ๋ฌธ์ ๋์ถ