{"id":13566249,"url":"https://github.com/snoop2head/instagram_hashtag_analysis","last_synced_at":"2025-04-03T23:31:26.164Z","repository":{"id":106550845,"uuid":"237139562","full_name":"snoop2head/instagram_hashtag_analysis","owner":"snoop2head","description":"📷 Crawl and Analyze Instagram Hashtag Data: KoNLPY to gensim word2Vec \u0026 scikit-learn TF-IDF","archived":false,"fork":false,"pushed_at":"2020-07-05T04:22:15.000Z","size":61,"stargazers_count":12,"open_issues_count":0,"forks_count":4,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-11-04T20:42:22.622Z","etag":null,"topics":["adjective","gensim","gensim-word2vec","instagram-hashtag-analysis","konlpy","natural-language-processing","noun","scikit-learn","scikitlearn","tf-idf","word2vec"],"latest_commit_sha":null,"homepage":"https://gaemin.tistory.com/category/Project%20Based%20Learning/%EC%9A%B4%EB%8F%99%20%EC%B6%94%EC%B2%9C%20%EC%9B%B9%EC%84%9C%EB%B9%84%EC%8A%A4%20-%20FitCuration","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/snoop2head.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2020-01-30T04:36:20.000Z","updated_at":"2022-10-05T14:01:30.000Z","dependencies_parsed_at":null,"dependency_job_id":"dfc31b18-5265-4311-a1d1-4f413acb8f60","html_url":"https://github.com/snoop2head/instagram_hashtag_analysis","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snoop2head%2Finstagram_hashtag_analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snoop2head%2Finstagram_hashtag_analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snoop2head%2Finstagram_hashtag_analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snoop2head%2Finstagram_hashtag_analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/snoop2head","download_url":"https://codeload.github.com/snoop2head/instagram_hashtag_analysis/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247097878,"owners_count":20883125,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["adjective","gensim","gensim-word2vec","instagram-hashtag-analysis","konlpy","natural-language-processing","noun","scikit-learn","scikitlearn","tf-idf","word2vec"],"created_at":"2024-08-01T13:02:05.430Z","updated_at":"2025-04-03T23:31:23.736Z","avatar_url":"https://github.com/snoop2head.png","language":"Jupyter Notebook","funding_links":[],"categories":["Jupyter Notebook"],"sub_categories":[],"readme":"# instagram_hashtag_analysis\nCrawl and Analyze Instagram Hashtag Data\n\n## Header Numbers for files\n\n* 0: Crawl Instagram posts according to search result of #keyword\n* 1: Create and wrangle dataset with pandas\n* 2: KoNLPy tagging for Koran nouns, Korean action words\n* 3: Extract similar documents and make word2Vec models with gensim\n* 4: TF-IDF code without using scikit-learn library\n* 5: Extracting similar documents using scikit-learn library's tfidfvectorizer\n\n## 문서 앞에 있는 번호는 다음을 의미함\n* 0: #keyword 검색, 해시태그 기반 인스타그램 크롤링\n\n* 1: 인스타그램 데이터 통합 및 조작 - Pandas 모듈 이용\n\n* 2: KoNLPy 형태소분석 -\u003e 최대 빈도 체언(명사), 서술어(동사, 형용사) 도출\n\n* 3: Gensim을 이용한 Word2Vec 모델 도출 및 유사 문서 추출\n\n* 4: scikitlearn 모듈을 사용하지 않은, Vanilla로 작성한 TF-IDF 예제\n\n* 5: scikitlearn 모듈의 TF-IDF Vectorizer을 이용한 유사 문서 도출\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsnoop2head%2Finstagram_hashtag_analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsnoop2head%2Finstagram_hashtag_analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsnoop2head%2Finstagram_hashtag_analysis/lists"}