{"id":17040665,"url":"https://github.com/minsoo-web/py-scraper","last_synced_at":"2026-04-11T08:04:56.714Z","repository":{"id":45772093,"uuid":"224706891","full_name":"minsoo-web/py-scraper","owner":"minsoo-web","description":"python과 selenium을 통해 구현한 네이버 쇼핑몰 리뷰, 연관 검색어 크롤링 앱입니다.","archived":false,"fork":false,"pushed_at":"2020-12-31T06:35:59.000Z","size":2486,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-17T18:55:55.664Z","etag":null,"topics":["bs4","flask","python","scraper","selnium"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/minsoo-web.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-11-28T18:10:40.000Z","updated_at":"2022-07-16T05:51:31.000Z","dependencies_parsed_at":"2022-09-01T21:03:06.725Z","dependency_job_id":null,"html_url":"https://github.com/minsoo-web/py-scraper","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minsoo-web%2Fpy-scraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minsoo-web%2Fpy-scraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minsoo-web%2Fpy-scraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minsoo-web%2Fpy-scraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/minsoo-web","download_url":"https://codeload.github.com/minsoo-web/py-scraper/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245040189,"owners_count":20551301,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bs4","flask","python","scraper","selnium"],"created_at":"2024-10-14T09:10:11.221Z","updated_at":"2026-04-11T08:04:56.688Z","avatar_url":"https://github.com/minsoo-web.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003ch1 align=\"center\"\u003e\n    Python Web Scraper\n\u003c/h1\u003e\n\n\u003cp align=\"center\"\u003e\n    \u003ca href=\"https://img.shields.io\"\u003e\n        \u003cimg alt=\"python-version\" src=\"https://img.shields.io/badge/python%20version-3.8.2-blue\"\u003e\u003c/a\u003e\n    \u003ca href=\"https://img.shields.io\"\u003e\n        \u003cimg alt=\"lovePython\" src=\"https://img.shields.io/badge/love%20python%3F-yes%20%F0%9F%94%A5-%23FFE873\"\u003e\u003c/a\u003e\n    \u003ca href=\"https://img.shields.io\"\u003e\n        \u003cimg alt=\"lovePython\" src=\"https://img.shields.io/badge/flake8-pass-%23306998\"\u003e\u003c/a\u003e\n    \u003ca href=\"https://hits.seeyoufarm.com\"\u003e\u003cimg src=\"https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2FMinsoo-web%2Fpy-scraper\u0026count_bg=%233D6BC8\u0026title_bg=%23555555\u0026icon=\u0026icon_color=%23E7E7E7\u0026title=today\u0026edge_flat=false\"/\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n    \u003cimg alt=\"banner\" src=\"https://d33wubrfki0l68.cloudfront.net/2d12446f148533a20e1d9271da57012ca4c27766/69e14/blog/selenium-python/header_selenium_python_hu858c713577cea0e612703bbde5071118_85692_825x0_resize_catmullrom_2.png\" /\u003e\n\u003c/p\u003e\n\n[이미지 출처](https://www.scrapingbee.com/blog/selenium-python/)\n\n## 🚀 What is this\n\n`bs4`와 `selenium`을 이용해서 만든 웹스크레퍼입니다.\n\n1. [네이버 쇼핑몰](https://shopping.naver.com/)의 상세 페이지에서 리뷰들을 가져와 csv 파일로 추출할 수 있습니다.\n2. 특정 item의 연관 검색어를 list로 추출할 수 있습니다.\n3. 특정 item의 전체 검색 갯수를 int로 추출할 수 있습니다.\n4. 네이버 검색에서의 카테고리별 추천 순서를 모바일/데스크톱 화면 기준으로 추출할 수 있습니다.\n5. 네이버 검색에서의 섹션 순서를 모바일/데스크톱 화면 기준으로 추출할 수 있습니다.\n\n### 리뷰 추출 데이터\n\n\u003e 평점, 판매 회사(구매한 곳), 리뷰 작성 날짜, 제목, 내용\n\n## 🍿 Setup\n\n```bash\n# clone this project\n$ cd py-scraper\n$ pip install -r requirements.txt\n\n# 설치 리스트\nrequests\nFlask\nbeautifulsoup4\nbs4\nselenium\n```\n\n## 🔥 Run Program\n\n```bash\n# ~/your_work_space/py-scraper\n$ python app.py review -u {url} -l {limits of page} # 리뷰 크롤링\n$ python app.py related -i {search_item} # 연관 검색어\n$ python app.py keyword -i {search_item} # 쇼핑 키워드 순서 추출\n```\n\n### ✔️ 예제\n\n```bash\n# 예제 2 페이지만 스크랩\n$ python app.py review -u \"https://search.shopping.naver.com/catalog/15784793132?cat_id=50002334\u0026nv_mid=15784793132\u0026query=jbl+free+x\" -l 2\n# 예제 모든 페이지 스크랩 default 가 모든 페이지입니다. 함축표현 'csv' 로도 가능합니다.\n$ python app.py csv -u \"https://search.shopping.naver.com/catalog/15784793132?cat_id=50002334\u0026nv_mid=15784793132\u0026query=jbl+free+x\"\n\n# 인자 값 도움말\n$ python app.py --help\n```\n\n### ✔️ 연관 검색어 예제\n\n```bash\n# 네이버 쇼핑에서 제공하는 가디건과 관련된 연관 검색어를 추출\n$ python app.py related --item \"가디건\"\n# 함축 표현\n$ python app.py re -i \"롱패딩\"\n```\n\n### ✔️ 쇼핑 키워드 예제\n\n```bash\n# 네이버 검색에서 데스크톱 / 모바일 기준으로 쇼핑 카테고리 + 섹션 순서를 추출\n$ python app.py keyword --item \"키보드\"\n$ python app.py key -i \"물통\"\n```\n\n### 📷 실행 화면\n\n![실행 화면](./images/run.png)\n\n## 👀 result\n\n### naver_review.csv\n\n```csv\n평점,만족도,날짜,제목,내용\n1,1,18.10.08,리뷰제목,리뷰내용\n1,1,18.10.08,리뷰제목,리뷰내용\n1,1,18.10.08,리뷰제목,리뷰내용\n1,1,18.10.08,리뷰제목,리뷰내용\n...\n```\n\n### 연관 검색어\n\n![실행 화면](./images/run2.png)\n\n### 쇼핑 키워드\n\n![실행 화면](./images/shop_keyword.png)\n\n### 📷 리뷰 csv 파일\n\n![결과 화면](./images/result.png)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fminsoo-web%2Fpy-scraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fminsoo-web%2Fpy-scraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fminsoo-web%2Fpy-scraper/lists"}