An open API service indexing awesome lists of open source software.

https://github.com/jshyunbin/comment_crawler

Web crawler for online shopping mall comments using python selectolax and requests.
https://github.com/jshyunbin/comment_crawler

python webcrawler

Last synced: 9 months ago
JSON representation

Web crawler for online shopping mall comments using python selectolax and requests.

Awesome Lists containing this project

README

          

# πŸ› Comment Crawler

This web crawler targets korean online shopping mall comments.

```zsh
python src/main.py -url "http://item.gmarket.co.kr/Item?goodscode=2405613985&ver=638083484789237196"
```

## Supporting Websites

- [Gλ§ˆμΌ“](http://www.gmarket.co.kr)
- [11λ²ˆκ°€](https://www.11st.co.kr) (wip)
- [Naver shopping](https://shopping.naver.com/home)
- [coupang](https://www.coupang.com) (wip)

## How to use

Comment crawler uses flags to specify

| Flags | type | description | default |
|---------------|---------|------------------------------------------------------------------|----------|
| url | string | URL link of the online shopping website. | None |
| max_collect | int | Maximum number of comments to collect. -1 if no limit. | -1 |
| collect_empty | boolean | Collects empty comments if set to True | True |
| browser | enum | Browser used for selenium. Can choose between Safari and Firefox | 'Safari' |