https://github.com/ryanckulp/amz_reviews
Amazon review scraper - download as CSV with auto saving and full pagination
- Host: GitHub
- URL: https://github.com/ryanckulp/amz_reviews
- Owner: ryanckulp
- Created: 2023-05-13T16:25:16.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2023-12-28T12:14:36.000Z (about 1 year ago)
- Last Synced: 2024-10-04T15:28:12.319Z (3 months ago)
- Topics: amazon, amz, reviews, scraper
- Language: Ruby
- Homepage:
- Size: 5.86 KB
- Stars: 16
- Watchers: 3
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
## Amazon Review Scraper
download a CSV of up to 5,000 reviews for any Amazon product. video tutorial for non dorks is [here](https://twitter.com/ryanckulp/status/1657434781708066821).

#### video demo
90 second tutorial: https://youtu.be/p_z-nmyTU7Y?si=Psk3xz_xGl_rmcB_

#### installation
first [install Ruby](https://install.founderhacker.com/steps/choose_os). then ensure the following Ruby gems are installed (`gem install `):

* `watir`
* `headless`

next (optional), move the `amz_reviews` file (from this repo) to an executable path. for Mac users this is often:
`mv amz_reviews /usr/local/bin`
note: this scraper uses Firefox by default. to switch to Chrome, replace `:firefox` with `:chrome` in the code but be forewarned - you may run into headaches. KISS.
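as a rough illustration of the browser switch described above, here is a minimal sketch of how a Watir script typically picks its browser. this is not the repo's code: the `BROWSER` constant and the `open_reviews_page` helper are hypothetical names, and the assumption is simply that the script passes `:firefox` to `Watir::Browser.new` somewhere.

```ruby
# Hypothetical constant: the one symbol you would edit to change browsers.
BROWSER = :firefox # swap for :chrome to drive Chrome instead

# Hypothetical helper: opens the given URL in the chosen browser and
# returns the Watir browser object so the caller can scrape and close it.
def open_reviews_page(url, browser_name = BROWSER)
  require 'watir' # loaded lazily so this file parses even without the gem
  browser = Watir::Browser.new(browser_name)
  browser.goto(url)
  browser
end
```

call `open_reviews_page` with any product page URL, then `browser.close` when done. keeping the symbol in one place means the Firefox/Chrome swap is a one-word change.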
#### using the scraper
simply:
`amz_reviews `

(if you did not move the `amz_reviews` code to an executable path, run it via `./amz_reviews ` from the directory in which you saved it)
scraper parameters
* `asin` is Amazon's product identifier, which you can grab from any product page
* `page_start` and `page_end` are optional; Amazon returns up to 500 pages, or 5,000 reviews

**output**
your terminal will produce a live log of scraping progress, which autosaves every 50 reviews. the finished result will be a CSV report inside the directory from which you ran the scraper.
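to make the page parameters and the autosave behavior concrete, here is a minimal sketch of the loop, not the scraper's actual code. the 500-page cap and the 50-review autosave interval come from the text above; 10 reviews per page is inferred from 5,000 / 500. everything else is an assumption for illustration: `fetch_page` is a hypothetical stand-in that returns fake rows instead of scraping, and the CSV columns and file name are made up.

```ruby
require 'csv'

MAX_PAGES = 500        # Amazon returns up to 500 pages
REVIEWS_PER_PAGE = 10  # inferred: 5,000 reviews / 500 pages
AUTOSAVE_EVERY = 50    # autosave interval described above

# Clamp the optional page arguments to the pages Amazon will serve.
def page_range(page_start = nil, page_end = nil)
  first = [(page_start || 1).to_i, 1].max
  last  = [(page_end || MAX_PAGES).to_i, MAX_PAGES].min
  first..last
end

# Hypothetical stand-in for the real scraping step: returns fake rows.
def fetch_page(asin, page)
  Array.new(REVIEWS_PER_PAGE) do |i|
    [asin, page, "review #{(page - 1) * REVIEWS_PER_PAGE + i + 1}"]
  end
end

# Rewrite the whole CSV so a crash loses at most AUTOSAVE_EVERY reviews.
def save(path, rows)
  CSV.open(path, 'w') do |csv|
    csv << %w[asin page body] # assumed column names
    rows.each { |row| csv << row }
  end
end

def scrape(asin, page_start = nil, page_end = nil, path: "#{asin}.csv")
  rows = []
  page_range(page_start, page_end).each do |page|
    fetch_page(asin, page).each do |row|
      rows << row
      next unless (rows.size % AUTOSAVE_EVERY).zero?

      save(path, rows)
      puts "saved #{rows.size} reviews (page #{page})"
    end
  end
  save(path, rows) # final write picks up any remainder
  rows.size
end
```

for example, `scrape('B07C7FS86W', 1, 5)` walks pages 1 through 5 and writes 50 rows plus a header. omitting both page arguments covers the full 1..500 range.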
sample report from `asin` "B07C7FS86W"
![image](https://github.com/ryanckulp/amz_reviews/assets/3083888/2b2561ec-7a0d-40e8-ab23-7307c9077943)