https://github.com/shunk031/amebloscraper
Scraper for Ameblo in Scrapy
https://github.com/shunk031/amebloscraper
ameblo crawler scraper scrapy
Last synced: 3 months ago
JSON representation
Scraper for Ameblo in Scrapy
- Host: GitHub
- URL: https://github.com/shunk031/amebloscraper
- Owner: shunk031
- License: mit
- Created: 2016-04-18T13:53:15.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2017-08-19T03:33:10.000Z (about 8 years ago)
- Last Synced: 2025-02-28T03:32:55.395Z (8 months ago)
- Topics: ameblo, crawler, scraper, scrapy
- Language: Python
- Homepage:
- Size: 25.4 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# AmebloScraper
Scraper for [Ameblo](https://ameblo.jp/) in Scrapy.
## Requirements
* Python 3.5.1
* Scrapy 1.4.0
* Pillow## How to run
crawl https://ameblo.jp/TARGET_BLOG and output blog.json
``` shell
scrapy crawl ameblo_scraper -a start_url='https://ameblo.jp/TARGET_BLOG' -o blog.json
```## Downloading images
Will be downloaded and stored in the following directory: `AmebloScraper/images/full/`