https://github.com/shunk031/lineblogscraper
Scraper for LINE Blog in Scrapy
https://github.com/shunk031/lineblogscraper
crawler lineblog scraper scrapy
Last synced: 8 months ago
JSON representation
Scraper for LINE Blog in Scrapy
- Host: GitHub
- URL: https://github.com/shunk031/lineblogscraper
- Owner: shunk031
- License: mit
- Created: 2017-08-16T11:34:47.000Z (about 8 years ago)
- Default Branch: master
- Last Pushed: 2017-09-30T09:50:20.000Z (about 8 years ago)
- Last Synced: 2025-01-10T17:53:45.609Z (9 months ago)
- Topics: crawler, lineblog, scraper, scrapy
- Language: Python
- Size: 10.7 KB
- Stars: 2
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# LINEBlogScraper
Scraper for [LINE BLOG](https://www.lineblog.me/) in Scrapy.
## Requirements
* Python 3.5.1
* Scrapy 1.4.0## How to run
crawl https://lineblog.me//TARGET_BLOG and output blog.json
``` shell
scrapy crawl lineblog_scraper -a start_url='https://lineblog.me/TARGET_BLOG' -o blog.json
```## Downloading images
Will be downloaded and stored in the following directory: `LINEBlogScraper/images/full/`