https://github.com/yowenter/jd-scrapy
https://github.com/yowenter/jd-scrapy
Last synced: 9 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/yowenter/jd-scrapy
- Owner: yowenter
- Created: 2015-04-05T15:24:48.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2015-04-06T15:10:55.000Z (almost 11 years ago)
- Last Synced: 2025-02-15T20:49:51.737Z (11 months ago)
- Language: Python
- Size: 121 KB
- Stars: 0
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README
Awesome Lists containing this project
README
Author:wenter
Email:wenter.wu@gmail.com
Org:DaoCloud
This is a crawler for jd.com .
Requires:
Python2.7
Scrapy
BeautifulSoup
RUN in jd project dir:
-scrapy crawl jd-category
ps:
You can edit the settings.py .
FILTER_CATEGORIES=['beauty']
That means the spider will crawl category 'beauty'.
If you wanna crawl more, you can add ,like this: FILTER_CATEGORIES=['beauty','car']