https://github.com/ospoon/scrapy_1209
scrapy_1209
https://github.com/ospoon/scrapy_1209
Last synced: 10 months ago
JSON representation
scrapy_1209
- Host: GitHub
- URL: https://github.com/ospoon/scrapy_1209
- Owner: OSpoon
- Created: 2018-12-09T14:14:30.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-12-19T15:16:57.000Z (over 7 years ago)
- Last Synced: 2025-05-19T05:05:40.650Z (about 1 year ago)
- Language: Python
- Size: 5.95 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
### Scrapy爬虫框架
简述:
Scrapy中的各大组件
Scrapy引擎
Scheduler调度器
Downloader下载器
Spider负责获取页面
Item Pipelin负责持久化
安装命令:pip install Scrapy
创建项目:scrapy startproject cnblogSpider
创建爬虫:scrapy genspider cnblogs "cnblogs.com"
执行爬虫:scrapy crawl cnblogs
scrapy crawl cnblogs -o papers.cvs