https://github.com/python3spiders/lianjiaspider
Lianjia web crawler
- Host: GitHub
- URL: https://github.com/python3spiders/lianjiaspider
- Owner: Python3Spiders
- License: apache-2.0
- Created: 2019-02-28T08:17:02.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-07-02T16:02:45.000Z (almost 7 years ago)
- Last Synced: 2025-04-03T14:43:45.904Z (about 1 year ago)
- Topics: lianjia, threadpoolexecutor, webspider
- Language: Python
- Size: 39.1 KB
- Stars: 80
- Watchers: 6
- Forks: 37
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
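# Project Overview
A fast crawler for the Lianjia website built on pagination, a thread pool, and a proxy pool. It can reach roughly 10,000 records per 5 minutes. Commercial use of the collected data is strictly prohibited!
The project also cleans, analyzes, and visualizes the data.
Issues are welcome; let's improve this project together!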
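
The combination of pagination, a thread pool, and a proxy pool described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the URL template `PAGE_URL`, the helper names `fetch_via_proxy` and `crawl_pages`, and the round-robin proxy assignment are all assumptions made for the example, and the real site's paging scheme may differ.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import ProxyHandler, build_opener

# Hypothetical URL template; the real site's paging scheme may differ.
PAGE_URL = "https://example.com/listings/pg{page}/"


def fetch_via_proxy(url, proxy, timeout=10):
    """Download one page through a proxy (default opener settings if proxy is None)."""
    handlers = [ProxyHandler({"http": proxy, "https": proxy})] if proxy else []
    opener = build_opener(*handlers)
    with opener.open(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl_pages(first, last, proxies, fetch=fetch_via_proxy, max_workers=8):
    """Fetch pages first..last concurrently and return {page: html or None}."""
    pages = range(first, last + 1)

    def job(page):
        # Assign proxies round-robin by page number, so threads share no state.
        proxy = proxies[page % len(proxies)] if proxies else None
        try:
            return page, fetch(PAGE_URL.format(page=page), proxy)
        except Exception:
            # A real crawler would retry with another proxy or log the failure.
            return page, None

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(job, pages))
```

Passing `fetch` as a parameter keeps the sketch testable without network access: a stub such as `lambda url, proxy: "<html/>"` can stand in for the real download, and pages whose fetch raises are simply mapped to `None`.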
# About the Author
|Author|[inspurer](https://inspurer.github.io/2018/06/07/%E6%9C%88%E5%B0%8F%E6%B0%B4%E9%95%BF%E7%9A%84%E7%94%B1%E6%9D%A5/#more)|
|:---:|:---:|
|QQ group|[861016679](https://jq.qq.com/?_wv=1027&k=5Js6sKS)|
|Personal blog|[https://inspurer.github.io/](https://inspurer.github.io/)|
For more, follow the WeChat official account: scan the QR code below in WeChat, or search within WeChat for **WeChat official account: 月小水长 (ID: inspurer)**.