Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/coghost/crawlers
crawlers in one
https://github.com/coghost/crawlers
crawler python3 staticimg weibo
Last synced: 2 days ago
JSON representation
crawlers in one
- Host: GitHub
- URL: https://github.com/coghost/crawlers
- Owner: coghost
- License: gpl-3.0
- Created: 2017-10-26T03:59:18.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2020-04-02T07:09:24.000Z (almost 5 years ago)
- Last Synced: 2024-11-09T02:52:29.627Z (about 2 months ago)
- Topics: crawler, python3, staticimg, weibo
- Language: Python
- Size: 266 KB
- Stars: 1
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
---
title: python 爬虫
---# crawlers
> crawlers in one `python 3.6`
## Thanks to
- [chenjiandongx](# https://github.com/chenjiandongx/awesome-spider)
- [爬虫攻防](https://www.zhuyingda.com/blog/article.html?id=17&origin=segment)
- [fuck-login](https://github.com/xchaoinfo/fuck-login)## DONE
- [x] 静态图片下载
- [44style](http://44.style/)
- [mmjpg](www.mmjpg.com)
- ...
- [x] google crx 插件爬取
- [chromecj](http://chromecj.com/)
- [cnplugins](http://www.cnplugins.com)
- [x] luoo 网音乐
- [x] one 读书
- [x] [sdifen周](http://www.sdifen.com/)
- [x] [伯乐python资源](http://hao.jobbole.com/?catid=144)
- [x] 电影查询
- [x] [电影天堂](http://www.dytt8.net/)
- [x][66ys](http://66ys.cc/)
- [x] 东奥会计题库
- [x] 代理## docker machines
### mongo
```sh
docker run --name luoo_mg \
-v /Luoo/db/data:/data/db \
-p :27017 \
-d mongo:latest --smallfiles
```### redis
> 切记: 在启动前需要先建立好 data 目录, 和 redis.conf 文件
- docker
```sh
docker run \
--name=crawl_redis \
-tid \
-p :6379 \
-v /Luoo/redis/data:/data \
-v /Luoo/redis/redis.conf:/usr/local/etc/redis/redis.conf \
redis redis-server /usr/local/etc/redis/redis.conf
```- `redis.conf`
```sh
port 6379
timeout 300
loglevel verbose
save 900 1
save 300 10
save 60 10000
rdbcompression yes
appendonly yes
appendfsync everysec
requirepass 123456
```