https://github.com/nihaohello/proxypool
ip代理池,提供在线查询
https://github.com/nihaohello/proxypool
Last synced: 4 months ago
JSON representation
ip代理池,提供在线查询
- Host: GitHub
- URL: https://github.com/nihaohello/proxypool
- Owner: nihaohello
- Created: 2019-06-06T05:12:16.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-06-06T11:10:10.000Z (over 6 years ago)
- Last Synced: 2024-11-21T15:39:34.210Z (12 months ago)
- Language: Python
- Size: 9.77 KB
- Stars: 8
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-hacking-lists - nihaohello/proxypool - ip代理池,提供在线查询 (Python)
README
# proxypool
个人挂起的代理ips:http://youknowi.xin/ips_proxies.txt
0 0 * * * rm -rf "/home/www/htdocs/check_ips.txt" 每天零点删除check_ips.txt 因为个人是a+写文件来进行叠加的
0 */4 * * * python /var/ip_proxy/ip_proxies_crawl.py 每隔开4小时爬取一次最新的ips并验证
0 */2 * * * * python /var/ip_proxy/check_ip_proxies.py 每隔2小时检验一次ip的可用性
0 1 * * 3 python /var/ip_proxy/check_all_ip_proxies.py 每天零点检验一次all_ips历史的ips哪些能用 (最后还是换成每周三点:0 3 * * 0)
1.check_ips.txt 存放每小时爬取的ips 利用ip_proxies_crawl.py爬取西刺 快代理 iphai yunhai网站的当前存活ips
2.all_ips.txt 存放上万的历史代理ips check_all_ip_proxies每周三检测这个文件里ips的存活个数并保存
3.ips_proxies.txt 存放当前能用的ips check_ip_proxies.py每2小时检测一次存活性