{"id":20450005,"url":"https://github.com/turboway/glidedsky","last_synced_at":"2025-04-13T02:11:08.673Z","repository":{"id":104902655,"uuid":"293473609","full_name":"TurboWay/glidedsky","owner":"TurboWay","description":"glidedsky 通关笔记","archived":false,"fork":false,"pushed_at":"2021-01-14T02:19:19.000Z","size":684,"stargazers_count":28,"open_issues_count":0,"forks_count":12,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-03-26T19:45:38.533Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"http://www.glidedsky.com/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/TurboWay.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-09-07T08:53:50.000Z","updated_at":"2024-09-15T10:55:05.000Z","dependencies_parsed_at":null,"dependency_job_id":"09b8eacd-1cd5-4bac-b220-5d5a47ad84a4","html_url":"https://github.com/TurboWay/glidedsky","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TurboWay%2Fglidedsky","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TurboWay%2Fglidedsky/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TurboWay%2Fglidedsky/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TurboWay%2Fglidedsky/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/TurboWay","download_url":"https://codeload.github.com/TurboWay/glidedsky/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248654090,"owners_count":21140236,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-15T10:49:39.678Z","updated_at":"2025-04-13T02:11:08.666Z","avatar_url":"https://github.com/TurboWay.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# glidedsky\nglidedsky 通关笔记\n\n[镀金的天空](http://www.glidedsky.com/) 是一个互联网技能认证网站，要保证用户解出一道题目就意味着拥有了解决类似问题相应的技能\n\n![image](https://github.com/TurboWay/imgstore/blob/master/glidedsky/process.jpg)\n\n\n## note\n\n- 爬虫采集属于 io 密集型操作，使用多线程并发可以提高效率，但是最佳并发数取决于爬虫的机器配置，而不是越多越好\n- 网络请求有时候会出错，重试是必要的，不用框架的话，装饰器是很好的选择\n- 使用代理 ip 时，网络错误导致漏爬的可能性很高，只有重试是不够的，先把结果存下来，做好补爬的准备，是比较稳妥的策略\n- 使用图片识别时，成功率不会达到 100%，所以多采集几次是必要的，对每个数取重复率最高的结果，是较好的做法\n\n## list\n| 代码 | 说明 |\n| ------------ | ------------ |\n| [crawler-basic-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-basic-1.py)       | 爬虫-基础1 |\n| [crawler-basic-2.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-basic-2.py)       | 爬虫-基础2 |\n| [crawler-captcha-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-captcha-1.py)       | 爬虫-验证码-1 |\n| [crawler-captcha-2.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-captcha-2.py)       | 爬虫-验证码-2 【网站服务异常，暂时无法审题】 |\n| [crawler-css-puzzle-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-css-puzzle-1.py)       | 爬虫-CSS反爬 |\n| [crawler-font-puzzle-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-font-puzzle-1.py)       | 爬虫-字体反爬-1 |\n| [crawler-font-puzzle-2.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-font-puzzle-2.py)       | 爬虫-字体反爬-2 |\n| [crawler-ip-block-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-ip-block-1.py)       | 爬虫-IP屏蔽1 |\n| [crawler-ip-block-2.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-ip-block-2.py)       | 爬虫-IP屏蔽2 |\n| [crawler-javascript-obfuscation-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-javascript-obfuscation-1.py)       | 爬虫-JS加密1 |\n| [crawler-sprite-image-1.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-sprite-image-1.py)       | 爬虫-雪碧图-1 |\n| [crawler-sprite-image-2.py](https://github.com/TurboWay/glidedsky/blob/master/crawler-sprite-image-2.py)       | 爬虫-雪碧图-2 |\n\n## refer\n\u003e滑动验证码 参考 https://github.com/ybsdegit/captcha_qq\n\u003e\n\u003e图片识别模型训练 https://github.com/TurboWay/antman\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fturboway%2Fglidedsky","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fturboway%2Fglidedsky","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fturboway%2Fglidedsky/lists"}