{"id":13694411,"url":"https://github.com/PythonLoversZone/Learn-Python-Crawler","last_synced_at":"2025-05-03T01:33:11.511Z","repository":{"id":95224857,"uuid":"104342801","full_name":"PythonLoversZone/Learn-Python-Crawler","owner":"PythonLoversZone","description":"Python爬虫学习路径图","archived":false,"fork":false,"pushed_at":"2017-10-21T08:11:59.000Z","size":767,"stargazers_count":47,"open_issues_count":0,"forks_count":10,"subscribers_count":3,"default_branch":"master","last_synced_at":"2024-08-03T17:19:51.864Z","etag":null,"topics":["python3"],"latest_commit_sha":null,"homepage":null,"language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/PythonLoversZone.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2017-09-21T12:02:29.000Z","updated_at":"2024-06-11T02:53:16.000Z","dependencies_parsed_at":"2023-04-05T00:01:50.006Z","dependency_job_id":null,"html_url":"https://github.com/PythonLoversZone/Learn-Python-Crawler","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PythonLoversZone%2FLearn-Python-Crawler","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PythonLoversZone%2FLearn-Python-Crawler/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PythonLoversZone%2FLearn-Python-Crawler/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PythonLoversZone%2FLearn-Python-Crawler/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/PythonLoversZone","download_url":"https://codeload.github.com/PythonLoversZone/Learn-Python-Crawler/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":224346588,"owners_count":17296245,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["python3"],"created_at":"2024-08-02T17:01:31.331Z","updated_at":"2024-11-12T20:32:18.096Z","avatar_url":"https://github.com/PythonLoversZone.png","language":"HTML","funding_links":[],"categories":["HTML"],"sub_categories":[],"readme":"# Python-crawler\n\n从零开始系统化的学习写Python爬虫。   \n主要是记录一下自己写Python爬虫的经过与心得。   \n同时也是为了分享一下如何能更高效率的学习写爬虫。    \nIDE：Vscode    Python版本: 3.6   \n\n每天的学习记录都会 同步更新到：\n* 微信公众号： findyourownway\n* 知乎专栏：https://zhuanlan.zhihu.com/Ehco-python\n* blog ： www.ehcoblog.ml\n\n详细学习路径：\n### 一：Beautiful Soup 爬虫\n\n* requests库的安装与使用 https://zhuanlan.zhihu.com/p/26681429\n* 安装beautiful soup 爬虫环境 https://zhuanlan.zhihu.com/p/26683864\n* beautiful soup 的解析器 https://zhuanlan.zhihu.com/p/26691931\n* re库 正则表达式的使用 https://zhuanlan.zhihu.com/p/26701898\n* bs4 爬虫实践： 获取百度贴吧的内容 https://zhuanlan.zhihu.com/p/26722495\n* bs4 爬虫实践： 获取双色球中奖信息 https://zhuanlan.zhihu.com/p/26747717\n* bs4 爬虫实践： 排行榜小说批量下载 https://zhuanlan.zhihu.com/p/26756909\n* bs4 爬虫实践： 获取电影信息 https://zhuanlan.zhihu.com/p/26786056\n* bs4 爬虫实践： 悦音台mv排行榜与反爬虫技术 https://zhuanlan.zhihu.com/p/26809626\n------\n### 二： Scrapy 爬虫框架\n\n* Scrapy 爬虫框架的安装与基本介绍 https://zhuanlan.zhihu.com/p/26832971\n* Scrapy 选择器和基本使用 https://zhuanlan.zhihu.com/p/26854842\n* Scrapy 爬虫实践：天气预报\u0026数据存储 https://zhuanlan.zhihu.com/p/26885412\n* Scrapy 爬虫实践：代理的爬取和验证 https://zhuanlan.zhihu.com/p/26939527\n* Scrapy 爬虫实践：糗事百科\u0026爬虫攻防 https://zhuanlan.zhihu.com/p/26980300\n* Scrapy 爬虫实践：重构排行榜小说爬虫\u0026Mysql数据库 https://zhuanlan.zhihu.com/p/27027200\n------\n### 三： 浏览器模拟爬虫\n\n* Selenium模拟浏览器 https://zhuanlan.zhihu.com/p/27115580\n* 爬虫实践：获取快代理 https://zhuanlan.zhihu.com/p/27150025\n* 爬虫实践：漫画批量下载 https://zhuanlan.zhihu.com/p/27155429\n\n### 四： 练手项目\n\n* 爬虫实践：螺纹钢数据\u0026Cookies https://zhuanlan.zhihu.com/p/27232687\n* 爬虫实践：登录正方教务系统 https://zhuanlan.zhihu.com/p/27256315\n* 爬虫应用： requests+django实现微信公众号后台 https://zhuanlan.zhihu.com/p/27625233\n* 爬虫应用： 12306火车票信息查询 https://zhuanlan.zhihu.com/p/27969976\n* 爬虫应用： 利用斗鱼Api抓取弹幕 https://zhuanlan.zhihu.com/p/28164017\n* 爬虫应用： 获取支付宝账单信息 https://zhuanlan.zhihu.com/p/28537306\n* 爬虫应用：IT之家热门段子（评论）爬取 https://zhuanlan.zhihu.com/p/28806210\n* 爬虫应用：一号店 商品信息查询程序 https://zhuanlan.zhihu.com/p/28982497","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FPythonLoversZone%2FLearn-Python-Crawler","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FPythonLoversZone%2FLearn-Python-Crawler","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FPythonLoversZone%2FLearn-Python-Crawler/lists"}