{"id":18263229,"url":"https://github.com/dataabc/weibo-follow","last_synced_at":"2025-04-09T20:11:40.835Z","repository":{"id":45434303,"uuid":"196001072","full_name":"dataabc/weibo-follow","owner":"dataabc","description":"Crawl the Weibo posts of the Weibo accounts in a follow list","archived":false,"fork":false,"pushed_at":"2024-05-21T06:52:00.000Z","size":31,"stargazers_count":189,"open_issues_count":21,"forks_count":53,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-04-09T20:11:36.624Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dataabc.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-07-09T12:08:16.000Z","updated_at":"2025-03-20T14:19:46.000Z","dependencies_parsed_at":"2024-05-03T20:48:37.907Z","dependency_job_id":"5407c30d-2585-42f1-a3b2-fd13542c1e00","html_url":"https://github.com/dataabc/weibo-follow","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dataabc%2Fweibo-follow","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dataabc%2Fweibo-follow/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dataabc%2Fweibo-follow/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dataabc%2Fweibo-follow/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dataabc","download_url":"https://codeload.github.com/dataabc/weibo-follow/tar.gz/refs/hea
ds/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248103872,"owners_count":21048245,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-05T11:10:27.391Z","updated_at":"2025-04-09T20:11:40.804Z","avatar_url":"https://github.com/dataabc.png","language":"Python","readme":"**Crawls the Weibo accounts in the follow lists of one or more specified Weibo users, collects their user_ids in bulk, and writes them to user_id_list.txt.**\u003cbr\u003e\nThe program can read its input from a file. It retrieves at most 200 user_ids per user and writes them to user_id_list.txt. The generated user_id_list.txt can then be used as the value of the “user_id_list” parameter in config.json to fetch more user_ids and generate a new file, giving up to 200 × 200 = 40000 user_ids. Those 40000 user_ids can in turn give up to 40000 × 200 = 8000000 user_ids, and so on, so a large number of user_ids can be collected.\n\n## Usage\n### Get the program\n```\n$ git clone https://github.com/dataabc/weibo-follow.git\n```\n### Install dependencies\n```\n$ pip install -r requirements.txt\n```\n### Configure config.json\n```\n{\n    \"user_id_list\": \"user_id_list.txt\",\n    \"cookie\": \"your cookie\"\n}\n```\nuser_id_list can be the path to a file of target user_ids, as in the example above. user_id_list.txt contains user_ids, one per line, and may list several, for example:\n```\n1669879400\n1223178222\n```\nFor how to find a Weibo user's user_id, see [How to get a user_id](#how-to-get-a-user_id).\nYou can also add a comment after each user_id, like this:\n```\n1669879400 Dear-迪丽热巴\n1223178222 胡歌\n```\n\nAlternatively, user_id_list can contain the target user_ids directly, for example:\n```\n     \"user_id_list\": [\"1669879400\", \"1223178222\"]\n```\nThis fetches the follow lists of the users whose user_ids are “1669879400” and “1223178222” and writes the user_ids of the accounts they follow to user_id_list.txt.\u003cbr\u003e\ncookie is the Weibo cookie used by the crawler. For how to obtain it, see [How to get the cookie](#how-to-get-the-cookie); then replace “your cookie” with the real cookie value.\n\n## Run the program\n```\n$ python weibo_follow\n```\n\n## How to get the cookie\n
1. Open \u003chttps://passport.weibo.cn/signin/login\u003e in Chrome;\u003cbr\u003e\n2. Enter your Weibo username and password and log in, as shown:\n![](https://picture.cognize.me/cognize/github/weibospider/cookie1.png)\nAfter a successful login you are redirected to \u003chttps://m.weibo.cn\u003e;\u003cbr\u003e\n3. Press F12 to open Chrome DevTools, then enter \u003chttps://weibo.cn\u003e in the address bar and navigate to it. A page similar to the following appears:\n![](https://picture.cognize.me/cognize/github/weibospider/cookie2.png)\n4. In Chrome DevTools, click in turn Network-\u003eweibo.cn under Name-\u003eHeaders-\u003eRequest Headers. The value after \"Cookie:\" is the cookie we need; copy it, as shown:\n![](https://picture.cognize.me/cognize/github/weibospider/cookie3.png)\n\n## How to get a user_id\n1. Open \u003chttps://weibo.cn\u003e and search for the person you want, e.g. \"迪丽热巴\", then go to her home page;\u003cbr\u003e\n![](https://picture.cognize.me/cognize/github/weibospider/user_home.png)\n2. Click the \"资料\" (Profile) link indicated by the arrow in the image above to go to the user's profile page;\u003cbr\u003e\n![](https://picture.cognize.me/cognize/github/weibospider/user_info.png)\nAs shown above, the URL of 迪丽热巴's profile page is \"\u003chttps://weibo.cn/1669879400/info\u003e\", and the \"1669879400\" in it is this account's user_id.\u003cbr\u003e\nIn fact, this account's user_id also appears in the home page URL (\u003chttps://weibo.cn/u/1669879400?f=search_0\u003e). We still click \"资料\" on the home page to get the user_id because many users' home page URLs do not have the form \"\u003chttps://weibo.cn/user_id?f=search_0\u003e\" but rather \"\u003chttps://weibo.cn/custom-domain?f=search_0\u003e\" or \"\u003chttps://weibo.cn/weihao?f=search_0\u003e\". Both the \"微号\" (weihao) and the user_id are strings of digits, so extracting the user_id from the home page URL alone can easily mistake a weihao for a user_id.\n","funding_links":[],"categories":["Python"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdataabc%2Fweibo-follow","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdataabc%2Fweibo-follow","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdataabc%2Fweibo-follow/lists"}