{"id":48094448,"url":"https://github.com/mashukui/xhs_pic_tool","last_synced_at":"2026-04-04T15:34:37.568Z","repository":{"id":312256111,"uuid":"1046786333","full_name":"mashukui/xhs_pic_tool","owner":"mashukui","description":"用python开发的小红书图片采集软件，支持下载小红书笔记无水印图片、采集笔记数据、评论数据等。小红书爬虫|小红书无水印图片|小红书无水印下载|小红书评论爬虫|小红书采集工具|小红书评论采集|小红书采集软件|小红书爬取数据|xiaohongshu|xhs|XHS","archived":false,"fork":false,"pushed_at":"2026-02-12T00:24:50.000Z","size":25,"stargazers_count":2,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2026-02-12T10:54:45.560Z","etag":null,"topics":["crawler","gui","gui-application","python-spider","spider","xhs","xhs-downloader","xhs-spider","xiaohongshu","xiaohongshu-downloader"],"latest_commit_sha":null,"homepage":"https://github.com/mashukui/xhs_pic_tool","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mashukui.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-08-29T08:14:46.000Z","updated_at":"2026-02-12T00:24:53.000Z","dependencies_parsed_at":"2025-08-29T14:36:52.879Z","dependency_job_id":"12c3e8b2-1610-480c-9e37-166b131ad255","html_url":"https://github.com/mashukui/xhs_pic_tool","commit_stats":null,"previous_names":["mashukui/xhs_pic_tool"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/mashukui/xhs_pic_tool","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mashukui%2Fxhs_pic_tool","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mashukui%2Fxhs_pic_tool/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mashukui%2Fxhs_pic_tool/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mashukui%2Fxhs_pic_tool/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mashukui","download_url":"https://codeload.github.com/mashukui/xhs_pic_tool/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mashukui%2Fxhs_pic_tool/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31403959,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-04T10:20:44.708Z","status":"ssl_error","status_checked_at":"2026-04-04T10:20:06.846Z","response_time":60,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["crawler","gui","gui-application","python-spider","spider","xhs","xhs-downloader","xhs-spider","xiaohongshu","xiaohongshu-downloader"],"created_at":"2026-04-04T15:34:37.484Z","updated_at":"2026-04-04T15:34:37.554Z","avatar_url":"https://github.com/mashukui.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# xhs_pic_tool\n\u003e _马哥原创：用python开发的小红书图片采集软件，支持下载笔记图片、采集笔记数据、评论数据等。_\n\n\u003e _本软件工具仅限于学术交流使用，严格遵循相关法律法规，符合平台内容合法合规性，禁止用于任何商业用途！_\n\n# 一、背景介绍\n## 1.1 爬取目标\n\u003cimg width=\"2046\" height=\"328\" alt=\"xhs_slogon\" src=\"https://github.com/user-attachments/assets/872781ac-ebf7-4301-81e7-2d9ea695d717\" /\u003e\n\n小红书作为国内极具影响力的社区种草平台，汇聚了大量用户且拥有极高的日活跃度，其笔记数据蕴含丰富的信息价值。在合法合规、遵循平台规则以及尊重用户隐私的前提下，对小红书笔记、评论、图片数据进行合理采集分析，能够帮助企业和用户更好的了解热门趋势和对标热门作品，助力从业者高效创作优质笔记！\n\n基于以上背景，我利用python语言开发了一款软件“**爬小红书图片软件**”，但功能不止采集图片，还包括笔记数据、评论数据等。\n\n## 1.2 结果展示\n软件界面如下：(目前已升至v2.0版)\u003cimg width=\"850\" height=\"678\" alt=\"软件界面v2 0_mac_1\" src=\"https://github.com/user-attachments/assets/3ebf710d-8949-4461-8fb8-9a71fcde9c12\" /\u003e\n\n\n保存的csv数据：（截图中展示的就是全部字段了）\n![图2-csv数据](https://files.mdnice.com/user/32110/4df3378a-48d2-4bb3-8b36-fcb2d7b38d0d.png)\n\n\u003e 如果csv截图看不清，可移步这里，更清晰：https://docs.qq.com/sheet/DVEFhZlFKR1NXVEdN?tab=2ophep\n\n自动下载的图片：\n![图3-下载的笔记图片](https://files.mdnice.com/user/32110/2bebc609-2828-4f12-aeb3-73444fef319a.jpg)\n\n附：csv文件中的“序号”列，和图片文件名前缀序号一一对应，2.3章节有详细解释。\n\n## 1.3 演示视频\n软件操作演示视频：[【爬虫软件】小红书图片采集工具，一键下载无水印图片](https://mp.weixin.qq.com/s/VGiqBEg5hCQuEzOGTJqxSQ)\n\n## 1.4 软件说明\n重要说明，请详读：\n```python\n1、Windows系统、Mac系统均可运行！\n2、需要在cookie.txt中填入cookie值，持久存储，方便长期使用\n3、支持筛选笔记搜索关键词、笔记类型、排序方式，选择是否下载图片、是否采集评论等功能\n4、爬取过程中，自动保存结果到csv文件（每爬一条存一次，防止数据丢失）\n5、爬取过程中，有log文件详细记录运行过程，方便回溯\n6、可爬20+关键字段，含：关键词,序号,笔记id,笔记链接,笔记链接_长,笔记标题,笔记内容,笔记类型,发布时间,修改时间,IP属地,点赞数,收藏数,评论数,转发数,用户昵称,用户id,用户主页链接,头图链接,评论内容(若干)\n```\n\n# 二、功能介绍\n## 2.1 软件界面功能\n\n软件界面上的设置项，包含：\n```python\n1. 笔记搜索关键词（支持填多个）\n2. 笔记类型：综合/图文/视频\n3. 排序方式：综合/最新/最热\n4. 下载图片：是/否\n5. 前几条笔记：1~220（即，每个关键词爬前几条笔记）\n6. 采集评论：是/否（不含二级评论）\n7. 评论页数：可填选（即，采集评论的前几页，每页10条评论）\n```\n软件界面，见图1。\n\n## 2.2 数据导出功能\n软件运行过程中，自动导出数据结果到当前目录的csv文件，csv文件以时间戳命名，方便查找。\n\n保存的字段有20+，含：关键词,序号,笔记id,笔记链接,笔记链接_长,笔记标题,笔记内容,笔记类型,发布时间,修改时间,IP属地,点赞数,收藏数,评论数,转发数,用户昵称,用户id,用户主页链接,头图链接,评论内容(若干)。\n\n由于评论数量由用户指定，所以这里写的字段总数量是20+。\n\ncsv数据结果，见图2。\n\n## 2.3 图片保存功能\n图片的保存规则为：\n\n按照爬取的顺序序号保存，比如一个关键词下大约220条笔记，那么csv里存的笔记每条序号为001-220，对应的图片文件名就是001-01.jpg，001-02.jpg，002-01.jpg，以此类推。\n\n一个关键词爬取下载的所有图片都存进《关键词》命名的文件夹里。\n\n简单来说，文件夹以《关键词》命名，文件夹里的每个图片文件以序号命名，图片序号和csv里的序号一一对应。从而实现对应关系，方便查找指定笔记对应的图片。\n图片保存结果，见图3。\n\n# 三、代码讲解\n## 3.1 爬虫采集模块\n\n此软件开发成本较高，代码量大、实现逻辑复杂，为保护个人知识版权，防止恶意盗版软件，不展示爬虫核心代码。\n\n## 3.2 cookie获取\n运行软件之前，需要填写cookie值到txt配置文件中，获取方法如下：\n![cookie获取方法](https://files.mdnice.com/user/32110/4c25d085-0348-407b-bfc3-39376107c7c7.png)\n\n然后把获取到的cookie值存入当前文件夹下的cookie.txt文件，并保存。\n\n## 3.3 软件界面模块\n主窗口部分：\n```python\n# 创建主窗口\nroot = tk.Tk()\nroot.title('爬小红书图片软件v1.0 | 马哥python说')\n# 设置窗口大小\nroot.minsize(width=850, height=650)\n```\n部分界面控件：\n```python\n# 搜索关键词\ntk.Label(root, justify='left', text='搜索关键词:').place(x=30, y=100)\nentry_kw = tk.Text(root, bg='#ffffff', width=78, height=2, )\nentry_kw.place(x=110, y=100, anchor='nw')  # 摆放位置\ntk.Label(root, justify='left', text='多关键词以空格分隔', fg='red').place(x=665, y=100)\n```\n日志输出控件：\n```python\n# 运行日志\ntk.Label(root, justify='left', text='运行日志:').place(x=30, y=250)\nshow_list_Frame = tk.Frame(width=780, height=300)  # 创建\u003c消息列表分区\u003e\nshow_list_Frame.pack_propagate(0)\nshow_list_Frame.place(x=30, y=270, anchor='nw')  # 摆放位置\n```\n## 3.4 日志模块\n好的日志功能，方便软件运行出问题后快速定位原因，修复bug。\n\n核心代码：\n```python\ndef get_logger(self):\n    self.logger = logging.getLogger(__name__)\n    # 日志格式\n    formatter = '[%(asctime)s-%(filename)s][%(funcName)s-%(lineno)d]--%(message)s'\n    # 日志级别\n    self.logger.setLevel(logging.DEBUG)\n    # 控制台日志\n    sh = logging.StreamHandler()\n    log_formatter = logging.Formatter(formatter, datefmt='%Y-%m-%d %H:%M:%S')\n    # info日志文件名\n    info_file_name = time.strftime(\"%Y-%m-%d\") + '.log'\n    # 将其保存到特定目录\n    case_dir = r'./logs/'\n    info_handler = TimedRotatingFileHandler(filename=case_dir + info_file_name,\n                        when='MIDNIGHT',\n                        interval=1,\n                        backupCount=7,\n                        encoding='utf-8')\n```\n软件运行过程中生成的日志文件：\n![log文件](https://files.mdnice.com/user/32110/e8033fb9-101b-4181-85d5-739fe2ac8d1d.jpg)\n\n# 四、付费说明\n## 4.1 卡密说明\n\n费用如下：\n```python\n日卡：使用期限1天，39元。适合试用等临时场景\n月卡：使用期限1个月，149元。适合短期采集需求\n季卡：使用期限3个月，399元。适合中期采集需求\n年卡：使用期限1年，799元。适合长期采集需求\n```\n**[点击这里，自助开通！](https://mgnb.pro/product/xhs)**\n\n## 4.2 一机一码\n软件采用一机一码机制，一个卡密只能在一台电脑运行、不可多电脑运行。\n## 4.3 软件多开\n一台电脑仅允许运行一个软件，不支持软件多开。\n\n## 4.4 软件维护\n软件由本人独立原创开发，长期维护更新，提供稳定运行​。\n\n# 五、软件包获取\n\n**本项目已整合到[爬小红书聚合软件(xhs_one_spider)](https://github.com/mashukui/xhs_one_spider)，建议直接使用聚合版本，功能更全面、维护更及时！**\n\n公众号\"**老男孩的平凡之路**\"，后台回复\"**爬小红书聚合软件**\"获取最新软件安装包。\n\u003cimg width=\"1938\" height=\"364\" alt=\"二维码-公众号放底部v2\" src=\"https://github.com/user-attachments/assets/1d297eb7-f7d8-4113-995d-92c638a02f8d\" /\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmashukui%2Fxhs_pic_tool","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmashukui%2Fxhs_pic_tool","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmashukui%2Fxhs_pic_tool/lists"}