{"id":26160030,"url":"https://github.com/swhl/baiduimagecrawling","last_synced_at":"2026-02-14T02:03:06.237Z","repository":{"id":272606699,"uuid":"915218642","full_name":"SWHL/BaiduImageCrawling","owner":"SWHL","description":"一个超级轻量的百度图片爬虫, modified from https://github.com/kong36088/BaiduImageSpider","archived":false,"fork":false,"pushed_at":"2025-01-16T00:05:44.000Z","size":14,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-09-15T11:53:29.631Z","etag":null,"topics":["baidu","crawling","image","spider"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SWHL.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null},"funding":{"github":null,"patreon":null,"open_collective":null,"ko_fi":null,"tidelift":null,"community_bridge":null,"liberapay":null,"issuehunt":null,"otechie":null,"lfx_crowdfunding":null,"custom":"https://raw.githubusercontent.com/RapidAI/.github/6db6b6b9273f3151094a462a61fbc8e88564562c/assets/Sponsor.png"}},"created_at":"2025-01-11T09:21:06.000Z","updated_at":"2025-01-17T00:10:44.000Z","dependencies_parsed_at":"2025-01-15T16:28:39.142Z","dependency_job_id":"e80df638-296a-426b-9ec9-cc38257c6fb7","html_url":"https://github.com/SWHL/BaiduImageCrawling","commit_stats":null,"previous_names":["swhl/baiduimagecrawling","swhl/baiduimagespider"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/SWHL/BaiduImageCrawling","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SWHL%2FBaiduImageCrawling","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SWHL%2FBaiduImageCrawling/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SWHL%2FBaiduImageCrawling/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SWHL%2FBaiduImageCrawling/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SWHL","download_url":"https://codeload.github.com/SWHL/BaiduImageCrawling/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SWHL%2FBaiduImageCrawling/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29431593,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-13T22:20:51.549Z","status":"online","status_checked_at":"2026-02-14T02:00:07.626Z","response_time":53,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["baidu","crawling","image","spider"],"created_at":"2025-03-11T11:57:56.683Z","updated_at":"2026-02-14T02:03:06.201Z","avatar_url":"https://github.com/SWHL.png","language":"Python","funding_links":["https://raw.githubusercontent.com/RapidAI/.github/6db6b6b9273f3151094a462a61fbc8e88564562c/assets/Sponsor.png"],"categories":[],"sub_categories":[],"readme":"\u003cdiv align=\"center\"\u003e\n  \u003cdiv align=\"center\"\u003e\n    \u003ch1\u003e\u003cb\u003e🕷️ Baidu Image Crawling\u003c/b\u003e\u003c/h1\u003e\n  \u003c/div\u003e\n\n\u003ca href=\"\"\u003e\u003cimg src=\"https://img.shields.io/badge/Python-\u003e=3.6,\u003c3.12-aff.svg\"\u003e\u003c/a\u003e\n\u003ca href=\"\"\u003e\u003cimg src=\"https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg\"\u003e\u003c/a\u003e\n\u003ca href=\"https://pypi.org/project/baidu_image_crawling/\"\u003e\u003cimg alt=\"PyPI\" src=\"https://img.shields.io/pypi/v/baidu_image_crawling\"\u003e\u003c/a\u003e\n\u003ca href=\"https://pepy.tech/project/baidu_image_crawling\"\u003e\u003cimg src=\"https://static.pepy.tech/personalized-badge/baidu_image_crawling?period=total\u0026units=abbreviation\u0026left_color=grey\u0026right_color=blue\u0026left_text=Downloads\"\u003e\u003c/a\u003e\n\u003ca href=\"https://github.com/SWHL/BaiduImageCrawling/stargazers\"\u003e\u003cimg src=\"https://img.shields.io/github/stars/SWHL/BaiduImageCrawling?color=ccf\"\u003e\u003c/a\u003e\n\u003ca href=\"https://semver.org/\"\u003e\u003cimg alt=\"SemVer2.0\" src=\"https://img.shields.io/badge/SemVer-2.0-brightgreen\"\u003e\u003c/a\u003e\n\u003ca href=\"https://github.com/psf/black\"\u003e\u003cimg src=\"https://img.shields.io/badge/code%20style-black-000000.svg\"\u003e\u003c/a\u003e\n\n\u003c/div\u003e\n\n### 简介\n\n一个超级轻量的百度图片爬虫, modified from \u003chttps://github.com/kong36088/BaiduImageCrawling\u003e\n\n### 安装\n\n```bash\npip install baidu_image_crawling\n```\n\n### Python使用\n\n```python\nfrom baidu_image_crawling.main import Crawler\n\ncrawler = Crawler(0.05, save_dir=\"outputs\")  # 抓取延迟为 0.05\n\n# 抓取关键词为 “美女”，总数为2页，开始页码为1，每页 30 张, 即总共2*30=60张\ncrawler(word=\"美女\", total_page=2, start_page=1, per_page=30)\n```\n\n### 终端使用\n\n```bash\nbaidu_image_crawling -w 美女 -tp 1 -sp 1 -pp 2\n```\n\n查看参数文档：\n\n```bash\n$ baidu_image_crawling -h\nusage: baidu_image_crawling [-h] -w WORD -tp TOTAL_PAGE -sp START_PAGE [-pp [PER_PAGE]] [-sd SAVE_DIR] [-d DELAY]\n\noptions:\n  -h, --help            show this help message and exit\n  -w WORD, --word WORD  抓取关键词\n  -tp TOTAL_PAGE, --total_page TOTAL_PAGE\n                        需要抓取的总页数\n  -sp START_PAGE, --start_page START_PAGE\n                        起始页数\n  -pp [PER_PAGE], --per_page [PER_PAGE]\n                        每页大小\n  -sd SAVE_DIR, --save_dir SAVE_DIR\n                        图片保存目录\n  -d DELAY, --delay DELAY\n                        抓取延时（间隔）\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fswhl%2Fbaiduimagecrawling","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fswhl%2Fbaiduimagecrawling","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fswhl%2Fbaiduimagecrawling/lists"}