{"id":19081662,"url":"https://github.com/facsimiles/beautifulsoup","last_synced_at":"2025-10-10T08:01:38.311Z","repository":{"id":256272038,"uuid":"850094555","full_name":"facsimiles/beautifulsoup","owner":"facsimiles","description":"🌐 BeautifulSoup: Effortlessly scrape and parse web data with this powerful Python library! Perfect for developers needing quick and reliable HTML/XML data extraction. Start saving time on your projects today! [MIRROR][UNOFFICIAL]","archived":false,"fork":false,"pushed_at":"2024-09-03T20:50:07.000Z","size":16312,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-22T06:13:55.626Z","etag":null,"topics":["api-scraping","css-selectors","data-extraction","data-mining","dynamic-web-scraping","html-parsing","mirror","mirrored-repository","python","python3","unofficial","unofficial-mirror","web-automation","web-content-extraction","web-crawler","web-scraping","xpath"],"latest_commit_sha":null,"homepage":"https://www.crummy.com/software/BeautifulSoup/","language":"HTML","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/facsimiles.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":"AUTHORS","dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-30T21:43:09.000Z","updated_at":"2024-09-12T14:20:24.000Z","dependencies_parsed_at":"2024-09-10T00:05:23.290Z","dependency_job_id":"061962c6-e7d3-4c58-acdd-79a6d1a228ce","html_url":"https://github.com/facsimiles/beautifulsoup","commit_stats":null,"previous_names":["facsimiles/beautifulsoup"],"tags_count":7,"template":false,"template_full_n
ame":null,"purl":"pkg:github/facsimiles/beautifulsoup","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/facsimiles%2Fbeautifulsoup","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/facsimiles%2Fbeautifulsoup/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/facsimiles%2Fbeautifulsoup/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/facsimiles%2Fbeautifulsoup/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/facsimiles","download_url":"https://codeload.github.com/facsimiles/beautifulsoup/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/facsimiles%2Fbeautifulsoup/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":270816222,"owners_count":24650761,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-17T02:00:09.016Z","response_time":129,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api-scraping","css-selectors","data-extraction","data-mining","dynamic-web-scraping","html-parsing","mirror","mirrored-repository","python","python3","unofficial","unofficial-mirror","web-automation","web-content-extraction","web-crawler","web-scraping","xpath"],"created_at":"2024-11-09T02:38:09.920Z","updated_at":"2025-10-10T08:01:38.213Z","avatar_url":"https://github.com/fa
csimiles.png","language":"HTML","funding_links":["https://tidelift.com/subscription/pkg/pypi-beautifulsoup4?utm_source=pypi-beautifulsoup4\u0026utm_medium=referral\u0026utm_campaign=readme"],"categories":[],"sub_categories":[],"readme":"Beautiful Soup is a library that makes it easy to scrape information\nfrom web pages. It sits atop an HTML or XML parser, providing Pythonic\nidioms for iterating, searching, and modifying the parse tree.\n\n# Quick start\n\n```\n\u003e\u003e\u003e from bs4 import BeautifulSoup\n\u003e\u003e\u003e soup = BeautifulSoup(\"\u003cp\u003eSome\u003cb\u003ebad\u003ci\u003eHTML\")\n\u003e\u003e\u003e print(soup.prettify())\n\u003chtml\u003e\n \u003cbody\u003e\n  \u003cp\u003e\n   Some\n   \u003cb\u003e\n    bad\n    \u003ci\u003e\n     HTML\n    \u003c/i\u003e\n   \u003c/b\u003e\n  \u003c/p\u003e\n \u003c/body\u003e\n\u003c/html\u003e\n\u003e\u003e\u003e soup.find(string=\"bad\")\n'bad'\n\u003e\u003e\u003e soup.i\n\u003ci\u003eHTML\u003c/i\u003e\n#\n\u003e\u003e\u003e soup = BeautifulSoup(\"\u003ctag1\u003eSome\u003ctag2/\u003ebad\u003ctag3\u003eXML\", \"xml\")\n#\n\u003e\u003e\u003e print(soup.prettify())\n\u003c?xml version=\"1.0\" encoding=\"utf-8\"?\u003e\n\u003ctag1\u003e\n Some\n \u003ctag2/\u003e\n bad\n \u003ctag3\u003e\n  XML\n \u003c/tag3\u003e\n\u003c/tag1\u003e\n```\n\nTo go beyond the basics, [comprehensive documentation is available](https://www.crummy.com/software/BeautifulSoup/bs4/doc/).\n\n# Links\n\n* [Homepage](https://www.crummy.com/software/BeautifulSoup/bs4/)\n* [Documentation](https://www.crummy.com/software/BeautifulSoup/bs4/doc/)\n* [Discussion group](https://groups.google.com/group/beautifulsoup/)\n* [Development](https://code.launchpad.net/beautifulsoup/)\n* [Bug tracker](https://bugs.launchpad.net/beautifulsoup/)\n* [Complete changelog](https://bazaar.launchpad.net/~leonardr/beautifulsoup/bs4/view/head:/CHANGELOG)\n\n# Note on Python 2 sunsetting\n\nBeautiful Soup's support for Python 2 was discontinued on December 
31,\n2020: one year after the sunset date for Python 2 itself. From this\npoint onward, new Beautiful Soup development will exclusively target\nPython 3. The final release of Beautiful Soup 4 to support Python 2\nwas 4.9.3.\n\n# Supporting the project\n\nIf you use Beautiful Soup as part of your professional work, please consider a\n[Tidelift subscription](https://tidelift.com/subscription/pkg/pypi-beautifulsoup4?utm_source=pypi-beautifulsoup4\u0026utm_medium=referral\u0026utm_campaign=readme).\nThis will support many of the free software projects your organization\ndepends on, not just Beautiful Soup.\n\nIf you use Beautiful Soup for personal projects, the best way to say\nthank you is to read\n[Tool Safety](https://www.crummy.com/software/BeautifulSoup/zine/), a zine I\nwrote about what Beautiful Soup has taught me about software\ndevelopment.\n\n# Building the documentation\n\nThe bs4/doc/ directory contains full documentation in Sphinx\nformat. Run `make html` in that directory to create HTML\ndocumentation.\n\n# Running the unit tests\n\nBeautiful Soup supports unit test discovery using Pytest:\n\n```\n$ pytest\n```\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffacsimiles%2Fbeautifulsoup","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ffacsimiles%2Fbeautifulsoup","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffacsimiles%2Fbeautifulsoup/lists"}