{"id":16551122,"url":"https://github.com/michaelcurrin/html-screenshot-py","last_synced_at":"2025-03-21T10:32:08.123Z","repository":{"id":93178588,"uuid":"421368242","full_name":"MichaelCurrin/html-screenshot-py","owner":"MichaelCurrin","description":"Take fullpage screenshots for a batch of URLs with this easy CLI tool","archived":false,"fork":false,"pushed_at":"2025-01-06T13:56:28.000Z","size":114,"stargazers_count":4,"open_issues_count":6,"forks_count":2,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-03-01T05:31:54.536Z","etag":null,"topics":["html","image","python","screenshot","selenium","webscraper"],"latest_commit_sha":null,"homepage":"https://michaelcurrin.github.io/html-screenshot-py/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/MichaelCurrin.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-10-26T09:50:19.000Z","updated_at":"2025-01-06T13:56:31.000Z","dependencies_parsed_at":"2024-10-28T10:18:49.103Z","dependency_job_id":"6e64da9d-44f3-46ed-b08b-a1a6e712ffea","html_url":"https://github.com/MichaelCurrin/html-screenshot-py","commit_stats":null,"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MichaelCurrin%2Fhtml-screenshot-py","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MichaelCurrin%2Fhtml-screenshot-py/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MichaelCurrin%2Fhtml-screenshot-py/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MichaelCurrin%2Fhtml-screenshot-py/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/MichaelCurrin","download_url":"https://codeload.github.com/MichaelCurrin/html-screenshot-py/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244135405,"owners_count":20403796,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["html","image","python","screenshot","selenium","webscraper"],"created_at":"2024-10-11T19:36:25.837Z","updated_at":"2025-03-21T10:32:08.108Z","avatar_url":"https://github.com/MichaelCurrin.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# HTML Screenshot PY 🌐 🖼 🐍\n\u003e Take fullpage screenshots for a batch of URLs with this easy CLI tool\n\n[![GitHub tag](https://img.shields.io/github/tag/MichaelCurrin/html-screenshot-py?include_prereleases=\u0026sort=semver\u0026color=blue)](https://github.com/MichaelCurrin/html-screenshot-py/releases/)\n[![License](https://img.shields.io/badge/License-MIT-blue)](#license)\n\n[![Made with Python](https://img.shields.io/badge/Python-\u003e=3.6-blue?logo=python\u0026logoColor=white)](https://python.org)\n[![dependency - selenium](https://img.shields.io/badge/selenium-3-blue)](https://pypi.org/project/selenium)\n[![dependency - requests](https://img.shields.io/badge/requests-2-blue)](https://pypi.org/project/requests)\n\n\n## About\n\nAn easy Python CLI tool. Provide it a batch of one or more URLs as webpages to scrape. Whether for your own sites or by someone else.\n\n### Formats\n\nIt uses two approaches, depending on the format:\n\n- **HTML pages** - The tool will go through each to load the page, take a screenshot of the _entire_ page and save it a PNG file. Using _selenium_.\n- **Binary data** - For files with a PDF or image extension, the file will be downloaded directly (for speed and reliability) instead of trying to take a screenshot (which could be massive for PDFs with many pages). Using _requests_.\n\n### Use cases\n\nWhen you should use this tool:\n\n- **Archive** - Save a once-off copy of an article or a page design that inspires you, before it moves or disappears from the internet. Add as many URLS you like and download all of them.\n- **Software development** - Create visual snapshots of a page on your website to track improvements and fixes over time. Or watch how a competitor's website changes.\n\n\n## Dependencies\n\nSee [Requirements](/docs/installation.md#requirements).\n\n## Sample usage\n\nFor one webpage, here using https://example.com/ which is a real site you can test against:\n\n```sh\n$ python -m htmlscreenshot.scrape 'https://example.com'\n```\n\nFor multiple pages:\n\n```sh\n$ python -m htmlscreenshot ~/path/to/urls.txt\n```\n\nThen find your screenshots as PNGs in the project's output directory.\n\n\n## Documentation\n\n\u003cdiv align=\"center\"\u003e\n\n[![view - Documentation](https://img.shields.io/badge/view-Online_Documentation-blue?style=for-the-badge)](https://michaelcurrin.github.io/html-screenshot-py/ \"Go to docs site\")\n\n\u003c/div\u003e\n\n\n## License\n\nReleased under [MIT](/LICENSE) by [@MichaelCurrin](https://github.com/MichaelCurrin).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmichaelcurrin%2Fhtml-screenshot-py","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmichaelcurrin%2Fhtml-screenshot-py","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmichaelcurrin%2Fhtml-screenshot-py/lists"}