{"id":28762786,"url":"https://github.com/zebbern/website-clone","last_synced_at":"2025-06-17T08:32:38.637Z","repository":{"id":295308386,"uuid":"989764250","full_name":"zebbern/website-clone","owner":"zebbern","description":"Python script that downloads complete webpages with all assets for offline viewing keeping the structure and style of the page!","archived":false,"fork":false,"pushed_at":"2025-05-24T19:28:01.000Z","size":7,"stargazers_count":6,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-06-16T06:55:04.206Z","etag":null,"topics":["clone-website","downloader","phishing-attacks","phishing-page","red-team-tools","web-clone","website-clone","website-clone-html-css","website-cloner","website-copy","website-design","website-development","website-downloader","website-redirect","website-template"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zebbern.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-05-24T19:24:32.000Z","updated_at":"2025-05-25T18:05:25.000Z","dependencies_parsed_at":"2025-05-24T20:30:35.885Z","dependency_job_id":null,"html_url":"https://github.com/zebbern/website-clone","commit_stats":null,"previous_names":["zebbern/website-clone"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/zebbern/website-clone","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zebbern%2Fwebsite-clone","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zebbern%2Fwebsite-clone/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zebbern%2Fwebsite-clone/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zebbern%2Fwebsite-clone/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zebbern","download_url":"https://codeload.github.com/zebbern/website-clone/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zebbern%2Fwebsite-clone/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":260321703,"owners_count":22991682,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["clone-website","downloader","phishing-attacks","phishing-page","red-team-tools","web-clone","website-clone","website-clone-html-css","website-cloner","website-copy","website-design","website-development","website-downloader","website-redirect","website-template"],"created_at":"2025-06-17T08:30:38.961Z","updated_at":"2025-06-17T08:32:38.607Z","avatar_url":"https://github.com/zebbern.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"## Website Cloner\n\n- Complete webpage download including all assets\n- Preserves original HTML structure and formatting\n- Downloads CSS files and processes internal references\n- Handles images, scripts, and other linked resources\n- Organized folder structure: `sites/domain_name/`\n- Clean console interface with minimal output\n- Firefox-compatible headers for accurate downloads\n\n## Installation\n\n1. Clone this repository:\n```bash\ngit clone https://github.com/zebbern/website-clone\ncd website-clone\n```\n\n## Usage\n\nRun the script and enter the URL when prompted:\n\n```bash\npython clone.py\n```\n\n**Example:**\n```\nEnter URL: github.com/login\nDownloading...\n✅ Saved to: sites/github_com\n```\n\n## Output Structure\n\nDownloaded sites are organized as follows:\n```\nsites/\n└── github_com/\n    ├── github_com.htm          # Main HTML file\n    └── github_com_files/        # Assets folder\n        ├── style.css\n        ├── script.js\n        ├── logo.png\n        └── favicon.ico\n```\n\n## How It Works\n\n1. Downloads the main HTML page while preserving exact structure\n2. Parses HTML using regex to find resource links\n3. Downloads all CSS stylesheets and processes internal URLs\n4. Downloads images, JavaScript files, and other assets\n5. Updates all links to point to local files\n6. Saves everything in an organized folder structure\n\n## Use Cases\n\n- Web archival and research\n- Offline browsing and documentation\n- Website backup and preservation\n- Web development reference\n- Educational purposes\n\n## Requirements\n\n- Python 3.6+\n- requests library\n- Internet connection for downloading\n\n## Technical Details\n\n- Uses Firefox-compatible user agent and headers\n- Processes CSS @import statements and url() references\n- Handles relative and absolute URLs correctly\n- Maintains original HTML formatting and structure\n- Silent operation with minimal console output\n\n## Disclaimer\n\nThis tool is intended for legitimate purposes such as web archival, research, and backup. Users are responsible for respecting website terms of service and copyright laws when using this tool.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzebbern%2Fwebsite-clone","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzebbern%2Fwebsite-clone","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzebbern%2Fwebsite-clone/lists"}