https://github.com/anuragsinghbhandari/static_scrape
A command line tool clone frontend using url of a website
https://github.com/anuragsinghbhandari/static_scrape
beautifulsoup4 clone-website playwright-python pypi-package python webscraping
Last synced: 5 months ago
JSON representation
A command line tool clone frontend using url of a website
- Host: GitHub
- URL: https://github.com/anuragsinghbhandari/static_scrape
- Owner: anuragsinghbhandari
- License: mit
- Created: 2025-03-05T09:06:17.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2025-03-06T09:27:39.000Z (10 months ago)
- Last Synced: 2025-03-20T05:41:54.608Z (9 months ago)
- Topics: beautifulsoup4, clone-website, playwright-python, pypi-package, python, webscraping
- Language: Python
- Homepage: https://pypi.org/project/StaticScrape/
- Size: 34.2 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# StaticScrape


[](https://pepy.tech/projects/staticscrape)
## 📌 Description
StaticScrape is a command-line tool designed to **clone the front end of any website** effortlessly. It retrieves the HTML structure while removing unnecessary scripts to create a clean static version of the webpage.
## 🚀 Features
- Extracts full HTML structure of a given website.
- Automatically removes unnecessary `` tags.
- Uses **Playwright** to handle JavaScript rendering.
- Saves the cloned page as a static `.html` file.
- Simple CLI interface for easy usage.
## 🛠 Installation
You can install **StaticScrape** using pip:
```sh
pip install StaticScrape
```
After installation, ensure Playwright is set up for browser automation:
```sh
playwright install
```
## 📝 Usage
Run StaticScrape from the command line:
```sh
staticscrape
```
## 💡 Example
```sh
staticscrape
Enter URL: https://example.com
Enter output file name: my_clone
✅ Page saved as my_clone.html
```
## 🤝 Contributing
Feel free to contribute! If you find a bug or have a feature request, open an issue or submit a pull request.
## 📜 License
This project is licensed under the **MIT License**.
## 👤 Author
**Anurag Singh Bhandari**
📧 anuuo3ups@gmail.com
---
Happy Scraping! 🚀