https://github.com/th3-c0der/web-crawler
A simple WebCrawler for exploring and downloading content from web pages within a given domain/url.
- Host: GitHub
- URL: https://github.com/th3-c0der/web-crawler
- Owner: Th3-C0der
- License: MIT
- Created: 2024-01-26T10:41:47.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-17T02:51:05.000Z (about 2 years ago)
- Last Synced: 2025-08-18T05:39:09.544Z (7 months ago)
- Topics: th3-c0der, th3-coder, th3c0der, th3coder, tool, tools, web-tool, webcrawl, webcrawler, webcrawlers, webcrawling
- Language: HTML
- Homepage:
- Size: 47.9 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
## A Simple WebCrawler Made In Python for exploring and downloading content from web pages within a given Domain/URL.
## About Tool:
- This Tool Crawls The Given URL/Domain, Collects The HTML File Of Each Web Page, And Compresses Them Into A Zip Archive For Download.
- My First Original Tool ^_^
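
The crawl-and-zip idea described above can be sketched roughly as follows. This is not the code from `main.py`; it is a minimal standard-library illustration, and the names `LinkParser`, `fetch_html`, `crawl`, `zip_pages`, and the `max_pages` limit are all assumptions made for this example.

```python
import io
import zipfile
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects href values from <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_html(url):
    """Default fetcher: download a page and decode it as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, fetch=fetch_html, max_pages=50):
    """Breadth-first crawl restricted to the start URL's host."""
    host = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    pages = {}  # maps each visited URL to its HTML
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        try:
            html = fetch(url)
        except Exception:
            continue  # skip pages that fail to load
        pages[url] = html
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" parts.
            link, _ = urldefrag(urljoin(url, href))
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return pages


def zip_pages(pages):
    """Pack the collected pages into an in-memory zip archive."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for index, html in enumerate(pages.values()):
            archive.writestr(f"page_{index}.html", html)
    return buffer.getvalue()
```

Calling `zip_pages(crawl("https://example.com/"))` would return the archive as bytes, which could then be written to disk or sent as a download. The `fetch` parameter is there so the crawl logic can be tried out with a fake fetcher instead of real network requests.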
## INSTALLATION :
* `apt update -y`
* `apt upgrade -y`
* `pkg install python -y`
* `pkg install git`
* `git clone https://github.com/Th3-C0der/Web-Crawler`
* `ls`
* `cd Web-Crawler`
* `pip install -r requirements.txt`
## RUN!:
* `cd Web-Crawler`
* `python main.py`
* Open This URL In Your Browser: `http://127.0.0.1:5000`
## UPDATE :
* To Update The Script → `python update.py`