https://github.com/cseas/crawler

Recursive web crawler

Host: GitHub
URL: https://github.com/cseas/crawler
Owner: cseas
License: mit
Created: 2018-02-01T18:26:54.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2018-06-19T19:19:54.000Z (about 8 years ago)
Last Synced: 2025-05-17T14:41:22.698Z (about 1 year ago)
Topics: crawler, python, seed-webpage
Language: Python
Size: 5.86 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# crawler

Web crawler to recursively extract all the links reachable from a seed webpage.

Instructions:

1. Run the script `crawler.py` with Python 3.
2. Paste the link to the seed webpage.
3. Press Enter.
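
For readers who want a sense of what extracting all links reachable from a seed page involves, here is a minimal sketch using only the Python standard library. It is not the code in `crawler.py`. The names `LinkParser`, `fetch_html`, `crawl`, and the `max_pages` limit are assumptions made for illustration, and the traversal uses an explicit queue (breadth-first) rather than recursive function calls, which avoids hitting Python's recursion limit on large sites.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_html(url):
    """Download a page and return its text (hypothetical helper)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(seed, fetch=fetch_html, max_pages=100):
    """Return the set of links discovered from seed, fetching at most max_pages pages."""
    seen = {seed}
    queue = deque([seed])
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        visited += 1
        try:
            html = fetch(url)
        except (OSError, ValueError):
            continue  # skip pages that cannot be fetched
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links against the current page and drop #fragments.
            link, _ = urldefrag(urljoin(url, href))
            if urlparse(link).scheme in ("http", "https") and link not in seen:
                seen.add(link)
                queue.append(link)
    return seen
```

Under these assumptions, calling `crawl(seed_url)` with the pasted seed link returns a set of absolute URLs. The `seen` set ensures each page is queued only once, so pages that link back to each other do not cause an endless loop, and `max_pages` caps how many pages are downloaded.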