Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/techguy-bhushan/web-spider
multi-threaded webs crawler
https://github.com/techguy-bhushan/web-spider
crawler python web-spider
Last synced: 19 days ago
JSON representation
multi-threaded webs crawler
- Host: GitHub
- URL: https://github.com/techguy-bhushan/web-spider
- Owner: techguy-bhushan
- Created: 2017-09-22T04:34:37.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-09-23T09:27:13.000Z (over 7 years ago)
- Last Synced: 2024-11-17T02:42:15.533Z (3 months ago)
- Topics: crawler, python, web-spider
- Language: Python
- Homepage:
- Size: 1.95 KB
- Stars: 2
- Watchers: 0
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Web-Spider
multi-threaded webs crawler# Required in System for run project
* PythonRun trigger.py, before run this script please edit this script set HOME_PAGE and PROJECT_NAME parameter value
PROJECT_NAME value is name of directory where all data will store, HOME_PAGE is base url of web site which you want crawler e.x www.w2school.comIt will collect all links from that web site and store in crawled.txt file