https://github.com/codedotjs/url-extractor
:snake: grab and save URLs from a website
- Host: GitHub
- URL: https://github.com/codedotjs/url-extractor
- Owner: CodeDotJS
- Created: 2015-05-07T18:47:03.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2015-05-07T18:50:23.000Z (over 10 years ago)
- Last Synced: 2025-03-20T00:41:19.979Z (9 months ago)
- Language: Python
- Homepage:
- Size: 113 KB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# web_crawler
A simple web crawler written in Python that crawls the website you enter and saves the URLs it finds to a .txt file.
Requirements:
- re module (part of the Python standard library; no pip install needed)
- urllib module (part of the Python standard library; no pip install needed)
----------------------------------------------------------------------------------------------------------------
Download the file, then run it:
cd path/to/file
python url-fetch.py
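
To illustrate the general idea, here is a minimal sketch of how a crawler of this kind might work with only `re` and `urllib`. It is not the contents of `url-fetch.py`. The names `HREF_PATTERN`, `extract_urls`, `save_urls`, `crawl`, and the default output file `urls.txt` are assumptions made for this example, and the sketch targets Python 3, while the original script may have been written for Python 2.

```python
import re
import urllib.request

# Hypothetical pattern: absolute http(s) links inside href="..." or href='...'.
# A regex is only a rough approximation of HTML parsing.
HREF_PATTERN = re.compile(r'''href=["'](https?://[^"'\s>]+)["']''', re.IGNORECASE)


def extract_urls(html):
    """Return the unique absolute URLs found in href attributes, in order of appearance."""
    seen = set()
    urls = []
    for url in HREF_PATTERN.findall(html):
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


def save_urls(urls, path):
    """Write one URL per line to a text file."""
    with open(path, "w", encoding="utf-8") as f:
        for url in urls:
            f.write(url + "\n")


def crawl(page_url, out_path="urls.txt"):
    """Download a single page, extract its links, and save them to out_path."""
    with urllib.request.urlopen(page_url) as response:
        html = response.read().decode("utf-8", errors="replace")
    urls = extract_urls(html)
    save_urls(urls, out_path)
    return urls
```

Under these assumptions, calling `crawl("https://example.com")` would fetch that one page and write the links it contains to `urls.txt`. The sketch skips relative links and does not follow links to further pages.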