https://github.com/philipglazman/sprinter
Basic limited web crawler that prints links limited to a single domain.
- Host: GitHub
- URL: https://github.com/philipglazman/sprinter
- Owner: philipglazman
- Created: 2021-09-02T13:57:47.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-09-02T14:01:25.000Z (over 3 years ago)
- Last Synced: 2025-01-01T12:15:10.726Z (5 months ago)
- Language: Go
- Size: 73.2 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Sprinter
Sprinter is a web crawler that looks for links on the same domain.

## Installation
Build with `make`.

The program accepts a flag `-root=`.
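
To illustrate what "links on the same domain" can mean in practice, here is a minimal sketch of a same-host filter. It is not taken from Sprinter's source: the function name `sameDomain`, the choice to compare hostnames case-insensitively, and the `https://example.com/` root are all assumptions made for the example. The root URL stands in for whatever value would be passed via `-root=`.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// sameDomain is a hypothetical helper (not from the Sprinter code base).
// It resolves link against rootURL and reports the absolute URL plus
// whether it is an http(s) URL on the same host as the root.
func sameDomain(rootURL, link string) (string, bool) {
	root, err := url.Parse(rootURL)
	if err != nil {
		return "", false
	}
	ref, err := url.Parse(link)
	if err != nil {
		return "", false
	}
	// Turn relative links such as "/about" into absolute URLs.
	abs := root.ResolveReference(ref)
	// Skip mailto:, javascript:, and other non-web schemes.
	if abs.Scheme != "http" && abs.Scheme != "https" {
		return "", false
	}
	// Drop the fragment so "/blog" and "/blog#top" count as one page.
	abs.Fragment = ""
	return abs.String(), strings.EqualFold(abs.Hostname(), root.Hostname())
}

func main() {
	root := "https://example.com/" // assumed value for illustration
	links := []string{
		"/about",
		"https://example.com/blog#top",
		"https://other.org/",
		"mailto:hi@example.com",
	}
	for _, l := range links {
		if abs, ok := sameDomain(root, l); ok {
			fmt.Println(abs)
		}
	}
}
```

Comparing only the hostname treats `www.example.com` and `example.com` as different sites; a real crawler may want a looser or stricter rule.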
## TODO
* Handle HTTP 429, 403, and other responses related to rate limiting.
* If we can find a sitemap, we should use it.
* It would be nice to add integration tests and fuzz tests.
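
As a starting point for the rate-limiting item above, the following sketch shows one possible way to decide how long to wait after a 429 or 403 response. It is an assumption about how this could be done, not existing Sprinter behavior: `retryDelay` is a hypothetical helper, treating 403 as a rate-limit signal is a guess based on the TODO, and the backoff values (1s doubling up to 60s) are arbitrary. Only the delay-in-seconds form of the `Retry-After` header is handled; the HTTP-date form is ignored.

```go
package main

import (
	"fmt"
	"net/http"
	"strconv"
	"time"
)

// retryDelay is a hypothetical helper. Given a response status, the raw
// Retry-After header value, and the number of retries already made, it
// reports how long to wait and whether the request should be retried.
func retryDelay(status int, retryAfter string, attempt int) (time.Duration, bool) {
	if status != http.StatusTooManyRequests && status != http.StatusForbidden {
		return 0, false // not treated as rate limiting in this sketch
	}
	// Prefer the server's hint when it is given as a number of seconds.
	if secs, err := strconv.Atoi(retryAfter); err == nil && secs >= 0 {
		return time.Duration(secs) * time.Second, true
	}
	// Otherwise fall back to exponential backoff: 1s, 2s, 4s, ...
	if attempt < 0 {
		attempt = 0
	}
	if attempt > 6 {
		attempt = 6 // keeps the shift small; the cap below applies anyway
	}
	d := time.Second << uint(attempt)
	if d > time.Minute {
		d = time.Minute
	}
	return d, true
}

func main() {
	// In a crawler these values would come from resp.StatusCode and
	// resp.Header.Get("Retry-After").
	for attempt := 0; attempt < 3; attempt++ {
		if d, ok := retryDelay(http.StatusTooManyRequests, "", attempt); ok {
			fmt.Println("attempt", attempt, "wait", d)
		}
	}
}
```

A crawler would typically also cap the total number of retries per URL so that a persistently blocked host does not stall the crawl.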