https://github.com/datumbrain/webfetcher-tutorial
- Host: GitHub
- URL: https://github.com/datumbrain/webfetcher-tutorial
- Owner: datumbrain
- Created: 2020-02-20T10:20:53.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2022-09-01T23:38:55.000Z (almost 4 years ago)
- Last Synced: 2025-06-25T13:07:11.290Z (11 months ago)
- Language: Scala
- Size: 16.6 KB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 2
Metadata Files:
- Readme: README.md
README
# Datum Brain — Web Fetcher
It fetches a web page from the given URL, parses it, counts the inbound and outbound links, and then stores the result in a JSON file.
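The parse, count, and serialize steps could look roughly like the sketch below. It is not the repository's code: `LinkReport`, `LinkCounter`, the JSON field names, and the `isInbound` predicate are all hypothetical, and the regex is a stand-in for a proper HTML parser.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.{Files, Paths}

// Hypothetical report shape; the actual JSON field names are not documented here.
final case class LinkReport(url: String, inbound: Int, outbound: Int) {
  def toJson: String =
    s"""{"url":"${escape(url)}","inbound":$inbound,"outbound":$outbound}"""

  // Escapes backslashes and double quotes so the URL is a valid JSON string.
  private def escape(s: String): String =
    s.replace("\\", "\\\\").replace("\"", "\\\"")
}

object LinkCounter {
  // Naive extraction of double-quoted href values; a real HTML parser is more robust.
  private val Href = """href\s*=\s*"([^"]*)"""".r

  def extractHrefs(html: String): List[String] =
    Href.findAllMatchIn(html).map(_.group(1)).toList

  // Splits the extracted links with a caller-supplied predicate and counts each side.
  def report(url: String, html: String, isInbound: String => Boolean): LinkReport = {
    val (in, out) = extractHrefs(html).partition(isInbound)
    LinkReport(url, in.size, out.size)
  }

  // Writes the report as UTF-8 JSON to the given path.
  def save(report: LinkReport, path: String): Unit = {
    Files.write(Paths.get(path), report.toJson.getBytes(StandardCharsets.UTF_8))
    ()
  }
}
```

The page body itself could be read with `scala.io.Source.fromURL(url, "UTF-8").mkString` before being passed to `LinkCounter.report`. Passing `isInbound` as a parameter keeps the classification rule separate from extraction and output.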
#### Known Bugs:
- The following inbound links are incorrectly reported as outbound links:
  - links that start without a domain, e.g. `/home`
  - `www.google.com` -> `www.google.com.pk`
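One possible way to handle the first case is to resolve every link against the page URL before comparing hosts, so that `/home` inherits the page's host. The sketch below assumes that approach; `LinkClassifier` and `normalizeHost` are hypothetical names, not part of this repository. It does not address the `www.google.com` -> `www.google.com.pk` case, which would need a rule for treating related domains as the same site.

```scala
import java.net.URI
import scala.util.Try

object LinkClassifier {
  // Lower-cases a host and drops a leading "www." so www.example.com matches example.com.
  private def normalizeHost(host: String): String =
    host.toLowerCase.stripPrefix("www.")

  // Resolves href against the page URL first, then compares normalized hosts.
  // Returns None when the href cannot be parsed or has no host (e.g. mailto: links).
  def isInbound(pageUrl: String, href: String): Option[Boolean] =
    for {
      page     <- Try(new URI(pageUrl)).toOption
      resolved <- Try(page.resolve(href.trim)).toOption
      pageHost <- Option(page.getHost)
      linkHost <- Option(resolved.getHost)
    } yield normalizeHost(pageHost) == normalizeHost(linkHost)
}
```

With this, `LinkClassifier.isInbound("https://www.google.com/search", "/home")` yields `Some(true)`, while a link to a different host yields `Some(false)`.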