https://github.com/grafana/clusterurl
A simple Go package for processing and clustering a URL to extract a generic page route from it
https://github.com/grafana/clusterurl
keep
Last synced: 6 months ago
JSON representation
A simple Go package for processing and clustering a URL to extract a generic page route from it
- Host: GitHub
- URL: https://github.com/grafana/clusterurl
- Owner: grafana
- License: apache-2.0
- Created: 2023-12-07T00:47:27.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-12-07T02:06:42.000Z (over 2 years ago)
- Last Synced: 2025-01-29T11:52:01.471Z (about 1 year ago)
- Topics: keep
- Language: Go
- Homepage:
- Size: 16.6 KB
- Stars: 6
- Watchers: 7
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ClusterURL
The goal of this package is to provide a simple way to process and cluster a URL to extract a generic page route from it. These generic routes can then be used to aggregate data from multiple URLs under a single route.
This is done by a combination of a rules-based parser combined with a ML model trained to determine if a string is [gibberish](https://www.merriam-webster.com/dictionary/gibberish) or not.