https://github.com/converged-computing/metrics-operator
Testing designs for a benchmarking operator (in experimental mode!)
https://github.com/converged-computing/metrics-operator
converged-computing high-performance-computing hpc kubernetes metrics operator
Last synced: 5 months ago
JSON representation
Testing designs for a benchmarking operator (in experimental mode!)
- Host: GitHub
- URL: https://github.com/converged-computing/metrics-operator
- Owner: converged-computing
- License: mit
- Created: 2023-07-23T21:56:22.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2024-09-24T18:13:07.000Z (over 1 year ago)
- Last Synced: 2025-09-10T14:49:59.273Z (9 months ago)
- Topics: converged-computing, high-performance-computing, hpc, kubernetes, metrics, operator
- Language: Go
- Homepage: https://converged-computing.github.io/metrics-operator/
- Size: 18.2 MB
- Stars: 5
- Watchers: 4
- Forks: 0
- Open Issues: 10
-
Metadata Files:
- Readme: README.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: .github/CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# metrics-operator

Developing metrics and a catalog of applications to assess different kinds of Kubernetes performance.
We likely will choose different metrics that are important for HPC.
Note that I haven't started the operator yet because I'm [testing ideas for the design](hack/test).
To learn more:
- ⭐️ [Documentation](https://converged-computing.github.io/metrics-operator/) ⭐️
- 🐯️ [Python Module](https://pypi.org/project/metricsoperator/) 🐯️
## Dinosaur TODO
- Figure out issue with errors.IsNotFound not working...
- We need a way for the entrypoint command to monitor (based on the container) to differ (potentially)
- For larger metric collections, we should have a log streaming mode (and not wait for Completed/Successful)
- For services we are measuring, we likely need to be able to kill after N seconds (to complete job) or to specify the success policy on the metrics containers instead of the application
- Add assertions checking for python tests
- Plotting examples (python parsers) needed for
- io-sysstat
- app-kripke
- app-quicksilver
- app-pennant
## License
HPCIC DevTools is distributed under the terms of the MIT license.
All new contributions must be made under this license.
See [LICENSE](https://github.com/converged-computing/cloud-select/blob/main/LICENSE),
[COPYRIGHT](https://github.com/converged-computing/cloud-select/blob/main/COPYRIGHT), and
[NOTICE](https://github.com/converged-computing/cloud-select/blob/main/NOTICE) for details.
SPDX-License-Identifier: (MIT)
LLNL-CODE- 842614