https://github.com/medik8s/self-node-remediation
Automatic repair for unhealthy Kubernetes nodes
availability k8s kubernetes kubernetes-node kubernetes-operator maintenance operator remediation
- Host: GitHub
- URL: https://github.com/medik8s/self-node-remediation
- Owner: medik8s
- License: apache-2.0
- Created: 2022-03-29T08:50:52.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-11-13T11:48:46.000Z (about 1 month ago)
- Last Synced: 2024-12-06T03:17:12.249Z (16 days ago)
- Topics: availability, k8s, kubernetes, kubernetes-node, kubernetes-operator, maintenance, operator, remediation
- Language: Go
- Homepage: https://www.medik8s.io/
- Size: 18.5 MB
- Stars: 46
- Watchers: 7
- Forks: 17
- Open Issues: 9
Metadata Files:
- Readme: README.md
- License: LICENSE
# Self Node Remediation - Automatic Kubernetes Node Remediation
Existing bare-metal remediation strategies use BMC credentials to power-cycle and/or reprovision the host.
However, some environments either do not include BMCs or have policies
in place that prevent them from being used. Such environments would also benefit from
the ability to safely recover affected workloads and restore cluster capacity (where possible).
The self node remediation controller uses an alternative mechanism that lets a node in a cluster detect its own health
status and take action to remediate itself in case of a failure. While not every remediation event
results in the node returning to a healthy state, this approach does allow the surviving parts of the cluster
to assume the node has reached a safe state, so that its workloads can be automatically recovered.
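
The idea of a node judging its own health and then acting on it can be illustrated with a small Go sketch. This is not the operator's actual implementation: the `Monitor` type, the `Observe` method, the `MaxFailures` threshold, and the `Action` values are all hypothetical names chosen for illustration, and the health signal is reduced to a single boolean per check.

```go
package main

import "fmt"

// Action is the decision a node takes after one health check.
// Hypothetical type, for illustration only.
type Action int

const (
	Continue      Action = iota // node still considers itself healthy enough
	SelfRemediate               // node should move itself to a safe state
)

// Monitor counts consecutive failed health checks.
// MaxFailures is an assumed tuning knob, not a setting of the real operator.
type Monitor struct {
	MaxFailures int
	failures    int
}

// Observe records the result of one health check and returns the action
// the node should take. A healthy result resets the failure counter.
func (m *Monitor) Observe(healthy bool) Action {
	if healthy {
		m.failures = 0
		return Continue
	}
	m.failures++
	if m.failures >= m.MaxFailures {
		return SelfRemediate
	}
	return Continue
}

func main() {
	m := &Monitor{MaxFailures: 3}
	// Simulated sequence of health check results.
	for i, healthy := range []bool{true, false, false, false} {
		if m.Observe(healthy) == SelfRemediate {
			fmt.Printf("check %d: threshold reached, node would remediate itself\n", i)
		} else {
			fmt.Printf("check %d: no action\n", i)
		}
	}
}
```

Requiring several consecutive failures before acting is one simple way to avoid remediating on a transient glitch; a real controller would need a richer health signal than a single boolean.
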
This work can also be useful for clusters with BMC credentials.
## More Info
https://www.medik8s.io/
## Project State
The operator is available in [operator hub](https://operatorhub.io/operator/self-node-remediation).

Self Node Remediation works best together with [Node Health Check Operator](https://operatorhub.io/operator/node-healthcheck-operator).
## Help
Feel free to join our Google group for more information: https://groups.google.com/g/medik8s