Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/hootsuite/sens8

Kubernetes controller for Sensu checks
https://github.com/hootsuite/sens8

golang k8s kubernetes kubernetes-controller kubernetes-operator sensu

Last synced: 6 days ago
JSON representation

Kubernetes controller for Sensu checks

Awesome Lists containing this project

README

        

sens8

**Sensu + K8s**

[![CircleCI](https://circleci.com/gh/hootsuite/sens8.svg?style=svg)](https://circleci.com/gh/hootsuite/sens8)

A Kubernetes controller that watches cluster resources (`Deployment`, `Pod` etc.) and runs Sensu checks which are declared in the resource's annotations. For each of the check definitions it will run them at the specified interval, updating the checks with the latest resource info provided by Kubernetes. It communicates directly to Sensu's RabbitMQ endpoint, dynamically adding Sensu [proxy clients](https://sensuapp.org/docs/latest/reference/clients.html#proxy-clients) and optionally removing them on resource deletion or annotation update.

Sens8 effectively acts a Sensu super client. It allows for checks to be run on ephemeral resources such as a pod under a replicaset without having to sidecar the heavy Sensu ruby client, or restart it. By having checks defined in the resources themselves it gives teams greater autonomy and lets checks be pushed out via CD. It also prevents having to manage resource-level checks out of band via configuration management.

Unlike a traditional setup where Sensu's client is installed with check scripts, only a predefined number of checks are supported (i.e., only check definitions are allowed in the annotations). Refer to the [check command documentation](check-docs.md) for what is valid for each resource. Most checks are intended to be run over the network - like making an http request to a pod. For resources such as deployments, only checks based on the status provided by Kubernetes are possible.

A keepalive is run for Sens8 itself, and only itself, since all other checks register under proxy clients. Given its super client nature it's advised to set up alerting/paging appropriately.

### Build

```bash
make all
```

### Deployment

Refer to `kubernetes.yml` for an example deployment + config

### Limitations

* Only RabbitMQ transport is supported (no Redis)
* If Sens8 crashes, all checks that have a ttl will alert
* Subscriptions are ignored
* Only checks are run. No metrics
* Certain resources types such `Deployment` have a limited feature set and only alert of the status provided by Kubernetes.

### Roadmap

See [projects](https://github.com/hootsuite/sens8/projects) for status

Check Schema
============

Checks are defined in the annotation `hootsuite.com/sensu-checks` of a given resource as a json array check objects. The check format is same as [Sensu's check configuration](https://sensuapp.org/docs/0.29/reference/checks.html#check-attributes) minus some fields being ignored. Refer to Sensu's [check definition specification](https://sensuapp.org/docs/0.29/reference/checks.html#check-definition-specification) as a guide.

| field | type | required | example |description |
|-------------|---------|----------|-------------------------------------------|------------|
| name | string | * | `my_check` | Name of the check. Must be unique to the resource it's running on. See [Sensu's check naming](https://sensuapp.org/docs/0.29/reference/checks.html#check-names) |
| command | string | * | `deployment_status --warn 0.9 --crit 0.8` | Check to run. The first chunk of the string is the check id. All other parameters get parsed by the check itself. Environment variables and backticks (sub-shells) are interpolated. Refer to the [check command documentation](check-docs.md) for valid checks that can be run on the given resource |
| interval | int | * | `30` | Interval in seconds between checks |
| deregister | bool | | `true` | Deregister the proxy client sensu on resource deletion or annotation removal. Default value is dependant on resource type. Pod: true, Deployment: false |

All other (arbitrary) fields get passed along with the check result in the same way Sensu client proper does.

Check results will be registered/de-registered with the following template:

| Resource | Source (client name) | Deregister default |
|------------|------------------------------------------|---------|
| Deployment | `deployment-name.deployment.namespace` | `false` |
| Pod | `pod-name.pod.namespace` | `true` |
| Service | `sevice-name.service.namespace` | `false` |
| ReplicaSet | `replicatset-name.replicatset.namespace` | `false` |
| Node | `node-name.node.namespace` | `false` |
| DaemonSet | `daemonSet-name.daemonSet.namespace` | `false` |

**Example check annotation**
```json
[
{
"name": "deployment_status",
"command": "deployment_status --warn 0.9 --crit 0.8",
"interval": 60,
"deregister": true,
"foo": "bar"
}
]
```

**Example check result**

For the given check example above, the check result to Sensu might look like:
```json
[
{
"name": "deployment_status",
"command": "deployment_status --warn 0.9 --crit 0.8",
"interval": 60,
"deregister": true,
"foo": "bar",
"source": "test-app.deployment.default",
"status": 0,
"output": "replicas: 5, updatedReplicas: 5, unavailableReplicas: 0"
}
]
```

## Defaults

It is possible to set defaults for all checks in Sens8's `-config-file` config. Arbitrary fields are respected, but any field defined in the check will override it. It is also possible to set a default for the required `interval`. Example:

```json
{
"defaults": {
"interval": 60,
"occurrences": 3,
"refresh": 900,
"handlers": ["default"],
"pager_team": "production_engineering"
}
}
```

## Example service deployment

```yaml
kind: Deployment
apiVersion: extensions/v1beta1
metadata:
labels:
app: test-app
name: test-app
namespace: default
annotations:
hootsuite.com/sensu-checks: |-
[
{
"name": "deployment_status",
"command": "deployment_status --warn 0.9 --crit 0.8",
"interval": 60,
"occurrences": 3,
"refresh": 900,
"handlers": ["default"],
"pager_team": "production_engineering",
"documentation": "http://example.com/runbook.html",
"deregister": true
}
]
spec:
replicas: 1
revisionHistoryLimit: 1
selector:
matchLabels:
app: test-app
template:
metadata:
labels:
app: test-app
annotations:
hootsuite.com/sensu-checks: |-
[
{
"name": "check_pod_dependencies",
"command": "hs_healthcheck --url http://:::POD_IP::::80/status/dependencies",
"interval": 60,
}
]
spec:
containers:
- name: test-app
image: markeijsermans/debug
command: ["python3"]
args: ["-u", "-m", "http.server", "80"]
imagePullPolicy: Always
ports:
- containerPort: 80
protocol: TCP
```

Checks Commands
===============

Each resource is limited to a subset of commands that can be run against it.

[Check Command Documentation](check-docs.md)

_or_

Get latest docs via: `./sens8 -check-docs`

### Release Process

If master branch contains a git tag, circle-ci will tag the docker image as well. Make sure to push the tag, before the commit. This will likely change when circle-ci 2.0 supports git tags.

### Maintainers

* :octocat: [Mark Eijsermans](https://github.com/markeijsermans)