https://github.com/xetys/open-psql-backup

A small bash based tool to manage PostgeSQL backups in a clever way in k8s
https://github.com/xetys/open-psql-backup

backup k8s postgresql

Last synced: 11 months ago
JSON representation

A small bash based tool to manage PostgeSQL backups in a clever way in k8s

Host: GitHub
URL: https://github.com/xetys/open-psql-backup
Owner: xetys
License: apache-2.0
Created: 2018-01-30T17:00:04.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2021-11-23T17:19:18.000Z (over 4 years ago)
Last Synced: 2025-04-29T11:42:23.915Z (about 1 year ago)
Topics: backup, k8s, postgresql
Language: Shell
Homepage:
Size: 12.7 KB
Stars: 7
Watchers: 1
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Open PSQL Backup for Kubernetes

This project contains a small tool for automatic backup management of PostgreSQL databases.

## Motivation

One of the core features is the **regression based deleting**. This means that you will find maximum three hours old backups from today
while just one backup per week from three months ago. That principle is based on the assumption, that normally we need high frequent
updates only to restore from crash, which possibly happened today or yesterday. If we are interested in data, from past year,
it's ok to have just one snapshot for the week. If a database doesn't grow over time, the overall storage needed for the backup
converges to a certain size, instead of growing linear. In a real world, you possibly have a linear-like growth of your database size.
A linear grow of backup storage on top, means you might have a growth of n*2 if you backup all three hours. With regression based deleting
and a constant growth of database size, the backup storage growth is still linear.

## How to use

There are three types of get open-psql-working in your Kubernetes cluster:

* As a Deployment with daemon mode enabled
* One-Time job
* Cronjob

### Deployment

Just run

```
$ kubectl run open-psql-backup --image=xetys/open-psql-backup --env="DAEMON_MODE=1"
```

To run open-psql-backup in the current namespace. Deploy it in a different namespace if you wan't to watch postgres containers in a other namespace than default.
This deployment method is highly compatible with kuberntes 1.3+, as it only uses deployments.

### Onetime job

Run:

```
$ kubectl apply -f k8s/job.yaml
```

to run the tool one time or

## Cronjob

```
$ kubectl apply -f k8s/cronjob.yaml
```

## How this tool finds postgres containers

Open-psql-backup looks for any deployments with the image `postgresql`.

The recent version of this tool used labels to include marked instances. The current version automatically
finds all PostgresSQL instances. Maybe in future, it would be better to allow in- and exclusion of containers.

Note: Currently this tool expects Postgres instances with no password set!

## ./k8s-psql-tool.sh

The core bash script of this project is the k8s-psql-tool. It allows several operations on PSQL databases, such
as backup (either with the old timestamp, or with a given name), restore by name, and executing queries on all instances

usage: ./k8s-psql-tool.sh [OPTIONS...]

Actions:

backup: creates a backup of all PostgreSQL databases

restore: restores PostgreSQL databases

exec: executes a query against all PostgreSQL databases

Options:

--name,-n: do not use a timestamp as backup name, but specify it

--filter,-f: perform action on a filtered set of databases

Examples:
```bash
# just create a backup
./k8s-psql-tool.sh backup
# create a backup in the directory 'my-backup'
./k8s-psql-tool.sh backup -n my-backup
# restore backup 'my-backup'
./k8s-psql-tool.sh restore -n my-backup
# restore backup 'my-backup', but only uaa databases
./k8s-psql-tool.sh restore -n my-backup -f uaa
# run some sql on all dbs
./k8s-psql-tool.sh exec 'select * from table'
# run some sql on uaa database
./k8s-psql-tool.sh exec -f uaa 'select * from table'
```

## Where the backups are stored

The default configuration stores backups in the path `/backups` of the volume named 'backup-dir'. You can change that by using a `PersistentVolumeClaim`
or any other storage strategy you like.

### example with CephFS

The current example shows a binding to a static CephFS using a local secret. With this, you can open the backups
on all nodes by doing a manual ceph mount on that FS to get the backups. This approach works the same in [rook.io](https://rook.io) and its
shared filesystem.

### example with CIFS (old)

My initial case was using the [Storage Boxes](https://www.hetzner.de/storage-box) from Hetzner, which allow CIFS connections. On every node, I've `cifs-utils` installed and mounted
a CIFS mount into `/var/backups/database`.

This could be achieved by adding

```
//uXXX.your-storagebox.de/backup/db-backup /var/backups/databases cifs user,uid=500,rw,suid,username=uXXX,password=YYY 0 0
```

to `/etc/fstab` and mounting it using

```
$ mount -t cifs -o username=uXXX,password=YYY //uXXX.your-storagebox.de/backup/db-backup /var/backups/databases
```

and then installing open-psql-backup

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/xetys/open-psql-backup

Awesome Lists containing this project

README