Skip to content

deltachat/pyinfra-borgbackup

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pyinfra module to deploy our backup solution

This module deploys borgbackup, sets up a backup.sh script, and a cron job which executes it nightly. Admins need to generate a passphrase and store it in our pass repository.

Usage

To backup a host (called {host} in the rest of this guide) to our backup server hetzner-backup, you first need to create an SSH key, and add the public key to the backup server's /home/tech/.ssh/authorized_keys file. To do this, run the following commands (replace {host} with the name of your host):

export HOST={host}                                      # enter the name of the host you want to backup here
ssh-keygen -q -t ed25519 -f /tmp/$HOST-backup -C $HOST-backup -N ""
scp hetzner-backup:.ssh/authorized_keys /tmp/hetzner-backup_authorized_keys
echo 'command="borg serve --restrict-to-path /home/backups/'$HOST'/",restrict' $(cat /tmp/$HOST-backup.pub) >> /tmp/hetzner-backup_authorized_keys
scp /tmp/hetzner-backup_authorized_keys hetzner-backup:.ssh/authorized_keys

Now you need to generate a passphrase for the borg repository with pass generate -n delta/{host}/borg-passphrase. This creates an alphanumeric passphrase for the repository.

Then you can add this module to your pyinfra deploy.py script like this:

from pyinfra import host
from pyinfra.facts.files import File
from pyinfra_borgbackup import deploy_borgbackup

host_name = "host"
borg_repo = f"hetzner-backup:backups/{host_name}"
borg_passphrase = "s3cr3t"
borg_initialized = host.get_fact(File, "/root/.ssh/backupkey")
deploy_borgbackup(host_name, borg_passphrase, borg_repo, borg_initialized)

After it has been deployed, you should login to your host via SSH and run /root/backup.sh manually at least once, to create an initial backup and directly spot possible mistakes.

Use Your Own Backup Server

If you are not part of the deltachat admin team, you can not use the default backup server of this module. In this case, you need to upload the /root/.ssh/config file separately, e.g. in your deploy.py file.

You can take a look at our /root/.ssh/config file and adjust it to your needs. To upload it during your deploy.py execution, add somewhere above the deploy_borgbackup() function call in your deploy.py file:

files.put(
    name="create SSH config",
    src="path/to/the/local/ssh/config",
    dest="/root/.ssh/config",
    user="root",
    group="root",
    mode="600",
)

Stop Services During the Backup

During backup, it is recommended to halt services which write data to disk, so the backups don't get inconsistent. To stop systemd services or docker containers during the borg create step of the backup.sh script, you can create a custom python script.

The backup.sh script will try to run /root/backup-pre.py, if the file exists; it calls it with the argument stop before borg create and with the argument start in the end (also if the backup fails for some reason).

You can use the backup-pre.py script from this repository as a template to adjust it for the specific server. You need to upload the script to /root/backup-pre.py in your deploy.py script, e.g. directly before the deploy_borgbackup() call:

from pyinfra import host
from pyinfra.facts.files import File
from pyinfra_borgbackup import deploy_borgbackup

[...]
files.rsync(
    name="Upload backup-pre.py",
    src="files/root/backup-pre.py",
    dest="/root/",
)
deploy_borgbackup("bomba", borg_passphrase, borg_repo, borg_initialized)

Enable Prometheus Monitoring For Borgbackup

If you pass a prometheus path to deploy_borgbackup like this:

deploy_borgbackup(
    [...]
    prometheus_file="/var/lib/node_exporter/textfile_collector/borgbackup_finished.prom",
)

then the backup script will track in this file when it finished successfully, in "seconds since Jan 1 1970":

borgbackup_last_completed 1760518220

Configure a Grafana Alert

You can use a tool like Grafana to get alerted when the backup doesn't finish for 2 days. For example with the following Grafana alert rule:

    1. Enter alert rule name: backup failed
    1. Define query and alert condition
    • A: prometheus, Select "code" in the top right, Metrics browser: time() - borgbackup_last_completed{instance="bomba:9100"}
    • B you can leave as it is
    • C (Threshold): Input A Is Below 172800 (2 days in seconds)
    1. Set evaluation behavior
    • Folder: bomba
    • Evaluation group: bomba
    • Pending period: 10m
    1. You can leave as it is
    1. Add annotations
    • Summary: The last completed backup run has been over two days ago.

Click "Save rule and exit" to confirm.

About

pyinfra deploy for borgbackup

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published