2025: Deployment and Monitoring

Goal: Reduce the time required for maintenance.

Targets:

  • Infrastructure changes are rolled out automatically...
    • for VMs
    • for physical hosts
  • Monitoring notifies OPS if a
    • host has problems (storage, memory, load) or is unreachable (SSH, ping)
    • service is misbehaving (port, metrics)
  • We have remote access to all important machines, including during boot.
    • Automatic HDD unlock with secure boot
    • Remote management option on important hosts
  • Commits are tested automatically to reduce the risk of breaking things.
  • There is a disaster recovery path for our
    • consumer-facing services (cloud, vault)
    • OPS-critical services (backups, VPN)
    • developer services (git, CI)
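
The availability part of the monitoring target (is SSH or a service port reachable?) could be sketched as below. This is only an illustration, not the tooling chosen for this milestone: the function names `port_is_open` and `failed_checks` and the example host names are hypothetical, and a real setup would more likely use a dedicated monitoring system than a hand-written script.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS resolution failures.
        return False


def failed_checks(checks: dict[str, tuple[str, int]]) -> list[str]:
    """Return the names of all checks whose TCP port is unreachable."""
    return [
        name
        for name, (host, port) in checks.items()
        if not port_is_open(host, port)
    ]


if __name__ == "__main__":
    # Hypothetical hosts, for illustration only.
    checks = {
        "git ssh": ("git.example.org", 22),
        "cloud https": ("cloud.example.org", 443),
    }
    for name in failed_checks(checks):
        print(f"ALERT: {name} is not reachable")
```

A TCP connect only shows that something is listening on the port. The "metrics" and resource checks (storage, memory, load) need data from the host itself and are not covered by this sketch.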

Backlog (6)

  • #49 opened 2025-03-24 21:33:28 +01:00 by fabianhauser (5 / 65)
  • #62 opened 2025-04-27 15:14:50 +02:00 by fabianhauser
  • #8 opened 2024-10-02 19:06:15 +02:00 by fabianhauser (1 / 9)
  • #6 opened 2024-10-02 19:01:53 +02:00 by fabianhauser (0 / 4)
  • #53 opened 2025-04-12 13:45:39 +02:00 by fabianhauser (0 / 4)

In Progress (0)