Skip to content

Playbook: Weekly System Health Checks

Goal

Proactively identify issues or misconfigurations across key system areas.

Frequency

Every Monday or as part of weekly ops review.

Checklist

  • Uptime and load averages:
    uptime; w
    
  • Disk usage:
    df -hT
    
  • Top CPU/mem processes:
    top -b -n1 | head -20
    
  • Docker/Podman container health:
    docker ps --format '{{.Names}}: {{.Status}}'
    
  • Log anomalies:
    journalctl -p 3 -xb
    

Bonus Checks

  • Backups up-to-date
  • Certs not close to expiry
  • System updates pending

Output

Document in internal log, Ops Notion, or markdown status entry.