Skip to content

Monitoring Task: System Uptime and Availability

Description

Track system uptime and detect downtime or reboot events.

Detection Methods

  • Prometheus node exporter
  • uptime and who -b
  • Log analysis or external ping checks

Triage Steps

  1. Check system uptime:
    uptime
    
  2. Check last boot time:
    who -b
    
  3. Use external uptime monitoring (e.g., Uptime Kuma or StatusCake)

Preventive Actions

  • Monitor for unexpected reboots
  • Set watchdogs or health check alerts
  • Redundant nodes for HA setups

Tools & Commands

  • uptime, who, last, Prometheus, Grafana, uptime monitors