Monitoring Task: System Uptime and Availability¶
Description¶
Track system uptime and detect downtime or reboot events.
Detection Methods¶
- Prometheus node exporter
uptimeandwho -b- Log analysis or external ping checks
Triage Steps¶
- Check system uptime:
uptime - Check last boot time:
who -b - Use external uptime monitoring (e.g., Uptime Kuma or StatusCake)
Preventive Actions¶
- Monitor for unexpected reboots
- Set watchdogs or health check alerts
- Redundant nodes for HA setups
Tools & Commands¶
uptime,who,last, Prometheus, Grafana, uptime monitors