Monitoring Task: System Uptime and Availability
Description
Track system uptime and detect downtime or reboot events.
Detection Methods
- Prometheus node exporter
- uptime and who -b
- Log analysis or external ping checks
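
As a rough illustration of the command-based method, the sketch below reads the uptime counter directly and formats it. It assumes a Linux host where /proc/uptime is available; the function names format_uptime and uptime_seconds are made up for this example.

```shell
#!/bin/sh
# Sketch: derive uptime from /proc/uptime (Linux-specific assumption).

# format_uptime SECONDS
# Prints whole seconds as days, hours, and minutes, e.g. "1d 2h 3m".
format_uptime() {
    secs=$1
    days=$((secs / 86400))
    hours=$(( (secs % 86400) / 3600 ))
    mins=$(( (secs % 3600) / 60 ))
    printf '%sd %sh %sm\n' "$days" "$hours" "$mins"
}

# uptime_seconds
# Prints whole seconds since boot.
uptime_seconds() {
    # /proc/uptime holds two numbers: uptime and idle time, in seconds.
    # Keep the first field and drop its fractional part.
    read -r up rest < /proc/uptime
    printf '%s\n' "${up%%.*}"
}

# Only query the kernel when the file is readable (i.e. on Linux).
if [ -r /proc/uptime ]; then
    format_uptime "$(uptime_seconds)"
fi
```

For example, format_uptime 93784 prints "1d 2h 3m". A small value right after a period of steady growth suggests the host rebooted in between.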
Triage Steps
- Check system uptime:
uptime
- Check last boot time:
who -b
- Use external uptime monitoring (e.g., Uptime Kuma or StatusCake)
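
The triage steps above can be strung together in one script. This is a minimal sketch, not a definitive procedure: run_step is a hypothetical helper, TARGET_URL is an assumed environment variable naming a health endpoint, and the last and curl invocations are additions that the steps themselves do not prescribe.

```shell
#!/bin/sh
# Sketch: run the triage commands in order and label each result.

# run_step LABEL COMMAND [ARGS...]
# Prints a header, runs the command, and reports a missing or failing
# command instead of aborting, so one broken step does not stop the rest.
run_step() {
    label=$1
    shift
    printf '== %s ==\n' "$label"
    if ! command -v "$1" >/dev/null 2>&1; then
        printf 'skipped: %s not found\n' "$1"
        return 0
    fi
    "$@" || printf 'failed: %s (exit %s)\n' "$1" "$?"
}

run_step 'Current uptime' uptime
run_step 'Last boot time' who -b
# -x adds shutdown entries; -n 5 limits output to the five newest records.
run_step 'Recent reboots and shutdowns' last -n 5 -x reboot shutdown

# Optional outside-in check. TARGET_URL is an assumption for this sketch;
# a hosted monitor such as Uptime Kuma or StatusCake fills this role
# from a separate network in practice.
if [ -n "${TARGET_URL:-}" ]; then
    run_step 'External HTTP check' \
        curl -fsS -o /dev/null --max-time 5 "$TARGET_URL"
fi
```

Keeping every step behind run_step means the script still produces a complete report on minimal systems where who, last, or curl is not installed.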
Preventive Actions
- Monitor for unexpected reboots
- Set up watchdogs or health-check alerts
- Deploy redundant nodes for high-availability (HA) setups
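
One possible way to monitor for reboots is to compare the kernel's boot ID against the value saved on the previous run. The sketch below assumes a Linux host (the boot ID file is Linux-specific); check_reboot and the STATE_FILE location are hypothetical names chosen for illustration. It detects that a reboot happened, not whether it was planned, so the alert still needs to be matched against maintenance records.

```shell
#!/bin/sh
# Sketch: flag a reboot by comparing the current boot ID with the one
# saved on the previous run. Meant to run periodically, e.g. from cron.

# check_reboot STATE_FILE CURRENT_ID
# Prints "first-run", "rebooted", or "unchanged", then stores CURRENT_ID.
check_reboot() {
    state_file=$1
    current_id=$2
    if [ ! -f "$state_file" ]; then
        result=first-run
    elif [ "$(cat "$state_file")" = "$current_id" ]; then
        result=unchanged
    else
        result=rebooted
    fi
    printf '%s\n' "$current_id" > "$state_file"
    printf '%s\n' "$result"
}

BOOT_ID_FILE=/proc/sys/kernel/random/boot_id   # Linux-specific
STATE_FILE=${STATE_FILE:-/tmp/last_boot_id}    # hypothetical location

if [ -r "$BOOT_ID_FILE" ]; then
    outcome=$(check_reboot "$STATE_FILE" "$(cat "$BOOT_ID_FILE")")
    if [ "$outcome" = rebooted ]; then
        # Placeholder: swap in a real alert (mail, webhook, pager).
        echo "ALERT: host rebooted since the last check" >&2
    fi
fi
```

The boot ID changes on every boot, so this check is not affected by clock adjustments the way a comparison of boot timestamps could be.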
Tools & Commands
uptime, who, last, Prometheus, Grafana, uptime monitors