Infrastructure monitoring
Setting hardware thresholds
Alert when CPU, memory, disk, or temperature cross a limit.
Hardware monitors fire alerts when the Capture agent reports a metric above its threshold. Thresholds live on the monitor itself and are configured when you create or edit a hardware monitor.
Configure thresholds
- Open Infrastructure in the sidebar.
- Create a new hardware monitor, or open an existing one and click Configure.
- Under Alert thresholds, set values for:
cpuAlertThreshold— CPU usage (percent)memoryAlertThreshold— memory usage (percent)diskAlertThreshold— disk usage (percent)tempAlertThreshold— CPU/disk temperature (degrees Celsius)
- Save.
Leave a field empty to skip alerting on that metric.
How thresholds trigger
Checkmate compares each Capture reading against the saved threshold. When a metric goes over the limit, the monitor's status flips to exceeded and notifications fire through the channels attached to that monitor. When the metric drops back below the threshold, the monitor returns to up.
Suggested starting points
| Resource | Typical alert level |
|---|---|
| CPU | 85% |
| Memory | 90% |
| Disk | 85% |
| Temperature | 75 °C |
Tune from there based on what your workloads actually run at.