Overview
This page covers advanced alert rule configuration beyond basic threshold alerts — including compound multi-condition rules, silencing active alerts during maintenance windows, inhibition rules that suppress lower-severity alerts when a critical one is already firing, and escalation policies.Prerequisites
- An active Xloud account with project access
- At least one notification channel configured (see XIMP Admin — Alert Channels)
Compound Alert Rules
Combine multiple conditions in a single rule using AND/OR logic:- Dashboard
- Rule File
Navigate to Monitoring → Alerting → Alert Rules → New Alert Rule and
click Advanced Mode.In Advanced Mode, add multiple conditions:
This rule fires only when both CPU is above 85% AND available memory is below
15% — reducing false positives from transient CPU spikes.
| Field | Value |
|---|---|
| Condition A | xloud_compute_cpu_utilization > 85 |
| Operator | AND |
| Condition B | xloud_compute_memory_free_pct < 15 |
Silencing Alerts
Silences temporarily suppress alert notifications during planned maintenance. The alert rule continues to evaluate — only notifications are suppressed.- Dashboard
- CLI
Create a silence
Navigate to Monitoring → Alerting → Silences → New Silence.
| Field | Description |
|---|---|
| Matchers | Label selectors that match the alerts to silence (e.g., host=compute-node-03) |
| Duration | How long the silence is active (e.g., 2h) |
| Comment | Reason for the silence (required — links to change ticket) |
| Creator | Your username (auto-populated) |
Inhibition Rules
Inhibition rules suppress lower-severity alerts when a higher-severity alert is already active for the same source. This prevents alert storms during major incidents.Example: Suppress warnings when critical is firing
warning alerts for a host when a critical alert is
already firing for that same host — reducing notification noise during a major outage.
Configure inhibition rules via Monitoring → Administration → Inhibition Rules.
Escalation Policies
Configure multi-tier escalation for critical alerts:Define escalation tiers
Navigate to Monitoring → Alerting → Escalation Policies → New Policy.
| Tier | Channel | Delay | Condition |
|---|---|---|---|
| Tier 1 | PagerDuty on-call | Immediate | Alert fires |
| Tier 2 | Slack ops channel | 5 minutes | Not acknowledged |
| Tier 3 | Page on-call manager | 15 minutes | Still unacknowledged |
GitOps-Based Rule Management
Manage alert rules as code for version-controlled, auditable configurations:Export all current alert rules
Import rules from directory
Next Steps
Metrics & Alerts
Basic alert rule creation for metric thresholds
XIMP Admin — Alert Channels
Configure the notification channels referenced by alert rules
Dashboards
Visualize metrics alongside alert thresholds
Troubleshooting
Diagnose alert rules that are not firing or delivering notifications