Overview
Grafana connects to Prometheus and other Xloud data sources to provide operational dashboards for infrastructure metrics, service health, storage utilization, and network performance. Grafana is included in the XIMP monitoring stack and is pre-configured with a Prometheus data source pointing to the cluster’s Prometheus instance.
Prerequisites
- Grafana 9.0 or later (included in XIMP stack)
- Prometheus deployed and scraping targets (see Prometheus Integration)
- Grafana admin credentials (default: sourced from XDeploy configuration)
- Network access from the Grafana host to Prometheus on port 9090
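The network prerequisite can be checked before configuring anything in Grafana. The following is a minimal sketch that tests whether a TCP connection to Prometheus can be opened from the Grafana host. The function name `prometheus_reachable` is illustrative and not part of any Xloud tooling.

```python
import socket


def prometheus_reachable(host: str, port: int = 9090, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable networks.
        return False
```

Run on the Grafana host, `prometheus_reachable("10.0.1.71")` would probe the example Prometheus address used in this guide. A `False` result usually points to a firewall rule or a Prometheus instance that is not listening on port 9090.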
Data Source Configuration
- Dashboard
- CLI (API)
Open data source settings
Log in to Grafana and navigate to Configuration → Data Sources → Add data source.
Select Prometheus from the time series databases section.
Configure connection
| Field | Value | Description |
|---|---|---|
| Name | Xloud Prometheus | Display name in dashboard queries |
| URL | http://10.0.1.71:9090 | Prometheus server address |
| Scrape interval | 15s | Must match global.scrape_interval in prometheus.yml |
| HTTP Method | POST | Sends queries in the request body, which avoids URL length limits on long queries |
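The same settings can be applied through Grafana's HTTP API instead of the UI. The sketch below builds a `POST /api/datasources` request from the table values. It assumes basic authentication with the Grafana admin credentials and a Grafana base URL that you supply. The helper names are illustrative, and the request is only constructed, not sent.

```python
import base64
import json
import urllib.request


def build_datasource_payload() -> dict:
    """Data source definition mirroring the values in the table above."""
    return {
        "name": "Xloud Prometheus",
        "type": "prometheus",
        "url": "http://10.0.1.71:9090",
        "access": "proxy",  # the Grafana backend proxies requests to Prometheus
        "jsonData": {
            "httpMethod": "POST",
            "timeInterval": "15s",  # corresponds to the "Scrape interval" field
        },
    }


def build_request(grafana_url: str, user: str, password: str) -> urllib.request.Request:
    """Build (but do not send) the data source creation request."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return urllib.request.Request(
        url=f"{grafana_url.rstrip('/')}/api/datasources",
        data=json.dumps(build_datasource_payload()).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Basic {token}",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` would submit it. A data source with the same name may already exist in a pre-configured XIMP stack, in which case Grafana is expected to reject the request with a conflict error.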
Pre-Built Dashboard Templates
Node Exporter Full Dashboard
The Node Exporter Full dashboard (Grafana ID 1860) provides comprehensive per-host metrics, including CPU, memory, disk I/O, network throughput, and system load.
- Import via ID
- Import via JSON
Open import dialog
Navigate to Dashboards → Import. Enter dashboard ID 1860 in the Import via grafana.com field and click Load.
The Node Exporter Full dashboard appears with live data for all scraped instances.
Recommended Dashboard IDs
| Dashboard | Grafana ID | Description |
|---|---|---|
| Node Exporter Full | 1860 | Complete per-host metrics — CPU, memory, disk, network |
| Ceph Cluster | 2842 | Ceph OSD, pool, and IOPS metrics |
| Prometheus Stats | 2 | Prometheus internal metrics and scrape health |
| Alertmanager | 9578 | Alert routing and notification delivery status |
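For scripted imports, the dashboards in the table could be fetched and wrapped in an import payload. The sketch below rests on two assumptions to verify before use: the grafana.com download URL pattern, and that the payload shape matches what Grafana's `POST /api/dashboards/import` endpoint accepts. Function names are illustrative.

```python
RECOMMENDED_DASHBOARDS = {
    "Node Exporter Full": 1860,
    "Ceph Cluster": 2842,
    "Prometheus Stats": 2,
    "Alertmanager": 9578,
}


def download_url(dashboard_id: int) -> str:
    """Assumed grafana.com URL for the latest revision of a dashboard."""
    return f"https://grafana.com/api/dashboards/{dashboard_id}/revisions/latest/download"


def build_import_payload(dashboard_json: dict, datasource_name: str = "Xloud Prometheus") -> dict:
    """Wrap downloaded dashboard JSON in an import request body.

    Every data source placeholder declared under "__inputs" is mapped to the
    named Grafana data source.
    """
    inputs = [
        {
            "name": item["name"],
            "type": "datasource",
            "pluginId": item.get("pluginId", "prometheus"),
            "value": datasource_name,
        }
        for item in dashboard_json.get("__inputs", [])
        if item.get("type") == "datasource"
    ]
    return {"dashboard": dashboard_json, "overwrite": True, "inputs": inputs}
```

Reading placeholders from `__inputs` rather than hard-coding a name such as `DS_PROMETHEUS` keeps the sketch usable across dashboards that name their data source input differently.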
Custom Dashboard Configuration
Infrastructure Overview Panel
Create a summary row with stat panels for key fleet metrics:
CPU Usage Stat Panel (panel JSON fragment)
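One way such a stat panel could look is sketched here as a Python dictionary that serializes to panel JSON. The PromQL expression uses the standard node_exporter metric `node_cpu_seconds_total`. The panel title, thresholds, grid position, and data source UID are illustrative assumptions rather than values shipped with XIMP.

```python
import json


def cpu_usage_stat_panel(datasource_uid: str) -> dict:
    """Stat panel showing fleet-wide average CPU utilization in percent."""
    return {
        "type": "stat",
        "title": "Fleet CPU Usage",
        "datasource": {"type": "prometheus", "uid": datasource_uid},
        "gridPos": {"h": 4, "w": 6, "x": 0, "y": 0},
        "targets": [
            {
                "refId": "A",
                # Utilization = 100% minus the idle share, averaged over all CPUs.
                "expr": '100 - (avg(rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100)',
                "instant": True,
            }
        ],
        "fieldConfig": {
            "defaults": {
                "unit": "percent",
                "min": 0,
                "max": 100,
                "thresholds": {
                    "mode": "absolute",
                    "steps": [
                        {"color": "green", "value": None},  # base step
                        {"color": "orange", "value": 70},
                        {"color": "red", "value": 90},
                    ],
                },
            }
        },
        "options": {"reduceOptions": {"calcs": ["lastNotNull"]}, "colorMode": "value"},
    }


def panel_json(datasource_uid: str) -> str:
    """Serialize the panel for pasting into a dashboard's "panels" array."""
    return json.dumps(cpu_usage_stat_panel(datasource_uid), indent=2)
```

The data source UID can be read from the data source settings page in Grafana. The 70 and 90 percent thresholds are placeholders to tune for your fleet.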
Auto-Scaling Group Size Panel
Track the current size of Orchestration auto-scaling groups using a time series panel:
Auto-scaling group size query
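A time series panel for group size might be assembled as follows. The metric name `xloud_autoscaling_group_size` and the `group` label are hypothetical placeholders, since the exact metric exported by the Orchestration service is not given here. Substitute the names your exporter actually provides.

```python
def autoscaling_group_size_panel(
    datasource_uid: str,
    metric: str = "xloud_autoscaling_group_size",  # hypothetical metric name
    group_label: str = "group",                    # hypothetical label name
) -> dict:
    """Time series panel plotting one line per auto-scaling group."""
    expr = f"max by ({group_label}) ({metric})"
    return {
        "type": "timeseries",
        "title": "Auto-Scaling Group Size",
        "datasource": {"type": "prometheus", "uid": datasource_uid},
        "targets": [
            {
                "refId": "A",
                "expr": expr,
                # Grafana legend template, e.g. "{{group}}"
                "legendFormat": "{{" + group_label + "}}",
            }
        ],
        "fieldConfig": {
            "defaults": {
                "unit": "short",
                "decimals": 0,
                # Step interpolation suits a value that changes in whole instances.
                "custom": {"drawStyle": "line", "lineInterpolation": "stepAfter"},
            }
        },
    }
```

Step interpolation is chosen because group size changes in discrete jumps, so a stepped line shows each scale-out or scale-in event at the moment it occurs.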
Alerting in Grafana
Grafana can evaluate alert rules against Prometheus queries and route notifications independently of Alertmanager. Use this for dashboard-level alerts that notify specific teams via Slack, email, or PagerDuty.
Grafana alert rule example
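As a rough illustration, a Grafana-managed alert rule can be described as a query, a reduction, and a threshold expression. The sketch below builds such a rule as a dictionary. The field names are assumed from the alert rule provisioning payload in recent Grafana 9.x releases and should be checked against the running version. The rule title, group, folder UID, and the 90 percent threshold are illustrative.

```python
def high_cpu_alert_rule(datasource_uid: str, folder_uid: str, threshold: float = 90.0) -> dict:
    """Alert rule that fires when a host's CPU usage stays above the threshold."""
    expr_ds = "__expr__"  # assumed UID of Grafana's built-in expression engine
    return {
        "title": "High CPU usage",
        "ruleGroup": "xloud-infrastructure",  # illustrative group name
        "folderUID": folder_uid,
        "condition": "C",                     # refId whose result decides firing
        "for": "5m",                          # must hold this long before firing
        "noDataState": "NoData",
        "execErrState": "Error",
        "labels": {"severity": "critical"},
        "annotations": {"summary": "CPU usage above threshold on {{ $labels.instance }}"},
        "data": [
            {   # A: per-instance CPU utilization from Prometheus
                "refId": "A",
                "datasourceUid": datasource_uid,
                "relativeTimeRange": {"from": 600, "to": 0},
                "model": {
                    "refId": "A",
                    "expr": '100 - (avg by (instance) '
                            '(rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100)',
                },
            },
            {   # B: reduce each series to its last value
                "refId": "B",
                "datasourceUid": expr_ds,
                "relativeTimeRange": {"from": 0, "to": 0},
                "model": {"refId": "B", "type": "reduce", "expression": "A", "reducer": "last"},
            },
            {   # C: compare the reduced value with the threshold
                "refId": "C",
                "datasourceUid": expr_ds,
                "relativeTimeRange": {"from": 0, "to": 0},
                "model": {"refId": "C", "type": "math", "expression": f"$B > {threshold}"},
            },
        ],
    }
```

The `severity` label is what a notification policy would match on when routing the alert to a contact point.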
Add a contact point
Navigate to Alerting → Contact points → Add contact point.
Select the notification type (Email, Slack, PagerDuty, Webhook) and configure the destination.
Create a notification policy
Navigate to Alerting → Notification policies. Create a policy that matches your alert labels (e.g., severity=critical) and routes to the appropriate contact point.
Dashboard Variables
Use template variables to make dashboards interactive across multiple instances, projects, and availability zones:
Instance variable query
Job variable query
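Instance and job variables are commonly defined with Grafana's `label_values()` variable query function against the `up` metric. The sketch below builds a dashboard `templating` section with a `job` variable and an `instance` variable filtered by the selected job. The helper names and the data source UID are illustrative assumptions.

```python
def query_variable(name: str, query: str, datasource_uid: str) -> dict:
    """Definition of one query-type template variable."""
    return {
        "name": name,
        "label": name.capitalize(),
        "type": "query",
        "datasource": {"type": "prometheus", "uid": datasource_uid},
        "query": query,
        "refresh": 1,        # re-run the query on dashboard load
        "sort": 1,           # alphabetical, ascending
        "multi": True,       # allow selecting several values
        "includeAll": True,  # offer an "All" option
    }


def build_templating(datasource_uid: str) -> dict:
    """Templating block with a job variable and a dependent instance variable."""
    return {
        "list": [
            query_variable("job", "label_values(up, job)", datasource_uid),
            # The instance list narrows to the jobs selected above.
            query_variable("instance", 'label_values(up{job=~"$job"}, instance)', datasource_uid),
        ]
    }
```

Panels can then reference the selections in PromQL, for example `up{job=~"$job", instance=~"$instance"}`. The regex matcher `=~` is used because multi-value variables expand to an alternation.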
Next Steps
Prometheus Integration
Configure Prometheus scrape targets and alert rules that feed Grafana data sources
XIMP Monitoring
Explore the built-in XIMP monitoring stack that includes pre-configured Grafana
Auto-Scaling
Visualize auto-scaling group size changes on Grafana time series dashboards
Wazuh SIEM
Integrate Wazuh security events into Grafana for unified security dashboards