Overview
Xloud Instance HA logs every fault detection and recovery event as a notification. Notifications capture the affected host, the type of failure, the current recovery phase, and the final outcome. This page explains how to monitor live recovery workflows and review historical events to validate your Instance HA configuration.
Prerequisites
- An active Xloud account with project access
- Instance HA protection enabled and at least one segment configured
Notification Lifecycle
View Recovery Notifications
Navigate to notifications
Log in to the Xloud Dashboard (https://connect.<your-domain>) and navigate to Project → Compute → Instance HA → Notifications.
Review notification list
Each row represents one fault event. Key columns:
| Column | Description |
|---|---|
| Source Host UUID | The compute host that triggered the notification |
| Type | COMPUTE_HOST for host faults or COMPUTE_INSTANCE for guest faults |
| Generated Time | Timestamp when the fault was first detected |
| Status | Current recovery status |
| Payload | Additional fault context (expandable) |
Notification Status Reference
| Status | Meaning | Action Required |
|---|---|---|
| new | Notification received; recovery queued | None; wait for recovery to start |
| running | Recovery workflow in progress | None; monitor for completion |
| finished | All instances recovered successfully | Verify instances are ACTIVE |
| error | Recovery failed for one or more instances | Investigate and manually recover |
| ignored | Segment disabled or duplicate notification suppressed | Check segment status |
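The status table above can be turned into a small lookup for scripts that triage exported notifications. The following is a minimal sketch: the status values and actions come from the table, while the dictionary, the function names, and the record fields (`id`, `status`) are illustrative assumptions rather than an Xloud API.

```python
# Statuses and actions are copied from the status reference table.
# The dictionary, function names, and record fields ("id", "status") are
# illustrative assumptions, not part of any Xloud API.
ACTION_BY_STATUS = {
    "new": "None; wait for recovery to start",
    "running": "None; monitor for completion",
    "finished": "Verify instances are ACTIVE",
    "error": "Investigate and manually recover",
    "ignored": "Check segment status",
}


def action_for(status):
    """Return the documented follow-up action for a notification status."""
    key = status.strip().lower()
    if key not in ACTION_BY_STATUS:
        raise ValueError(f"unknown notification status: {status!r}")
    return ACTION_BY_STATUS[key]


def group_by_status(notifications):
    """Group notification IDs by normalized status, preserving input order."""
    groups = {}
    for note in notifications:
        key = note["status"].strip().lower()
        groups.setdefault(key, []).append(note["id"])
    return groups


if __name__ == "__main__":
    # Hypothetical records, e.g. copied by hand from the Notifications list.
    sample = [
        {"id": "n1", "status": "error"},
        {"id": "n2", "status": "finished"},
        {"id": "n3", "status": "error"},
    ]
    for status, ids in group_by_status(sample).items():
        print(status, ids, "->", action_for(status))
```

Raising on an unknown status, rather than returning a default, is a deliberate choice here so that a status missing from the table is noticed instead of silently ignored.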
Monitor Individual Instance Recovery
After a recovery event, verify that each affected instance has returned to ACTIVE status.
Navigate to Project → Compute → Instances. Filter instances by the affected compute host (if you know it) or check the instance status column.
Instances in a completed recovery show:
- Status: ACTIVE
- Host: Updated to the new compute host
- Power State: Running
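The three checks listed above (status, host, power state) can be applied to a set of instance records in one pass. This is a hedged sketch only: the `InstanceState` class, its field names, and the `unrecovered` function are hypothetical, and the host comparison assumes a host fault where instances are expected to move to a different compute host.

```python
from dataclasses import dataclass


@dataclass
class InstanceState:
    """Hypothetical record holding the columns shown on the Instances page."""
    name: str
    status: str
    power_state: str
    host: str


def unrecovered(instances, failed_host):
    """Return (name, reason) pairs for instances that do not look recovered.

    Each instance is reported at most once, for the first check it fails.
    """
    problems = []
    for inst in instances:
        if inst.status != "ACTIVE":
            problems.append((inst.name, f"status is {inst.status}"))
        elif inst.power_state != "Running":
            problems.append((inst.name, f"power state is {inst.power_state}"))
        elif inst.host == failed_host:
            problems.append((inst.name, "still on the failed host"))
    return problems


if __name__ == "__main__":
    # Made-up example data; host and instance names are placeholders.
    observed = [
        InstanceState("web-1", "ACTIVE", "Running", "compute-02"),
        InstanceState("db-1", "ERROR", "NOSTATE", "compute-01"),
    ]
    for name, reason in unrecovered(observed, failed_host="compute-01"):
        print(f"{name}: {reason}")
```

An empty result means every listed instance passed all three checks.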
All protected instances are ACTIVE on their new hosts after recovery completes.
Recovery History
Review past events to identify patterns and validate ongoing Instance HA effectiveness.
The Notifications list persists all historical events. Sort by Generated Time
(descending) to review the most recent recoveries. Click any event to view
the complete recovery timeline.
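If you keep your own copy of the notification history, the same review can be scripted. The sketch below sorts records by generated time (newest first) and counts events per source host to surface repeat offenders. The field names `generated_time` and `source_host_uuid` are assumptions modeled on the column names above, and timestamps are assumed to be ISO 8601 strings without a timezone suffix.

```python
from collections import Counter
from datetime import datetime


def most_recent(notifications, limit=10):
    """Return up to `limit` notifications, newest generated time first."""
    ordered = sorted(
        notifications,
        key=lambda n: datetime.fromisoformat(n["generated_time"]),
        reverse=True,
    )
    return ordered[:limit]


def events_per_host(notifications):
    """Count notifications per source host to highlight recurring faults."""
    return Counter(n["source_host_uuid"] for n in notifications)


if __name__ == "__main__":
    # Made-up history records; UUIDs are shortened placeholders.
    history = [
        {"source_host_uuid": "host-a", "generated_time": "2024-05-01T10:00:00"},
        {"source_host_uuid": "host-b", "generated_time": "2024-05-03T08:30:00"},
        {"source_host_uuid": "host-a", "generated_time": "2024-05-02T23:15:00"},
    ]
    for note in most_recent(history, limit=2):
        print(note["generated_time"], note["source_host_uuid"])
    print(events_per_host(history).most_common(1))
```

A host that appears repeatedly in the counts is a candidate for hardware or network investigation before the next fault.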
Next Steps
Troubleshooting
Resolve failed notifications and manually recover instances when automatic recovery fails.
Recovery Workflows
Understand the recovery stages, methods, and expected timelines.
Protection Segments
Verify your instances are enrolled in an active protection segment.
Instance HA Admin Guide
Administrator reference for managing segments, monitors, and recovery policies.