
Overview

Xloud Instance HA logs every fault detection and recovery event as a notification. Notifications capture the affected host, the type of failure, the current recovery phase, and the final outcome. This page explains how to monitor live recovery workflows and review historical events to validate your Instance HA configuration.
Prerequisites
  • An active Xloud account with project access
  • Instance HA protection enabled and at least one segment configured

Notification Lifecycle


View Recovery Notifications

Navigate to notifications

Log in to the Xloud Dashboard (https://connect.<your-domain>) and navigate to Project → Compute → Instance HA → Notifications.

Review notification list

Each row represents one fault event. Key columns:
| Column | Description |
|---|---|
| Source Host UUID | The compute host that triggered the notification |
| Type | COMPUTE_HOST for host faults or COMPUTE_INSTANCE for guest faults |
| Generated Time | Timestamp when the fault was first detected |
| Status | Current recovery status |
| Payload | Additional fault context (expandable) |

Open a notification

Click a notification row to view the full recovery detail, including the list of affected instances and the status of each recovery action.
A finished status confirms all protected instances have been recovered. A running status means recovery is actively in progress.

Notification Status Reference

| Status | Meaning | Action Required |
|---|---|---|
| new | Notification received; recovery queued | None; wait for recovery to start |
| running | Recovery workflow in progress | None; monitor for completion |
| finished | All instances recovered successfully | Verify instances are ACTIVE |
| error | Recovery failed for one or more instances | Investigate and manually recover |
| ignored | Segment disabled or duplicate notification suppressed | Check segment status |
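
If notification statuses are collected outside the Dashboard (for example, copied into a script or a report), the status table above can be encoded as a simple lookup. The following is a minimal sketch, not an Xloud API: the dictionary mirrors the documented table, and the names `ACTION_BY_STATUS` and `needs_operator_action` are hypothetical.

```python
# Hypothetical lookup that mirrors the documented status table.
ACTION_BY_STATUS = {
    "new": "None; wait for recovery to start",
    "running": "None; monitor for completion",
    "finished": "Verify instances are ACTIVE",
    "error": "Investigate and manually recover",
    "ignored": "Check segment status",
}


def needs_operator_action(status: str) -> bool:
    """Return True when the table lists a follow-up action for this status."""
    if status not in ACTION_BY_STATUS:
        # Fail loudly on anything the table does not document.
        raise ValueError(f"unknown notification status: {status!r}")
    return not ACTION_BY_STATUS[status].startswith("None")


for status in ("running", "error"):
    print(status, "->", ACTION_BY_STATUS[status])
```

Rejecting unknown statuses instead of silently ignoring them keeps an unexpected value from being read as "no action required".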

Monitor Individual Instance Recovery

After a recovery event, verify that each affected instance has returned to ACTIVE status.
Navigate to Project → Compute → Instances. Filter instances by the affected compute host (if you know it) or check the instance status column.
Instances in a completed recovery show:
  • Status: ACTIVE
  • Host: Updated to the new compute host
  • Power State: Running
Recovery is complete when every protected instance is ACTIVE on its new host.
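
The three checks above can also be applied to a list of instance rows exported or transcribed from the Instances page. This is a minimal sketch under stated assumptions: the `InstanceRow` fields, the helper names, and the sample hosts are all hypothetical, and the host comparison assumes a host fault in which instances move to a different compute host.

```python
from dataclasses import dataclass


@dataclass
class InstanceRow:
    """One row of the Instances list; field names are hypothetical."""
    name: str
    status: str
    host: str
    power_state: str


def is_recovered(row: InstanceRow, failed_host: str) -> bool:
    """Apply the three documented checks to a single instance."""
    return (
        row.status == "ACTIVE"
        and row.host != failed_host       # instance moved off the failed host
        and row.power_state == "Running"
    )


def unrecovered(rows: list[InstanceRow], failed_host: str) -> list[str]:
    """Return the names of instances that still need attention."""
    return [r.name for r in rows if not is_recovered(r, failed_host)]


# Illustrative data only; instance and host names are made up.
rows = [
    InstanceRow("web-1", "ACTIVE", "compute-02", "Running"),
    InstanceRow("db-1", "ERROR", "compute-01", "Shutdown"),
]
print(unrecovered(rows, failed_host="compute-01"))
```

An empty result means every listed instance passes all three checks; any name returned is a candidate for the manual recovery steps in Troubleshooting.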

Recovery History

Review past events to identify patterns and validate ongoing Instance HA effectiveness.
The Notifications list persists all historical events. Sort by Generated Time (descending) to review the most recent recoveries. Click any event to view the complete recovery timeline.
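
For pattern analysis beyond what the list view offers, the same ordering and a per-host tally can be reproduced on exported records. The sketch below assumes each event is a dictionary with hypothetical keys `source_host_uuid`, `generated_time` (ISO 8601 text), and `status`; the sample values are invented for illustration and do not come from a real deployment.

```python
from collections import Counter
from datetime import datetime

# Illustrative records only; keys and values are hypothetical.
events = [
    {"source_host_uuid": "host-a", "generated_time": "2024-05-01T10:00:00", "status": "finished"},
    {"source_host_uuid": "host-b", "generated_time": "2024-05-03T08:30:00", "status": "error"},
    {"source_host_uuid": "host-a", "generated_time": "2024-05-02T14:15:00", "status": "finished"},
]


def most_recent_first(items):
    """Sort by Generated Time, descending, as in the Notifications list."""
    return sorted(
        items,
        key=lambda e: datetime.fromisoformat(e["generated_time"]),
        reverse=True,
    )


def faults_per_host(items):
    """Count notifications per source host to surface repeat offenders."""
    return Counter(e["source_host_uuid"] for e in items)


for event in most_recent_first(events):
    print(event["generated_time"], event["source_host_uuid"], event["status"])
print(faults_per_host(events).most_common(1))
```

A host that appears repeatedly in the tally may point to an underlying hardware or network problem rather than isolated faults.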

Next Steps

Troubleshooting

Resolve failed notifications and manually recover instances when automatic recovery fails.

Recovery Workflows

Understand the recovery stages, methods, and expected timelines.

Protection Segments

Verify your instances are enrolled in an active protection segment.

Instance HA Admin Guide

Administrator reference for managing segments, monitors, and recovery policies.