Overview
Failover segments group compute hosts into logical fault domains. Each segment has its own recovery method — determining how instances are relocated when a host fails. Creating well-designed segments is the most important administrative task for Instance HA. Incorrect segment design (too few hosts, wrong recovery method) is the primary cause of failed automatic recovery.Prerequisites
- Administrator role in Xloud Identity
- Compute hosts registered and reachable
- Instance HA service deployed via XDeploy
Segment Design Principles
Match Physical Topology
Group hosts that share a failure domain — the same power circuit, network switch,
or rack. Hosts in the same fault domain should not be in the same segment.
Reserve Enough Headroom
For
auto recovery, maintain 20–30% unused capacity across all hosts in the segment.
For reserved_host, the reserved node must absorb all instances from the largest host.Separate Critical Workloads
Place SLA-critical instances in segments with
reserved_host recovery. Use auto
segments for standard workloads where recovery capacity is shared.Single-Segment Host Membership
A compute host can belong to only one segment. Plan segment boundaries before
registering hosts to avoid re-registration overhead.
Create a Failover Segment
- Dashboard
- CLI
Navigate to Instance HA administration
Log in to the Xloud Dashboard (
https://connect.<your-domain>) and navigate to
Admin → Compute → Instance HA → Segments.Create the segment
Click Create Segment and complete the form:
| Field | Description | Example |
|---|---|---|
| Name | Unique identifier | prod-zone-a |
| Recovery Method | Evacuation algorithm | auto |
| Enabled | Activate immediately | Checked |
| Description | Optional documentation note | Production AZ-A hosts |
Register Hosts in a Segment
- Dashboard
- CLI
Add host
Click Add Host and fill in:
| Field | Description |
|---|---|
| Name | Compute hostname — must match the hostname registered in the Compute service |
| Type | COMPUTE for compute nodes |
| Control Attributes | JSON object with IPMI or SSH connection parameters |
| On Maintenance | Temporarily exclude host from recovery targets |
| Reserved | Designate as a standby node for reserved_host / rh_priority methods |
Manage Segment Lifecycle
- Disable a Segment
- Place Host on Maintenance
- Delete a Segment
Temporarily disable a segment to suppress recovery during maintenance windows.Re-enable after maintenance is complete:
Disable segment
Re-enable segment
Validation
- Dashboard
- CLI
Navigate to Admin → Compute → Instance HA → Segments. Verify:
- All production segments have
Status: ENABLED - Each segment lists the expected compute hosts
- Reserved hosts are correctly flagged for
reserved_hostsegments
Segments are enabled and all compute hosts are registered.
Next Steps
Host Monitors
Configure IPMI and SSH monitors for hosts registered in your segments.
Recovery Methods
Deep-dive into recovery method selection and reserved host configuration.
Engine Configuration
Tune detection timeouts, retry intervals, and engine behaviour.
Architecture
Review the full Instance HA component architecture and deployment topology.