Skip to main content

Overview

XDR operates as a control plane layer over the underlying compute and storage infrastructure, orchestrating continuous replication and coordinated recovery across geographically separated sites. Understanding the component architecture helps administrators size deployments, plan network requirements, and diagnose failures effectively.
Prerequisites
  • Administrator credentials on both primary and DR sites
  • Familiarity with RPO/RTO concepts and distributed storage replication terminology

Component Topology


Core Components

ComponentRole
XDR ControllerCentral orchestration service — manages protection plans, tracks recovery state, and triggers failover
XDR Agent (Primary)Runs on primary site nodes; captures change streams from storage and forwards to DR agent
XDR Agent (DR)Receives replicated data, applies it to DR-site replicas, and executes the recovery runbook during failover
Replication NetworkDedicated link carrying encrypted, optionally compressed replication traffic between sites
XIMP IntegrationProvides health visibility for replication lag, RPO adherence, and site availability

Replication Pipeline

Data flows through a multi-stage pipeline from the primary site to the DR site:

Change capture

The XDR agent on the primary site intercepts write operations at the storage layer, capturing changed data blocks as a continuous stream. For application-consistent replication, the agent coordinates with in-guest agents to quiesce writes at consistent intervals.

Compression and encryption

The change stream is optionally compressed (recommended for WAN links) and encrypted using TLS 1.3 before transmission. Compression reduces bandwidth consumption by 30–60% for typical mixed workloads.

Transfer

The compressed, encrypted stream is transmitted over the replication link to the DR-site XDR agent. Bandwidth throttling applies during peak hours if configured.

Apply to replicas

The DR-site agent writes the received changes to the standby storage replicas. Compute replicas remain stopped — the data is current but the instances are not running.

Recovery point snapshot

At configurable intervals, the XDR agent creates a recovery point — a consistent snapshot of the replicated state. Recovery points define the available restore targets during failover.

Deployment Models

The most common deployment model. One site runs production workloads; the other site holds warm replicas that activate only during failover.
CharacteristicValue
Sites2 (primary + DR)
Production trafficPrimary site only
DR site resource consumptionStorage cost + agent overhead (no compute billing for stopped replicas)
Failover timeMinutes (RTO depends on workload complexity)
RPOSeconds to minutes (asynchronous) or zero (synchronous)
The DR site requires approximately the same storage capacity as the primary site. Compute resources are only consumed during an actual failover or DR test.

Control Plane Placement

The XDR controller can be co-located with the primary site or deployed on a separate management network:
PlacementConsideration
Primary siteSimpler deployment; controller unavailable if primary site fails
DR siteController survives primary site failure; manages failover independently
Dedicated management hostHighest availability; additional infrastructure required
XDeploy clusterRecommended — integrated with XDeploy’s high-availability deployment
Deploy the XDR controller in the XDeploy management cluster. XDeploy runs with redundancy across nodes, so the controller remains available even during a primary site failure event.

Network Requirements

Traffic TypeDirectionPortProtocol
Replication dataPrimary → DRTCP 7000TLS
Replication controlBidirectionalTCP 7001TLS
Agent APIController → AgentsTCP 7002HTTPS
Health checksController → PrimaryConfigurableHTTP/TCP
All ports must be open in both directions between primary and DR site networks. Use a dedicated VLAN or MPLS circuit for replication traffic to avoid impacting production workloads during initial sync or peak change rate periods.

Next Steps

Replication Configuration

Register sites and configure the replication link

Recovery Plans

Define resource groups and recovery ordering

Monitoring

Configure XIMP alerts for replication health

DR Automation

Set up automatic failover triggers and runbook scripts