> ## Documentation Index > Fetch the complete documentation index at: https://docs.xloud.tech/llms.txt > Use this file to discover all available pages before exploring further. # Notification Drivers > Configure Instance HA notification drivers — Nova driver, TaskFlow driver, and custom webhook integration. ## Overview Notification drivers are the bridge between external monitoring systems and the Instance HA recovery engine. They receive fault signals in various formats, translate them into structured Instance HA notifications, and route them to the Recovery Engine. Xloud Instance HA ships with the `NovaNotificationDriver` as the default. Custom drivers allow integration with existing monitoring infrastructure such as Prometheus Alertmanager or Nagios. *** ## Built-in Drivers | Driver | Source | Protocol | When to Use | | ------------------------ | -------------------------------------- | ------------ | ---------------------------------------------------- | | `NovaNotificationDriver` | Xloud Compute message bus | AMQP | Default — all standard deployments | | `TaskFlowDriver` | TaskFlow workflow engine | Internal RPC | Advanced workflow orchestration | | Custom webhook driver | Third-party tools (Prometheus, Nagios) | HTTP POST | Environments with existing monitoring infrastructure | *** ## NovaNotificationDriver (Default) The `NovaNotificationDriver` is enabled by default in all Xloud Instance HA deployments. It subscribes to the Xloud Compute AMQP message bus and listens for `compute.host.error` and `compute.instance.error` notification events. When a compute host enters a failure state, the Compute service publishes an error notification on the AMQP message bus. The `NovaNotificationDriver` receives this message, extracts the affected host information, and creates an Instance HA notification record to trigger the recovery workflow. This driver requires no additional configuration beyond what is provided by the standard XDeploy deployment. ```bash title="Check driver configuration" theme={null} grep -i notification_driver \ /etc/xavs/instance-ha/instance-ha.conf ``` Expected output: ``` notification_drivers = nova_notification ``` ```bash title="Confirm AMQP connectivity" theme={null} docker logs masakari_engine | grep -i "notification" ``` *** ## Webhook Notification Driver For environments that use Prometheus, Nagios, or other external monitoring tools as the primary fault detection system, Instance HA exposes an HTTP notification endpoint that accepts structured fault payloads. ### Endpoint ``` POST /v1/notifications Authorization: Bearer Content-Type: application/json ``` ### Payload Format ```json title="Host fault notification payload" theme={null} { "hostname": "compute-01.xloud.local", "type": "COMPUTE_HOST", "payload": { "event": "STOPPED", "cluster_status": "OFFLINE", "host_status": "NORMAL" } } ``` | Field | Type | Description | | ------------------------ | ------ | -------------------------------------------------------- | | `hostname` | string | The compute hostname as registered in the segment | | `type` | string | `COMPUTE_HOST`, `COMPUTE_INSTANCE`, or `COMPUTE_PROCESS` | | `payload.event` | string | `STOPPED` or `STARTED` | | `payload.cluster_status` | string | `ONLINE` or `OFFLINE` | | `payload.host_status` | string | `NORMAL` or `UNKNOWN` | ### Example: Prometheus Alertmanager Webhook Configure an Alertmanager receiver that calls the Instance HA notification endpoint: ```yaml title="alertmanager.yml — webhook receiver" theme={null} receivers: - name: "instance-ha-webhook" webhook_configs: - url: "http://:15868/v1/notifications" http_config: bearer_token: "" ``` The Instance HA API uses Xloud Identity token authentication. Generate a service token for the alertmanager integration using a dedicated service account with the `admin` role. Do not use personal user tokens in production. *** ## TaskFlowDriver The TaskFlow driver enables advanced workflow orchestration for recovery actions. It is used internally when the default recovery workflow requires multi-step sequencing with retry and rollback support. This driver operates transparently alongside the `NovaNotificationDriver` and does not require separate configuration in standard deployments. To customize the TaskFlow task pipeline, implement the `BaseTask` interface and register the plugin in the configuration. In XDeploy, navigate to **Advanced Configuration**. In the **Service Tree**, select **masakari**. Select or create `instance-ha.conf` in the Code Editor. Add the custom workflow targets: ```ini title="TaskFlow workflow in XDeploy Advanced Configuration" theme={null} [recovery_workflow_on_stop] targets = disableComputeNodeTask, PrepareHAEnabled, EvacuateHost ``` Click **Save Current File**. Navigate to **Operations** and run a **reconfigure** action. The recovery engine restarts with the updated workflow pipeline. Engine logs confirm the custom TaskFlow targets are loaded. Edit the configuration file directly and restart the engine container: ```ini title="/etc/xavs/instance-ha/instance-ha.conf" theme={null} [recovery_workflow_on_stop] targets = disableComputeNodeTask, PrepareHAEnabled, EvacuateHost ``` ```bash title="Restart recovery engine" theme={null} docker restart masakari_engine ``` *** ## Validation Navigate to **Instance-HA > Notifications (admin view)**. Simulate a notification by creating one manually (test environments only): * Click **Create Notification** (admin view) * Set type to `COMPUTE_HOST`, hostname to a registered host, event to `STOPPED` * Confirm the notification appears and transitions to `running` Notification is received, logged, and triggers the recovery workflow. ```bash title="Create a test notification (test environments only)" theme={null} openstack notification create \ --hostname compute-01.xloud.local \ --type COMPUTE_HOST \ --payload '{"event": "STOPPED", "cluster_status": "OFFLINE", "host_status": "NORMAL"}' ``` ```bash title="Monitor notification status" theme={null} openstack notification list --status new ``` Creating test notifications in production triggers real recovery workflows. Only use this in isolated test environments. *** ## Next Steps Configure how instances are evacuated after a notification triggers recovery. Configure guest-level monitoring independent of the notification driver. Tune recovery engine timing, retries, and workflow task ordering. Secure the notification API endpoint and service account credentials.