> ## Documentation Index > Fetch the complete documentation index at: https://docs.xloud.tech/llms.txt > Use this file to discover all available pages before exploring further. # Optimization Troubleshooting — User Guide > Resolve common Xloud Optimization issues — empty action plans, stuck audits, failed migrations, and cancelled action plans. ## Overview This page covers common Optimization issues encountered by operators — audits that produce empty plans, audits stuck in `ONGOING`, migrations that fail during execution, and plans that revert after completion. For platform-level issues such as Decision Engine failures or data source connectivity, see the [Admin Troubleshooting](/services/optimization/admin-guide/troubleshooting) guide. *** ## Common Issues **Cause**: The cluster is already optimally placed for the selected goal — the strategy found no hosts below the utilization threshold and no migrations are recommended. **Resolution**: Check current host utilization to confirm whether consolidation is genuinely needed: ```bash title="Check per-host utilization" theme={null} openstack hypervisor list --long ``` If all hosts show healthy, even utilization — this is expected behaviour. No action is needed. If hosts appear imbalanced but no plan was generated, the strategy threshold may be too conservative: ```bash title="Create audit with lower threshold" theme={null} watcher audit create \ --goal server_consolidation \ --parameter threshold=0.1 \ --name lower-threshold-audit ``` The default consolidation threshold is 0.2 (20%). Lowering it to 0.1 (10%) means more hosts qualify as underutilized and are included in the migration plan. **Cause**: The Decision Engine is waiting for metric data from a slow or unavailable data source (Prometheus or Telemetry). **Resolution**: ```bash title="Check audit status and duration" theme={null} watcher audit show \ -f value -c state -c created_at ``` If the audit has been `ONGOING` for more than 5 minutes, contact your administrator to check Decision Engine and data source connectivity. Your administrator can configure this through [XDeploy](/deployment). For non-telemetry goals (e.g., `server_consolidation`, `zone_migration`), audits should complete within 30–90 seconds. Longer durations indicate a data collection issue. **Cause**: A live migration failed — commonly due to insufficient memory on the target host, a CPU model incompatibility between source and destination hosts, or a storage connectivity issue. **Resolution**: ```bash title="Show failed action details" theme={null} watcher action show -f json ``` Review the `fault` field for the specific migration error. Common errors: | Error | Cause | Fix | | --------------------- | --------------------------------------------- | ------------------------------------------- | | `No valid host found` | Target host has insufficient capacity | Add compute capacity or adjust plan | | `CPU compatibility` | CPU model mismatch between hosts | Configure `cpu_mode=custom` on all hosts | | `Disk not found` | Instance uses local disk (not shared storage) | Verify instance uses shared storage backend | After resolving the root cause, create a new audit to generate a fresh plan. **Cause**: A previous action in the plan failed, causing the Applier to halt and cancel all remaining actions automatically. **Resolution**: Review the failed action to identify the root cause: ```bash title="List actions and find the failed one" theme={null} watcher action list \ --action-plan \ -f table -c uuid -c action_type -c state ``` Fix the root cause (capacity, CPU compatibility, storage), then run a new audit to generate an updated plan reflecting the current cluster state. **Cause**: Another process — the compute scheduler placing new instances, auto-scaling, or manual migrations — is placing instances back on hosts that were just emptied by the optimization. **Resolution**: Coordinate with team members performing manual migrations during optimization windows. Consider applying compute host aggregates or availability zone constraints to prevent the scheduler from re-populating hosts that were intentionally consolidated. *** ## Diagnostic Commands ```bash title="List all audits with states" theme={null} watcher audit list \ -f table -c uuid -c name -c state -c created_at ``` ```bash title="Show full audit detail" theme={null} watcher audit show -f json ``` ```bash title="List action plans with states" theme={null} watcher actionplan list \ -f table -c uuid -c state -c audit_uuid ``` ```bash title="Show individual action failures" theme={null} watcher action show -f json ``` *** ## Next Steps Create a new audit after resolving the issue. Review past audits to identify recurring patterns. Platform-level diagnostics for Decision Engine and data source failures. Verify shared storage and live migration capability for optimization actions.