> ## Documentation Index > Fetch the complete documentation index at: https://docs.xloud.tech/llms.txt > Use this file to discover all available pages before exploring further. # Object Storage Admin Troubleshooting > Diagnose and resolve platform-level Xloud Object Storage issues — 507 storage errors, proxy latency, ring inconsistencies, and slow replication. ## Overview This guide covers platform-level object storage issues that require administrator access. For user-facing issues such as 403 access errors or upload timeouts, see the [Object Storage Troubleshooting](/services/object-storage/troubleshooting) guide. **Administrator Access Required** — This operation requires the `admin` role. Contact your Xloud administrator if you do not have sufficient permissions. *** ## Diagnostic Checklist ```bash title="Overall cluster health" theme={null} xavs-storage-recon --all ``` ```bash title="Disk usage across all nodes" theme={null} xavs-storage-recon --diskusage ``` ```bash title="Replication status" theme={null} xavs-storage-recon --replication ``` ```bash title="Ring file consistency" theme={null} xavs-storage-recon --md5 ``` *** ## Platform Issues **Cause**: One or more storage nodes targeted by the ring have insufficient free space to accept the write. **Diagnosis**: ```bash title="Check disk usage per node" theme={null} xavs-storage-recon --diskusage ``` **Resolution**: * Identify nodes or drives above 85% utilization * Expand storage capacity by adding new drives (see [Ring Management](/services/object-storage/ring-management)) * Alternatively, rebalance the ring to shift weight toward nodes with available space: ```bash title="Adjust weight on high-capacity node" theme={null} xavs-ring-builder object.builder set_weight

xavs-ring-builder object.builder rebalance xavs-ring-builder object.builder write_ring ```

**Cause**: Replication traffic competing with foreground I/O, or a storage node with degraded drives experiencing high read latency. **Diagnosis**: ```bash title="Check replication load" theme={null} xavs-storage-recon --replication --verbose ``` If a specific node shows high `replication_time`, inspect that node's disk I/O: ```bash title="Check disk I/O on storage node (SSH required)" theme={null} iostat -x 1 5 ``` **Resolution**: * Consider throttling the replicator with `--concurrency 1` during peak hours * If a specific drive is degraded, reduce its ring weight to shift load away * Replace drives showing high latency or recurring I/O errors in `dmesg` **Cause**: The updated ring file was not distributed to all nodes after a rebalance. **Diagnosis**: ```bash title="Check MD5 of ring files on all nodes" theme={null} xavs-storage-recon --md5 ``` **Resolution**: Nodes with mismatched MD5 hashes have stale ring files. Redistribute the ring to affected nodes: ```bash title="Copy ring files to an affected node" theme={null} scp /etc/xavs-object-storage/*.ring.gz :/etc/xavs-object-storage/ ``` Restart the object-server and replicator on the affected node after distribution. **Cause**: Drive failure, bit-rot, or network errors during replication causing data corruption detected by the auditor. **Diagnosis**: ```bash title="Check quarantine counts" theme={null} xavs-storage-recon --quarantined --verbose ``` ```bash title="Check drive health (SSH to node)" theme={null} smartctl -a /dev/ dmesg | grep -i "error\|fail\|ata" ``` **Resolution**: 1. If the drive is failing, set its ring weight to 0 and rebalance to drain data 2. Replace the physical drive 3. Add the replacement drive to the ring and rebalance 4. The replicator will restore the quarantined objects from healthy replicas Do not simply delete quarantined objects — they may be the only remaining copy if other replicas are also corrupted. Always verify healthy replicas exist before any quarantine cleanup. **Cause**: The proxy-server cannot reach a quorum of storage nodes for an operation. **Diagnosis**: ```bash title="Check proxy container status" theme={null} docker ps --filter name=swift-proxy docker logs swift-proxy --tail 50 ``` ```bash title="Verify storage nodes are reachable from proxy" theme={null} xavs-storage-recon --all | grep -i "error\|fail" ``` **Resolution**: * Verify storage node containers are running: `docker ps --filter name=swift` * Check network connectivity from proxy hosts to storage nodes on ports 6200, 6201, 6202 * If nodes are degraded, the proxy will still serve reads from available replicas but writes require the configured replica quorum *** ## Log Locations | Component | Log Command | | ---------------- | ------------------------------------- | | Proxy server | `docker logs swift-proxy` | | Object server | `docker logs swift-object` | | Container server | `docker logs swift-container` | | Account server | `docker logs swift-account` | | Replicator | `docker logs swift-object-replicator` | *** ## Next Steps User-facing issues — 403 errors, upload timeouts, versioning failures Add capacity and redistribute data after failures Monitor and restore data durability Proactively catch issues before they become outages