Overview
This guide covers platform-level DNS issues that require administrator access. For user-facing issues such as record set errors or propagation delays, see the DNS Troubleshooting guide.Diagnostic Checklist
Before investigating individual components, run this quick health check:Check DNS service container status
Verify DNS API is responding
Check nameserver is authoritative for a zone
Platform Issues
DNS API not responding
DNS API not responding
Cause: The DNS API service container may be unavailable or crashed.Resolution:If the container exited unexpectedly, check for configuration errors in the startup
log. Restart affected containers:If the container fails to start, verify the database connection and configuration
files deployed by XDeploy.
Check DNS service containers
Review container logs
Restart DNS API container
Zone changes not propagating to nameservers
Zone changes not propagating to nameservers
Cause: The DNS worker may be unable to push updates to backend nameservers.Resolution:
- Verify the worker container is running and connected to the message queue:
Check DNS worker container
- Check for push errors in the worker logs — look for connection refused or timeout errors to pool target addresses
- Verify network connectivity from the worker host to each nameserver’s management port:
Test connectivity to nameserver target
- If a nameserver is unreachable, temporarily remove it from the pool via XDeploy while investigating
Inconsistent answers from different nameservers
Inconsistent answers from different nameservers
Cause: Zone synchronization lag — one nameserver received the update while another
did not.Diagnosis:SOA serial numbers should match. A lagging nameserver indicates a synchronization
failure.Resolution: Check the DNS worker logs for push errors to the specific nameserver’s
target configuration. If the push is failing due to authentication or connectivity,
correct the target configuration in XDeploy and redeploy.
Check zone serial on each nameserver
DNS worker backlog growing
DNS worker backlog growing
Cause: The worker cannot process zone updates as fast as they are being created —
typically due to a slow or overloaded nameserver backend.Resolution:
Check message queue depth
- If a specific nameserver backend is slow, investigate its resource usage
- Consider adding worker replicas via XDeploy to parallelize processing
- If the backlog grew due to a temporary nameserver outage, it should drain automatically once connectivity is restored
Database connection errors on startup
Database connection errors on startup
Cause: The DNS service cannot reach the database, or the database schema
is not initialized.Resolution:Verify the database service is running and that the DNS service configuration
in XDeploy has the correct connection credentials and host address.
Check DNS API logs for DB errors
Log Locations
| Component | Log Command |
|---|---|
| DNS API | docker logs dns-api |
| DNS Central | docker logs dns-central |
| DNS Worker | docker logs dns-worker |
| DNS MDns | docker logs dns-mdns |
Next Steps
DNS Troubleshooting (User)
User-facing DNS issues — zone errors, propagation, CNAME conflicts
Architecture
Understand component roles to narrow down failure scope
Backend Configuration
Verify and update pool target configuration
Pool Management
Remove or replace unhealthy nameservers from pools