Overview
Xloud Networking distributes work across multiple agents running on compute and network nodes. Monitoring agent health is a core administrative responsibility — a downed agent can silently prevent new port bindings, DHCP assignments, or routing updates. This guide covers how to inspect, manage, and recover agents across your cluster.Prerequisites
- Admin credentials sourced from
admin-openrc.sh openstackCLI installed and configured
Inspect Agent Health
- Dashboard
- CLI
Navigate to Network Agents
Log in to the Xloud Dashboard (
https://connect.<your-domain>) as an administrator
and navigate to Admin → Network → Agents.Review agent status
Each row represents one agent instance. Review the following columns:
| Column | Healthy Value | Action If Unhealthy |
|---|---|---|
| Alive | Yes (green) | Restart the service on the affected host |
| Admin State | Up | Enable via CLI: openstack network agent set --enable |
| Binary | Agent process name | Check system service logs on the host |
| Host | Fully-qualified hostname | Confirm the host is reachable on the network |
| Last Heartbeat | Recent timestamp | Investigate if stale by more than 30 seconds |
Enable and Disable Agents
Disable an agent before performing maintenance on its host to prevent the scheduler from assigning new work to it. Re-enable after maintenance completes.- CLI
Disable agent for maintenance
Re-enable agent after maintenance
Agent Type Reference
| Agent Binary | Type | Runs On | Responsibilities |
|---|---|---|---|
neutron-dhcp-agent | DHCP | Network nodes | IP assignment, DNS, host routes via DHCP |
neutron-l3-agent | L3 | Network nodes | Router NAT, floating IPs, VRRP |
neutron-openvswitch-agent | L2 (SDN) | Compute nodes | SDN-based L2 switching and port bindings |
neutron-linuxbridge-agent | Linux bridge | Compute nodes | Linux bridge-based L2 switching |
neutron-metadata-agent | Metadata | Network nodes | Instance metadata proxy |
neutron-metering-agent | Metering | Network nodes | Traffic metering for billing |
Restart an Agent
When an agent showsAlive: False, restart the service on the affected host. Agents
running inside Docker containers are managed by XDeploy:
Restart networking agents via XDeploy
Confirm agent is alive
Agent shows
Alive: True and a recent heartbeat timestamp.Next Steps
DHCP Configuration
Schedule networks to DHCP agents and configure HA for high availability
L3 Router Configuration
Configure HA and distributed routing across L3 agents
Service Architecture
Understand the distributed agent model and message bus communication
Admin Troubleshooting
Diagnose and resolve agent failures and VXLAN connectivity issues