Overview

Administrators can monitor the health and status of all Kubernetes clusters across all projects from a single view. This includes tracking cluster lifecycle states, node health, and control plane availability, and identifying clusters that need attention because they are stuck in a non-terminal state, unhealthy, or consuming unexpected resources.

Admin Cluster Overview

Navigate to Admin → Containers → Clusters to view all clusters across all projects.

| Column | Description |
| --- | --- |
| Name | Cluster identifier |
| Status | Lifecycle state: CREATE_COMPLETE, UPDATE_IN_PROGRESS, CREATE_FAILED, etc. |
| Health Status | Kubernetes-level health: HEALTHY, UNHEALTHY, UNKNOWN |
| Master Count | Number of control plane nodes |
| Node Count | Number of worker nodes |
| Project | Owning project |
| Created | Provisioning timestamp |

Filter by Status to quickly identify clusters in non-terminal states that require operator attention (e.g., CREATE_IN_PROGRESS for more than 30 minutes).

Cluster Health States

| Status | Meaning | Operator Action |
| --- | --- | --- |
| CREATE_COMPLETE | Cluster deployed and healthy | None required |
| UPDATE_COMPLETE | Last update succeeded | None required |
| CREATE_IN_PROGRESS | Provisioning in progress | Monitor; investigate if >30 min |
| UPDATE_IN_PROGRESS | Update (scale/upgrade) in progress | Monitor |
| CREATE_FAILED | Provisioning failed | Investigate status_reason, assist user |
| UPDATE_FAILED | Scale or upgrade failed | Investigate and assist user |
| DELETE_IN_PROGRESS | Cluster being deleted | Monitor |
| DELETE_FAILED | Deletion failed | Manual stack cleanup required |
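
The table above can be encoded as a small lookup for use in triage scripts. This is a minimal sketch: the function name `operator_action` and the wording of the returned strings are illustrative assumptions, and only the statuses listed in the table are covered.

```shell
# Map a cluster lifecycle status to the operator action from the table above.
# operator_action is a hypothetical helper name, not part of any CLI.
operator_action() {
  case "$1" in
    CREATE_COMPLETE|UPDATE_COMPLETE)
      echo "none required" ;;
    CREATE_IN_PROGRESS)
      echo "monitor; investigate if >30 min" ;;
    UPDATE_IN_PROGRESS|DELETE_IN_PROGRESS)
      echo "monitor" ;;
    CREATE_FAILED)
      echo "investigate status_reason, assist user" ;;
    UPDATE_FAILED)
      echo "investigate and assist user" ;;
    DELETE_FAILED)
      echo "manual stack cleanup required" ;;
    *)
      # Any status not listed in the table is reported rather than guessed at.
      echo "status not listed; check manually" ;;
  esac
}
```

For example, `operator_action DELETE_FAILED` prints `manual stack cleanup required`.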

Check Control Plane Availability

For high-availability clusters (3 master nodes), verify the control plane load balancer and all master nodes are healthy:
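Show the cluster API address:
openstack coe cluster show <cluster-name> \
  -f value -c api_address
Test API server availability:
curl -sk https://<api-address>:6443/healthz
Expected output: ok
If the api_address value already includes the scheme and port (for example, https://<ip>:6443), append /healthz to it directly rather than adding them again.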
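
The two steps above can be combined into one check. The sketch below is not a definitive implementation: the helper names `healthz_url` and `check_control_plane` are hypothetical, the 5-second timeout is an arbitrary choice, and the code assumes `api_address` is either a bare host or IP, or a full URL that already carries the scheme and port.

```shell
# Build the /healthz URL from an api_address value (hypothetical helper).
# Accepts a bare host/IP, or a full URL such as https://203.0.113.10:6443.
healthz_url() {
  local addr="${1%/}"   # drop one trailing slash, if present
  case "$addr" in
    http://*|https://*) echo "$addr/healthz" ;;
    *)                  echo "https://$addr:6443/healthz" ;;
  esac
}

# Look up the cluster API address and report whether /healthz answers "ok".
# Returns 0 when healthy, 1 when the check fails, 2 when the lookup fails.
check_control_plane() {
  local cluster="$1" addr body
  addr=$(openstack coe cluster show "$cluster" -f value -c api_address) || return 2
  body=$(curl -sk --max-time 5 "$(healthz_url "$addr")")
  if [ "$body" = "ok" ]; then
    echo "$cluster: control plane reachable"
  else
    echo "$cluster: control plane check failed" >&2
    return 1
  fi
}
```

Usage: `check_control_plane <cluster-name>`. A non-zero exit status makes the function easy to call from a loop or a scheduled job.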

Identify Unhealthy Clusters

Navigate to Admin → Containers → Clusters and sort by Health Status. Clusters with UNHEALTHY or UNKNOWN health status should be investigated and the project owner notified.
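
The same review can be scripted from CLI output. This is a sketch under assumptions: the function name `unhealthy_clusters` is hypothetical, and it expects CSV input with a header row containing columns named `name`, `project_id`, and `health_status`, which may differ in your deployment. Columns are located by header name, so their order does not matter.

```shell
# Print clusters whose health status is UNHEALTHY or UNKNOWN, sorted by project
# so that each owner's clusters appear together.
# Reads CSV with a header row on stdin; the column names are assumptions.
unhealthy_clusters() {
  awk -F',' '
    { gsub(/"/, "") }                                   # drop CSV quoting
    NR == 1 { for (i = 1; i <= NF; i++) col[$i] = i; next }  # header -> index
    $(col["health_status"]) == "UNHEALTHY" || $(col["health_status"]) == "UNKNOWN" {
      print $(col["project_id"]), $(col["name"]), $(col["health_status"])
    }
  ' | sort
}
```

Usage, assuming the list command accepts these column names: `openstack coe cluster list --all -f csv -c name -c project_id -c health_status | unhealthy_clusters`.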

Audit Inactive Clusters

Identify clusters that may have been abandoned by project teams to reclaim compute resources:
List all clusters with their creation dates:
openstack coe cluster list --all \
  -f table -c name -c project_id -c created_at -c status
Contact the project owner for clusters that have been in CREATE_COMPLETE status for an extended period without recent activity, and confirm whether they are still needed.
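
To shortlist candidates before contacting owners, the listing can be filtered by age. The sketch below rests on several assumptions: `stale_clusters` is a hypothetical helper, the input is CSV with a header row containing `name`, `project_id`, `created_at`, and `status`, and `created_at` values are ISO 8601 timestamps in a single time zone, so plain string comparison orders them correctly. Creation date is only a proxy for inactivity; it does not show whether workloads are still running.

```shell
# Print CREATE_COMPLETE clusters created before a cutoff date.
# $1: cutoff in the same ISO 8601 form as created_at, e.g. 2024-01-01
# Reads CSV with a header row on stdin; the column names are assumptions.
stale_clusters() {
  awk -F',' -v cutoff="$1" '
    { gsub(/"/, "") }                                   # drop CSV quoting
    NR == 1 { for (i = 1; i <= NF; i++) col[$i] = i; next }  # header -> index
    # Appending "" forces a string comparison of the timestamps.
    $(col["status"]) == "CREATE_COMPLETE" && ($(col["created_at"]) "") < (cutoff "") {
      print $(col["name"]), $(col["project_id"]), $(col["created_at"])
    }
  '
}
```

Usage, reusing the columns from the command above: `openstack coe cluster list --all -f csv -c name -c project_id -c created_at -c status | stale_clusters 2024-01-01`.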

Next Steps

Quotas

Manage per-project cluster limits to prevent resource exhaustion.

Troubleshooting

Diagnose failed clusters and stuck lifecycle states.

Security

Audit cluster security groups and RBAC configuration.

Certificates

Monitor and rotate cluster certificate authorities.