Overview
Administrators monitor the health and status of all Kubernetes clusters across all projects from a single view. This includes tracking cluster lifecycle states, node health, control plane availability, and identifying clusters that require attention — stuck in a non-terminal state, unhealthy, or consuming unexpected resources.Admin Cluster Overview
- Dashboard
- CLI
Navigate to Admin → Containers → Clusters to view all clusters across all projects.
| Column | Description |
|---|---|
| Name | Cluster identifier |
| Status | Lifecycle state: CREATE_COMPLETE, UPDATE_IN_PROGRESS, CREATE_FAILED, etc. |
| Health Status | Kubernetes-level health: HEALTHY, UNHEALTHY, UNKNOWN |
| Master Count | Number of control plane nodes |
| Node Count | Number of worker nodes |
| Project | Owning project |
| Created | Provisioning timestamp |
Cluster Health States
| Status | Meaning | Operator Action |
|---|---|---|
CREATE_COMPLETE | Cluster deployed and healthy | None required |
UPDATE_COMPLETE | Last update succeeded | None required |
CREATE_IN_PROGRESS | Provisioning in progress | Monitor; investigate if >30 min |
UPDATE_IN_PROGRESS | Update (scale/upgrade) in progress | Monitor |
CREATE_FAILED | Provisioning failed | Investigate status_reason, assist user |
UPDATE_FAILED | Scale or upgrade failed | Investigate and assist user |
DELETE_IN_PROGRESS | Cluster being deleted | Monitor |
DELETE_FAILED | Deletion failed | Manual stack cleanup required |
Check Control Plane Availability
For high-availability clusters (3 master nodes), verify the control plane load balancer and all master nodes are healthy:Show cluster API address
Test API server availability
ok
Identify Unhealthy Clusters
- Dashboard
- CLI
Navigate to Admin → Containers → Clusters and sort by Health Status.
Clusters with
UNHEALTHY or UNKNOWN health status should be investigated
and the project owner notified.Audit Inactive Clusters
Identify clusters that may have been abandoned by project teams to reclaim compute resources:List all clusters with creation date
CREATE_COMPLETE status for
an extended period without recent activity, and confirm whether they are still needed.
Next Steps
Quotas
Manage per-project cluster limits to prevent resource exhaustion.
Troubleshooting
Diagnose failed clusters and stuck lifecycle states.
Security
Audit cluster security groups and RBAC configuration.
Certificates
Monitor and rotate cluster certificate authorities.