Global Cluster Disaster Recovery

This page is the disaster recovery entry point for a global cluster that runs on Immutable Infrastructure. Deploy-time disaster recovery configuration is part of the global cluster installation procedure.

Deployment Procedure

Use Optional Disaster Recovery Deployment in the installation guide when you create the primary and standby global clusters.

That deployment procedure is the authoritative source for the installation-time DR configuration, including:

Primary and standby clusters use the same Kubernetes API server encryption provider configuration.
The etcd server certificate SAN list includes both the primary and standby control plane VIPs and the Platform Access Address.
Huawei DCS deployments reference the shared encryption provider Secret from DCSCluster.spec.encryptionProviderConfigRef.
VMware vSphere deployments write the same /etc/kubernetes/encryption-provider.conf file through KubeadmControlPlane.spec.kubeadmConfigSpec.files.
Huawei Cloud Stack deployments write the same /etc/kubernetes/encryption-provider.conf file through KubeadmControlPlane.spec.kubeadmConfigSpec.files.
Bare Metal deployments reference the shared encryption provider Secret from BaremetalCluster.spec.encryptionProviderConfigRef and use the same Kubernetes ServiceAccount signing key on both sides.
VMware vSphere and Huawei Cloud Stack create the dcs-import-extra-resources ConfigMap before installer import so the installation can preserve provider-specific resources. The name keeps the dcs prefix for historical installer compatibility. Huawei DCS uses the built-in provider resource migration unless extra resources must be imported.
Bare Metal creates the dcs-import-extra-resources ConfigMap before installer import so the installation can preserve bare-metal and elemental resources required by handoff.
The primary cluster installs global-etcd-sync with the standby cluster connection values after both installations succeed.

Operational Scope

After the primary and standby clusters are installed, operate DR as a separate lifecycle process. Keep the installation manifests aligned with the installation guide, then use an approved operations runbook for the following tasks:

Verify etcd-sync health and replication lag.
Verify that the standby cluster can decrypt Kubernetes Secrets created on the primary cluster.
Validate the primary and standby control plane VIPs and the platform access path before a planned failover.
Execute failover and failback with an approved operations procedure.
Reconcile provider-specific resources after a failover.

Verify consistency before uninstalling the etcd synchronization plugin

A disaster recovery switchover and a DR-aware global cluster upgrade both uninstall the etcd synchronization plugin. Before that uninstall, confirm that the standby global cluster data is consistent with the primary. On Immutable Infrastructure, workload-cluster nodes are backed by Cluster API Machine objects, so an incorrect owner-reference resolution after an inconsistent sync can delete those Machine objects and destroy the backing virtual machines. If the consistency check reports missing or surplus keys, do not uninstall the plugin; resolve the inconsistency or contact technical support first. For the detailed switchover and upgrade procedures, see Global Cluster Disaster Recovery and Upgrade the global cluster.

Bare Metal DR Model

Bare Metal DR adds host-level state that does not exist in VM-based providers. The global cluster does not create physical machines; it manages long-lived MachineInventory objects and writes plan Secrets that are consumed by elemental-system-agent on each host. Because of that, a standby global cluster must be able to accept the existing system-agent token and must receive the workload cluster inventory and plan state without overwriting its own global inventory.

Endpoint Rules

Use different system-agent endpoints for global hosts and workload-cluster hosts:

Machine type	Endpoint
Machines that belong to the active `global` cluster	`https://<current-global-control-plane-vip>/kubernetes/global`
Machines that belong to non-`global` workload clusters	`https://<platform-domain>/kubernetes/global`

Both endpoints are platform ingress paths, not direct kube-apiserver :6443 endpoints. For the bootstrap KIND phase, global host registration uses the bootstrap host endpoint, normally https://<bootstrap-host-ip>:12443. Handoff changes only the installed global hosts to the final VIP endpoint.

Do Not Reuse the Bootstrap ISO After Handoff

A SeedImage generated in the bootstrap KIND environment points at the bootstrap registration endpoint. After the global cluster is installed and handed off, create new MachineRegistration and SeedImage resources on the active global cluster for any new global hosts. A pre-failover workload ISO can be reused after failover only when its registration URL uses the platform domain and the matching non-global MachineRegistration was synchronized to standby.

For a post-handoff global host registration, set the system-agent server URL annotation to the current active global control-plane VIP. The elemental operator appends the /kubernetes/global path when it renders the system-agent endpoint. Use the primary VIP before failover and the standby VIP after failover.

apiVersion: elemental.cattle.io/v1beta1
kind: MachineRegistration
metadata:
  name: global-<purpose>
  namespace: cpaas-system
  annotations:
    baremetal.cluster.io/system-agent-server-url: https://<current-global-control-plane-vip>

Shared Token Requirements

Existing hosts keep the baremetal-system-agent token that they received before failover. For that token to work on the standby cluster:

Primary and standby must use the same /etc/kubernetes/pki/sa.key and /etc/kubernetes/pki/sa.pub.
Primary and standby must use the same service-account-issuer and api-audiences values, normally https://kubernetes.default.svc.cluster.local.
The ServiceAccount, token Secret, Role, and RoleBinding for baremetal-system-agent must be synchronized exactly to standby.
Role/baremetal-system-agent.rules[].resourceNames must contain only plan Secret names. Do not grant namespace-wide Secret access and do not add registry, bootstrap, kubeconfig, or platform credential Secrets.

Verify the permission boundary on standby before failover.

TOKEN=$(kubectl -n cpaas-system get secret baremetal-system-agent-token -o jsonpath='{.data.token}' | base64 -d)

kubectl --token="${TOKEN}" -n cpaas-system auth can-i get secret <plan-secret-name>
kubectl --token="${TOKEN}" -n cpaas-system auth can-i patch secret <plan-secret-name>
kubectl --token="${TOKEN}" -n cpaas-system auth can-i get secret global-registry-auth
kubectl --token="${TOKEN}" -n cpaas-system auth can-i list secrets

Expected result: get and patch are allowed only for plan Secrets; global-registry-auth and namespace-wide list secrets are denied.

etcd-sync Scope

Deploy etcd-sync on the standby cluster. The sync direction is one-way before failover:

source: primary global etcd
target: standby global etcd

Synchronize the workload-cluster resources that allow standby to continue reconciling existing Bare Metal clusters:

Non-global CAPI resources: Cluster, Machine, MachineDeployment, KubeadmControlPlane, bootstrap templates, and related owner resources.
Non-global Bare Metal provider resources: BaremetalCluster, BaremetalMachine, BaremetalMachineTemplate, and MachineInventoryPool.
Non-global elemental resources: MachineInventory and MachineRegistration.
Plan Secrets, bootstrap data Secrets, workload kubeconfig Secrets, and workload credential Secrets that are required by the synchronized resources.
Provider referenced Secrets that must exist on the target side, including the encryption-provider Secret referenced by BaremetalCluster.spec.encryptionProviderConfigRef when Bare Metal DR is enabled.
Exact shared auth bundle: ServiceAccount/cpaas-system/baremetal-system-agent, Secret/cpaas-system/baremetal-system-agent-token, Role/cpaas-system/baremetal-system-agent, and RoleBinding/cpaas-system/baremetal-system-agent.

Do not synchronize:

SeedImage.
Standby Cluster/cpaas-system/global.
Standby BaremetalCluster/cpaas-system/global.
Standby BaremetalMachine, MachineInventoryPool, MachineInventory, and MachineRegistration objects that belong to the standby global cluster.
Secrets from the primary cluster that are not required by workload cluster reconciliation.

Secret Encryption Boundary

Do not raw-sync every key under /registry/secrets when the primary and standby apiservers do not share the same Kubernetes encryption-provider configuration. The standby apiserver cannot decrypt ciphertext written with a different key. Sync only the required Secrets through a safe path that lets the standby apiserver store them with the standby encryption key, or use an approved exact-sync mechanism for the fixed auth bundle.

Failover Procedure

Use this sequence for a planned or declared failover:

Fence the primary global write path. Stop or isolate the controllers and platform entry points that can continue writing workload cluster resources. Do this at the owning CR or platform-resource layer where possible; scaling a Deployment alone can be reverted by its owner.
Wait until primary-to-standby etcd-sync is caught up, then freeze or stop it. Do not allow old primary data to overwrite new standby writes after failover.
If your provider uses a sync gate, clear the gate after the sync has been stopped so standby controllers do not keep requeueing because they think synchronization is still in progress.
Switch the platform domain to the standby platform entrance.

Verify DNS and ingress:

dig +short <platform-domain>
curl -kI https://<platform-domain>/kubernetes/global/version
curl -kI https://<standby-control-plane-vip>/kubernetes/global/version

A 401 response is acceptable; it proves the request reached the platform ingress.

Confirm the standby cluster has the provider components needed to manage Bare Metal workload clusters: CAPI core, kubeadm provider, Bare Metal provider, elemental-operator, cluster-transformer, and alive-related delivery components.
Confirm existing workload hosts reconnect through the platform domain and can watch or patch their plan Secrets.

Post-Failover Validation

After standby is active, validate in increasing risk order:

Existing workload cluster remains Ready.
All BaremetalMachine objects remain bound to the expected MachineInventory objects.
Plan Secrets can be updated and the MachineInventory.status.plan feedback changes.
Worker scale-out allocates an available inventory and updates Role/baremetal-system-agent.resourceNames with the new plan Secret.
Worker scale-in runs a clean plan and releases inventory as designed.
Control-plane replacement restores the desired KCP replica count and etcd membership.
A new MachineRegistration and SeedImage created on standby can register a new host.
A pre-failover workload ISO is usable only if its registration URL uses the platform domain.

Bare Metal Troubleshooting

Symptom	Likely cause	First check
Handoff target list is empty	`dcs-import-extra-resources` was missing or incomplete before the installer API call	`kubectl -n cpaas-system get cm dcs-import-extra-resources -o yaml`; check `baremetal-system-agent-handoff` Job logs
Existing hosts get `401` or `Forbidden` after failover	ServiceAccount signing key, issuer, audience, token Secret, or Role `resourceNames` does not match	`sha256sum /etc/kubernetes/pki/sa.key /etc/kubernetes/pki/sa.pub`; `kubectl auth can-i` with the token
Existing hosts fail TLS verification after DNS switch	Platform certificate chain or SAN does not cover the platform domain and both VIPs	Check `cpaas-system/dex.tls`, certificate SANs, and `curl -kI https://<vip>/kubernetes/global/version`
Standby `global` resources are replaced by primary resources	`etcd-sync` ignore rules did not exclude `global` resources	Inspect the `etcd-sync` ignore ConfigMap and stop sync before restoring standby resources
New host cannot register with an old ISO	ISO registration URL points to bootstrap or primary-only IP, or `MachineRegistration` was not synchronized	Inspect the `MachineRegistration` status and rebuild the ISO on active standby when needed

Provider Notes

Huawei DCS

VMware vSphere

Huawei Cloud Stack

Bare Metal

Follow the DCS steps in Optional Disaster Recovery Deployment. The DCS installation must keep the same encryption provider Secret and DCSCluster.spec.encryptionProviderConfigRef on both sides. Do not add the encryption provider file to KubeadmControlPlane.spec.kubeadmConfigSpec.files for DCS. DCS provider resources are migrated by the built-in flow; create dcs-import-extra-resources only when extra resources must be imported.

Follow the VMware vSphere steps in Optional Disaster Recovery Deployment. The VMware vSphere installation uses the same KubeadmControlPlane encryption file entry and etcd SAN configuration as the primary installation. No VSphereCluster encryption Secret reference is required. Create the VMware vSphere dcs-import-extra-resources ConfigMap so the installer imports vSphere infrastructure resources and the global-vsphere-credentials Secret referenced by VSphereCluster.spec.identityRef.name.

Follow the HCS steps in Optional Disaster Recovery Deployment. The HCS installation uses the same KubeadmControlPlane encryption file entry and etcd SAN configuration as the primary installation. No HCSCluster encryption Secret reference is required. Create the HCS dcs-import-extra-resources ConfigMap so the installer imports HCS infrastructure resources and the Secret referenced by HCSCluster.spec.identityRef.name.

Follow the Bare Metal steps in Optional Disaster Recovery Deployment. Use two bootstrap KIND hosts, one for primary and one for standby. Before installing standby, switch the platform domain to the standby entrance so platform-rendered URLs resolve to the side being installed.

Bare Metal requires all of the following to be aligned:

Same Kubernetes encryption provider Secret content on both sides, referenced by BaremetalCluster.spec.encryptionProviderConfigRef.
The encryption provider Secret is imported into each final global cluster through dcs-import-extra-resources; it must not exist only in the bootstrap KIND cluster.
Same ServiceAccount signing key files and same issuer/audience values on both sides.
Same trusted platform certificate chain, with SANs that cover the platform domain, primary control-plane VIP, and standby control-plane VIP.
dcs-import-extra-resources created before the installer API call on both sides.
Bare Metal provider AppRelease values set with handoffHook.controlPlaneVIP for the local side and elemental.systemAgent.authMode: shared.

Do not sync SeedImage from primary to standby. Sync non-global MachineRegistration resources when workload ISOs need to remain valid across failover. Do not sync primary global MachineRegistration resources to standby; create them locally on the active side and set baremetal.cluster.io/system-agent-server-url to that side's control-plane VIP.

After failover, keep the primary fenced until a separate failback plan is executed. Do not run primary and standby Bare Metal providers against the same workload inventories at the same time.

Global Cluster Disaster Recovery

TOC

Deployment Procedure

Operational Scope

Bare Metal DR Model

Endpoint Rules

Shared Token Requirements

etcd-sync Scope

Failover Procedure

Post-Failover Validation

Bare Metal Troubleshooting

Provider Notes

See Also

#Global Cluster Disaster Recovery

#TOC

#Deployment Procedure

#Operational Scope

#Bare Metal DR Model

#Endpoint Rules

#Shared Token Requirements

#etcd-sync Scope

#Failover Procedure

#Post-Failover Validation

#Bare Metal Troubleshooting

#Provider Notes

#See Also

Global Cluster Disaster Recovery

TOC

Deployment Procedure

Operational Scope

Bare Metal DR Model

Endpoint Rules

Shared Token Requirements

etcd-sync Scope

Failover Procedure

Post-Failover Validation

Bare Metal Troubleshooting

Provider Notes

See Also