New Nodes are Unable to Join a PMK Cluster

Problem

  • Node scaleout fails. The node gets added to the management plane and is visible in the UI under cluster but not completely joined to the cluster.
  • Below errors could be seen in the Nodeletd logs from the respective node.
Bash
Copy

Environment

  • Platform9 Managed Kubernetes - v4.3 and Higher

Cause

  • This condition could be seen when a cluster upgrade in past was not fully completed and got stuck in the middle due to some reason.
  • The below could be seen in the Qbert database.
Bash
Copy

Resolution

  • Login to the Management Plane UI and go to the clusters section. Scroll to the right to see if there's an option Continue Upgrade available for the respective cluster.
  • Click on Continue Upgrade option so that the post upgrade is completed and the Qbert gets updated with the desired version.
  • Post this operation is executed successfully, the scaled out node should be added to the cluster automatically and should be in Ready state.
  • If not, restart PMK stack on the affected node using the following steps:
Bash
Copy

Additional Information

Make sure that all the nodes that are part of the respective cluster are in Connected state for the Continue Upgrade to execute successfully.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard