Unable to Initiate Compute Driver: "Node <uuid> could not be found"

Problem

  • Nova instances in the baremetal region are stuck in "BUILD" state.
  • The baremetal controller node is observed to be in a failed state in the Platform9 dashboard and/or via Resource Manager (Resmgr) API.
  • /var/log/pf9/hostagent.log shows "In failed state until next set_config message...". Prior to this, the pf9-ostackhost and pf9-novncproxy services may show as continuously setting their service state, e.g.
Bash
Copy
  • Meanwhile, the /var/log/pf9/ostackhost.log is logging an error similar to the following.
Bash
Copy

Environment

  • Platform9 Managed OpenStack - v5.6.6
  • Ironic/Baremetal Service
  • Nova

Cause

If the baremetal node was re-provisioned, and the associated Nova instance was not deleted/cleaned up, then, this error is to be expected.

If the baremetal node was not re-provisioned at any point, please contact Platform9 Support for further analysis.

Resolution

Baremetal Node was Re-Provisioned

  1. Delete the associated Nova instance (via the Platform9 UI, or OpenStack CLI).
  2. Restart the pf9-ostackhost service on the baremetal controller node.
Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard