Nodelet Fails at "Configure and start pf9-bouncer" Phase

Problem

Platform9 nodelet phase fails at "Configure and start pf9-bouncer" phase with "bind address already in use" on bouncer logs.

Bash
Copy

Nodelet logs keep showing "Waiting for "authn_webhook_listening" to evaluate to true" message.

Bash
Copy

Environment

  • SMCP version 5.9.1.

Cause

On the underlying node stale bouncer process keeps holding port 9977.

Resolution

Verify the stale process by running the below command:

Bash
Copy

Once we get the process ID follow the procedure given below:

  1. Stop the pf9-nodeletd and pf9-hostagent service - systemctl stop pf9-hostagent pf9-nodeletd.
  2. Force stop the nodelet phases using $ /opt/pf9/nodelet/nodeletd phases stop --force .
  3. Kill the stale bouncerd process using $ kill -9 <bouncerd-process-id> .
  4. Start nodelet phase using - systemctl start pf9-hostagent.
  5. Verify the nodelet phase status using $ /opt/pf9/nodelet/nodeletd phases status .

Additional Information

The above procedure helps to recover the nodelet phase issue due to the bouncer bind address already in use error.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard