Host Aggregates Not Showing Availability Status in UI, Instead Displaying "Hosts are Loading"

Problem

The Host Aggregates of the clusters displayed the status "Hosts are loading" instead of the expected availability status in the Platform9 UI. High Availability (HA) functionality appeared to be affected.

Environment

  • Platform9 Managed OpenStack - v5.10.1 and Higher
  • Component - pf9-hamgr on the control plane

Cause

The pf9-hamgr service hit the file descriptor limit, which caused:

  • Failure to read certificate files required for operation
  • Broken connectivity to the Nova API
  • The HA Manager service becoming unresponsive
  • CURL requests to HA Manager endpoint returning 502 errors

This results in the UI being unable to retrieve and display the Host Aggregate status properly.

The following diagnosis, resolution steps involve commands that must be run on the Platform9 control plane. Please contact Platform9 Support for assistance.

Diagnosis

  • From the Management Plane, the /var/log/pf9/hamgr/hamgr.log logs showed:
Management Plane
Copy
  • The curl requests to HA reports below 502 error:
Management Plane
Copy
  • The pf9-hamgr-server process was not listening on the expected port , indicating it had stopped responding:
Management Plane
Copy
  • Too many open files by hamgr process:
Management Plane
Copy

Resolution

Restarting the HA Manager service, resolves the issue:

Management Plane
Copy

Post-restart:

  • The service resumed listening on port
  • Open file descriptors dropped to normal operating levels
  • Error logs stopped appearing

Validation

From the Management Plane:

After the restart:

  • Verified that pf9-hamgr-server is listening on port :
Management Plane
Copy
  • Check the number of open files:
Management Plane
Copy

From the UI:

In the UI, the user can verify the resolution by navigating to the Infrastructure Aggregates. The aggregates should display the status "Available" instead of "Hosts are loading".

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard