Host Reporting Offline in Clarity UI
Problem
The Clarity UI reports the host as "offline" even though it is up and running.
Environment
- Platform9 Managed OpenStack - All Versions
- Platform9 Managed Kubernetes - All Versions
Cause
One or more of the following can be the possible cause(s).
Disk Utilization is 100%
The Hostagent daemon logs its events to /var/log/pf9/hostagent.log.
In an event where the /var file system gets full, Hostagent will fail to log, thus failing to send heartbeats to the Management Plane
Check the disk usage.
Platform9 Hostagent and/or Comms Service may not be Running
- Check the status of the services.
Host Unable to Communicate With Management Plane: "ETIMEDOUT"
Stalled vgs Execution
- The pf9-hostagent daemon collects certain system utilization information like
vgs
, which at times can stall if LVM filters are not set. - Check whether the vgs process is running.
- The pf9-hostagent daemon collects certain system utilization information like
System Load Average is High
- When all the CPUs are consumed, the pf9-hostagent fails to get a CPU cycle. As a result, pf9-hostagent is unable to send a heartbeat to the Management Plane.
- Check the system load average using the
top
oruptime
command. - Check the number of CPUs on the host/node using the
lscpu
command. - If the system load average is greater than the number of CPUs, identify the source of the load. However, if the load is genuine, increase the number of CPUs.
Platform9 Hostagent Daemon is Taking Too Long to Generate A Support Bundle
- When the command to generate a support bundle is fired, it pulls all the logs from /var/log/pf9.
- If the size of the directory is greater than 2GB, it takes time to copy and compress the logs.
- As pf9-hostagent is a single-threaded process, it will neither log events to /var/log/pf9/hostagent.log nor send heartbeats to the Management Plane, until the support bundle is generated.
- Check if the process is in a 'D' (defunct) state.
Resolution
Resolve based on the respective cause(s) stated above.
Was this page helpful?