Worker Nodes in NotReady due to System OOM Encountered

Problem

Worker Nodes in NotReady due to System OOM Encountered.

Environment

  • Platform9 Managed Kubernetes - All Versions

Answer

  • System logs indicate Java invoked oom-killer task.
System Logs
Copy
  • The load average on the system at this point is also extremely high.
Muster Logs
Copy
  • The node recovered after sometime and transitioned back to Ready state. End user will need to work internally with their application teams to figure out the issue.
Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard