Calico-kube-controller Pod Restarts Frequently Due To OOM- Memory Exhaustion.

Problem

The calico-kube-controller pod is getting restarted frequently due to OOM- memory exhaustion with 137 error code:

Calico-kube-controller pod describe output during time of issue
Copy

Environment

  • Platform9 Managed Kubenetes - v5.6.8.
  • Kubernetes version 1.23.8.

Answer

This is a known issue, a jira- PMK-6180 has already been filed to track this issue and resolve it. The fix will be available in upcoming patch release.

Workaround

Modify the readiness probe timeout to 10 seconds and increase the memory limit on the pod to 2Gi.

Before modification:

Before modifying calico-kube-controller deployment
Copy

Modify the calico-kube-controllers deployment using below command:

Edit deployment calico-kube-controllers
Copy

After modification using:

After modifying calico-kube-controller deployment
Copy

Additional Information

This is known bug with JIRA ID: PMK-6180

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard