Tuning Kubelet Garbage Collection & Eviction Thresholds for Devicemapper
Problem
Kubelet does not perform garbage collection when Docker is the underlying container runtime and is configured with the Devicemapper storage driver.
Environment
- Platform9 Managed Kubernetes – All Versions
- Kubelet
- Docker
- Devicemapper
Cause
Due to an alleged discrepancy in the Kubernetes code, and based on observations made when querying the Kubelet resource metrics, it appears that Kubelet does not record image filesystem usage against the Devicemapper thin-pool; instead, it reports capacity based on the root disk. Because the root disk is typically much larger than the thin-pool, the garbage collection and eviction thresholds may never be reached.
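To observe this on an affected worker, you can query the Kubelet Summary API through the API server proxy. The command below is a verification sketch: it assumes kubectl access to the cluster and jq installed locally, and <node-name> is a placeholder for the affected worker node.
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/stats/summary" | jq '{nodeFs: .node.fs, imageFs: .node.runtime.imageFs}'
If the capacityBytes reported under imageFs matches the root disk rather than the Devicemapper thin-pool, Kubelet is sizing image garbage collection against the wrong filesystem.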
Resolution
Option A (Recommended): Switch to a Supported Storage Driver (Overlay2)
- Stop the Hostagent and Nodelet daemon services on each worker node.
systemctl stop pf9-{hostagent,nodeletd}
The node will now show as offline in the Platform9 UI, and you may receive a host-down notification.
- Issue a stop for the Nodelet phases.
sudo /opt/pf9/nodelet/nodeletd phases stop
All running pods will be drained and all running containers destroyed. Kubelet will no longer report its status, and the Docker daemon will also be brought down.
- Follow Steps #2-#4 from Configuring Docker with the overlay2 Storage Driver.
- Start the Hostagent service.
systemctl start pf9-hostagent
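Once the Hostagent is running and the node has reconverged, you can optionally confirm that Docker is now using the overlay2 driver (a quick check, not part of the documented procedure):
docker info | grep -i "storage driver"
The output should report Storage Driver: overlay2.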
Option B: Tune Kubelet Parameters for Garbage Collection & Eviction Thresholds
- Run the docker info command on the worker node and identify the Data loop file.
docker info | grep /var/lib
Data loop file: /var/lib/docker/devicemapper/devicemapper/data
Metadata loop file: /var/lib/docker/devicemapper/devicemapper/metadata
Docker Root Dir: /var/lib/docker
WARNING: the devicemapper storage-driver is deprecated, and will be removed in a future release.
WARNING: devicemapper: usage of loopback devices is strongly discouraged for production use.
Use `--storage-opt dm.thinpooldev` to specify a custom block storage device.
- Check the size of the disk/partition on which the data loop file exists and note it down.
df -h /
Filesystem Size Used Avail Use% Mounted on
/dev/mapper/centos00-root 1.4T 86G 1.3T 7% /
- Check the size of the data loop file itself and note it down as well.
ls -lh /var/lib/docker/devicemapper/devicemapper/data
-rw-------. 1 root root 100G Jul 13 11:52 /var/lib/docker/devicemapper/devicemapper/data
- Back up the current worker ConfigMap, worker-default-kubelet-config.
kubectl get configmap -n kube-system worker-default-kubelet-config -o yaml > worker-default-kubelet-config.yaml
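Should you later need to roll back to the original settings, the saved manifest can be re-applied. This is a recovery sketch: you may need to strip server-populated metadata fields such as resourceVersion from the file before applying it.
kubectl apply -f worker-default-kubelet-config.yaml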
- Edit the worker-default-kubelet-config ConfigMap and set the following parameters for Garbage Collection (GC) and Eviction Thresholds. (A worked example of how these values were derived follows the snippet.)
kubectl edit configmap -n kube-system worker-default-kubelet-config
evictionHard:
  imagefs.available: "89%"       # evictionSoft - 5
evictionSoft:
  imagefs.available: "94%"       # 100 - ((imagefs * 0.85) / rootdiskfs * 100)
evictionSoftGracePeriod:
  imagefs.available: "5m30s"
imageGCHighThresholdPercent: 4   # (100 - evictionSoft) - X
imageGCLowThresholdPercent: 1    # must be lower than imageGCHighThresholdPercent
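For reference, this is how the values above fall out of the example sizes gathered earlier (a 100G data loop file on a 1.4T root partition). The 0.85 factor and the 2-point offset below the soft threshold come from the formulas in the comments and are tuning choices rather than hard requirements; adjust them to your own disk sizes.
# imagefs = 100G (data loop file), rootdiskfs = 1400G (1.4T root partition)
# evictionSoft  imagefs.available = 100 - ((100 * 0.85) / 1400 * 100) = 100 - 6.07 ≈ 94%
# evictionHard  imagefs.available = 94 - 5 = 89%
# imageGCHighThresholdPercent     = (100 - 94) - 2 = 4
# imageGCLowThresholdPercent      = 1 (must stay below imageGCHighThresholdPercent)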
- If Kubelet does not consume the updated configuration automatically, restart the Kubelet service on the worker(s).
systemctl restart pf9-kubelet
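To confirm that the new thresholds are live, the node's effective Kubelet configuration can be read back through the API server proxy. This is a verification sketch: jq is assumed to be installed, and <node-name> is a placeholder for the worker node.
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/configz" | jq '.kubeletconfig | {evictionHard, evictionSoft, imageGCHighThresholdPercent, imageGCLowThresholdPercent}'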
Troubleshooting
Scenario: Kubelet Crashed
If Kubelet has crashed with an unexplained stack trace or error, it is likely that there was a mistake in the configuration. Take the following steps to restore the worker(s).
- Back up the Kubelet dynamic configuration directory.
tar -czvf dynamic-config-$(date +%s).tgz /var/opt/pf9/kube/kubelet-config/dynamic-config
- Recursively remove the directory.
rm -rf /var/opt/pf9/kube/kubelet-config/dynamic-config
- Restart the Kubelet service.
systemctl restart pf9-kubelet
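After the restart, confirm that Kubelet is healthy again before moving on (a quick check with standard tooling; the node name returned by kubectl get nodes is specific to your environment):
systemctl status pf9-kubelet --no-pager
kubectl get nodes
The service should be active (running) and the worker should return to the Ready state.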