Kubelet logs UnmountVolume.NewUnmounter Failure logs On vol_data.json Is Deleted
Problem
Kubelet continuously logs below logs on failing to unmount a volume from a pod and filling up on disk usage.
Error reconciler.go:193] "operationExecutor.UnmountVolume failed (controllerAttachDetachEnabled true) for volume \"filestore\" (UniqueName\"kubernetes.io/csi/csi.trident.netapp.io^pvc-<PVC_UUID>\") pod \"<POD_ID>\" (UID\"<POD_ID>\") UnmountVolume.NewUnmounter failed for volume \"filestore\" (UniqueName\"kubernetes.io/csi/csi.trident.netapp.io^pvc-<PVC_ID>\") pod \"<POD_ID>\" (UID\"<POD_ID>\") kubernetes.io/csiunmounter failed to load volume data file /var/lib/kubelet/pods/<POD_ID>/volumes/kubernetes.io~csi/pvc-<PVC_ID>/mountkubernetes.io/csifailed to open volume data file /var/lib/kubelet/pods/<POD_ID>/volumes/kubernetes.io~csi/pvc-<PVC_ID>/vol_data.jsonopen /var/lib/kubelet/pods/<POD_ID>/volumes/kubernetes.io~csi/pvc-<PVC_ID>/vol_data.jsonno such file or directory" err="UnmountVolume.NewUnmounter failed for volume \"filestore\" (UniqueName\"kubernetes.io/csi/csi.trident.netapp.io^pvc-<PVC_ID>\") pod \"<POD_ID>\" (UID\"<POD_ID>\") kubernetes.io/csiunmounter failed to load volume data file /var/lib/kubelet/pods/<POD_ID>/volumes/kubernetes.io~csi/pvc-<PVC_ID>/mountkubernetes.io/csifailed to open volume data file /var/lib/kubelet/pods/<POD_ID>/volumes/kubernetes.io~csi/pvc-<PVC_ID>/vol_data.jsonopen /var/lib/kubelet/pods/<POD_ID>/volumes/kubernetes.io~csi/pvc-<PVC_ID>/vol_data.jsonno such file or directory"Environment
- Platform9 Managed Kubernetes - v5.3 and Higher
Cause
This is a known upstream bug https://github.com/kubernetes/kubernetes/issues/101911 and https://github.com/kubernetes/kubernetes/pull/102576
Workaround
- Find the path of the pod mentioned in logs
# ll /var/lib/kubelet/pods/<POD_ID>/drwxr-x---. 3 root root 37 May 20 14:53 pluginsdrwxr-x---. 5 root root 87 May 20 14:53 volumes- Check if the above pod running in the cluster
# kubectl get pods -o custom-columns=PodName:.metadata.name,PodUID:.metadata.uid -A | egrep "<POD_ID>"#- Remove the content of those pod directories and restart
pf9-kubelet
# rmd -Rf /var/lib/kubelet/pods/<POD_ID># systemctl restart pf9-kubeletWas this page helpful?