Excessive Kubernetes Master Pod Restarts Due To ETCD Latency.
Problem
❯ kubectl get po -n kube-system k8s-master-172.17.0.14
NAME READY STATUS RESTARTS AGE
k8s-master-fe9d1e3a-4c43-417b-9720-c2a3d0732d9d000003 3/3 Running 119 27d{"log":"{\"level\":\"warn\",\"ts\":\"2023-11-16T22:23:08.031Z\",\"caller\":\"etcdserver/util.go:163\",\"msg\":\"apply request took too long\",\"took\":\"9.069436957s\",\"expected-duration\":\"100ms\",\"prefix\":\"read-only range \",\"request\":\"key:\\\"/registry/horizontalpodautoscalers/\\\" range_end:\\\"/registry/horizontalpodautoscalers0\\\" limit:10000 \",\"response\":\"\",\"error\":\"etcdserver: request timed out\"}<br>","stream":"stderr","time":"2023-11-16T22:23:08.031887945Z"}{"log":"E1116 23:58:01.449568 1 leaderelection.go:325] error retrieving resource lock kube-system/kube-controller-manager: Get \"https://localhost:443/apis/coordination.k8s.io/v1/namespaces/kube-system/leases/kube-controller-manager?timeout=5s\": net/http: request canceled (Client.Timeout exceeded while awaiting headers)<br>","stream":"stderr","time":"2023-11-16T23:58:01.450015023Z"}{"log":"E1116 23:59:06.732861 1 status.go:71] apiserver received an error that is not an metav1.Status: rpctypes.EtcdError{code:0xe, desc:\"etcdserver: request timed out\"}: etcdserver: request timed out<br>","stream":"stderr","time":"2023-11-16T23:59:06.742609594Z"}{"L":"INFO","T":"2023-11-16T17:21:19.642-0700","C":"command/command.go:120","M":"[2023-11-16 17:21:19] I1116 17:21:19.625769 3204532 request.go:1123] Response Body: {\"kind\":\"Status\",\"apiVersion\":\"v1\",\"metadata\":{},\"status\":\"Failure\",\"message\":\"etcdserver: request timed out\",\"code\":500}"}Environment
Cause
Resolution
Additional Information
PreviousError Initializing Docker: "devicemapper: Non existing device docker--vg-docker--pool"NextApplications Failing With Error "504 Gateway Timeout
Last updated
