Instance Reboot Failure as Libvirt Failed to Terminate qemu-kvm Zombie Process
Problem
265 WARNING nova.virt.libvirt.driver [req-95e44efb-e57a-4242-8af5-0fb8744f1292 root@org.com Production] [instance: f828064b-91df-4ab1-93f4-2e4d57bb38b2] Error from libvirt during destroy. Code=38 Error=Failed to terminate process 131611 with SIGKILL: Device or resource busy; attempt 3 of 3
268 ERROR nova.compute.manager [req-95e44efb-e57a-4242-8af5-0fb8744f1292 root@org.com Production] [instance: f828064b-91df-4ab1-93f4-2e4d57bb38b2] Cannot reboot instance: Failed to terminate process 131611 with SIGKILL: Device or resource busy
335 INFO nova.compute.manager [req-95e44efb-e57a-4242-8af5-0fb8744f1292 root@org.com Production] [instance: f828064b-91df-4ab1-93f4-2e4d57bb38b2] Successfully reverted task state from reboot_started_hard on failure for instance.

$ ps -ef | egrep 'PID | 131611'
UID PID PPID C STIME TTY TIME CMD
qemu 131611 1 99 2019 ? 274-21:21:49 [qemu-kvm] [defunct]

$ ps -ef | head -2
UID PID PPID C STIME TTY TIME CMD
root 1 0 0 2018 ? 15:58:57 /usr/lib/systemd/systemd --switched-root --system --deserialize 22

$ sudo journalctl -u libvirtd
Jan 19 06:55:57 org.com libvirtd[98498]: 2020-01-19 11:55:57.336+0000: 98502: error : qemuAgentSend:930 : Guest agent is not responding: Guest agent not available for now
Jan 19 06:58:12 org.com libvirtd[98498]: 2020-01-19 11:58:12.634+0000: 98499: error : virProcessKillPainfully:401 : Failed to terminate process 131611 with SIGKILL: Device or resource busy
Jan 19 06:58:27 org.com libvirtd[98498]: 2020-01-19 11:58:27.653+0000: 98501: error : virProcessKillPainfully:401 : Failed to terminate process 131611 with SIGKILL: Device or resource busy
Jan 19 06:58:42 org.com libvirtd[98498]: 2020-01-19 11:58:42.666+0000: 98504: error : virProcessKillPainfully:401 : Failed to terminate process 131611 with SIGKILL: Device or resource busy
Jan 21 08:17:31 org.com libvirtd[98498]: 2020-01-21 13:17:31.987+0000: 98501: error : virProcessKillPainfully:401 : Failed to terminate process 131611 with SIGKILL: Device or resource busy
Jan 21 08:17:31 org.com libvirtd[98498]: 2020-01-21 13:17:31.989+0000: 98500: error : qemuDomainObjBeginJobInternal:4721 : Timed out during operation: cannot acquire state change lock (held by remoteDispatchDomainMemoryStats)
Environment
Cause
Resolution
Additional Information
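The ps output in the Problem section shows PID 131611 marked [defunct] with PPID 1, meaning the qemu-kvm process has been reparented to systemd. As a rough illustration only (it is not part of the original article, and the names PsEntry and find_defunct are hypothetical), the following Python sketch scans `ps -ef` style text for defunct entries so the same check can be repeated on other compute nodes. It assumes the eight-column layout shown above (UID PID PPID C STIME TTY TIME CMD) with no spaces inside the STIME field.

```python
from typing import List, NamedTuple


class PsEntry(NamedTuple):
    """One row of `ps -ef` output (hypothetical helper type)."""
    uid: str
    pid: int
    ppid: int
    cmd: str


def find_defunct(ps_output: str) -> List[PsEntry]:
    """Return rows whose command column is marked as defunct.

    Assumes the column order UID PID PPID C STIME TTY TIME CMD.
    """
    entries = []
    for line in ps_output.splitlines():
        # Split into at most 8 fields so the command keeps its spaces.
        fields = line.split(None, 7)
        # Skip the header row and anything that is not a process row.
        if len(fields) < 8 or not fields[1].isdigit() or not fields[2].isdigit():
            continue
        cmd = fields[7]
        if "defunct" in cmd:
            entries.append(PsEntry(fields[0], int(fields[1]), int(fields[2]), cmd))
    return entries


if __name__ == "__main__":
    # Sample rows copied from the Problem section of this article.
    sample = (
        "UID PID PPID C STIME TTY TIME CMD\n"
        "qemu 131611 1 99 2019 ? 274-21:21:49 [qemu-kvm] [defunct]\n"
        "root 1 0 0 2018 ? 15:58:57 /usr/lib/systemd/systemd "
        "--switched-root --system --deserialize 22\n"
    )
    for entry in find_defunct(sample):
        print(f"defunct: pid={entry.pid} ppid={entry.ppid} cmd={entry.cmd}")
```

On a live host, the text could be obtained with something like `subprocess.run(["ps", "-ef"], capture_output=True, text=True).stdout` and passed to find_defunct. The sketch only reports defunct entries; it does not attempt to terminate them.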
