Instances Stop Randomly Due to Out of Memory Error

Problem

  • Multiple instances stop randomly.
  • We can see 'Out of memory' errors on the host.
  • We can see 'Instance is already powered off in the hypervisor when a stop is called' messages in /var/log/pf9/ostackhost.log on the host.
  • We can see the corresponding errors in /var/log/messages.log.
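
To confirm the symptom from the logs, the kernel's OOM lines can be filtered out of the system log. This is a minimal Python sketch, not a Platform9 tool: the names `find_oom_events` and `scan_log` are hypothetical, and the pattern assumes the usual kernel wording ("invoked oom-killer", "Out of memory", "Killed process"), which can differ between kernel versions.

```python
import re

# Assumed wording of kernel OOM messages; adjust if your kernel logs differ.
OOM_PATTERN = re.compile(
    r"invoked oom-killer|out of memory|killed process", re.IGNORECASE
)


def find_oom_events(lines):
    """Return the lines that look like OOM killer activity, newline stripped."""
    return [line.rstrip("\n") for line in lines if OOM_PATTERN.search(line)]


def scan_log(path):
    """Read a log file and return its OOM-related lines (hypothetical helper)."""
    # errors="replace" keeps the scan going if the log contains odd bytes.
    with open(path, errors="replace") as log:
        return find_oom_events(log)
```

For example, `scan_log("/var/log/messages.log")` returns the matching lines from the log file named in this article. Matches that name a qemu/kvm process point to an instance that the kernel killed.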

Environment

  • Platform9 Managed OpenStack - All Versions

Cause

The kernel invoked the Out Of Memory Killer (OOM Killer).

The OOM Killer is a mechanism that the Linux kernel invokes when the system is critically low on memory. It selects one or more processes and kills them to free memory.

Because the host did not have enough memory, the kernel invoked the OOM Killer, which killed the qemu/kvm processes. The instances backed by those processes stopped as a result.
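
The kernel exposes the score it uses to pick a victim in /proc/<pid>/oom_score, and processes with higher scores are more likely to be killed. Processes that hold a lot of memory, as qemu/kvm processes often do, tend to score high. The following is a minimal Python sketch for listing those scores on a host. The function name `read_oom_scores` and the `proc_root` parameter are hypothetical, added here only for illustration.

```python
import os


def read_oom_scores(proc_root="/proc"):
    """Return (score, pid, name) tuples for readable processes, highest score first."""
    results = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # only numeric directories are processes
        try:
            with open(os.path.join(proc_root, entry, "oom_score")) as f:
                score = int(f.read().strip())
            with open(os.path.join(proc_root, entry, "comm")) as f:
                name = f.read().strip()
        except (OSError, ValueError):
            continue  # the process exited or its files are not readable
        results.append((score, int(entry), name))
    return sorted(results, reverse=True)
```

Calling `read_oom_scores()[:5]` on the host shows the five processes the kernel would currently consider first.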

Resolution

Add more memory to the host, or increase swap space, if physical memory or swap is low.
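
To judge whether memory or swap is low, the values in /proc/meminfo can be compared. This is a minimal Python sketch under stated assumptions: the function names `parse_meminfo` and `memory_summary` are hypothetical, and the `MemAvailable` field is assumed to be present (older kernels do not report it). The sketch sets no threshold for "low", because that depends on the workload.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo content into {field: number} (kB for most fields)."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            info[key.strip()] = int(parts[0])
    return info


def memory_summary(info):
    """Return available-memory and free-swap percentages.

    A value is None when the field is missing or the total is zero,
    for example when no swap is configured.
    """
    def pct(free_key, total_key):
        total = info.get(total_key, 0)
        if total == 0 or free_key not in info:
            return None
        return round(100.0 * info[free_key] / total, 1)

    return {
        "mem_available_pct": pct("MemAvailable", "MemTotal"),
        "swap_free_pct": pct("SwapFree", "SwapTotal"),
    }
```

On a host, read the file and pass its content in, for example `memory_summary(parse_meminfo(open("/proc/meminfo").read()))`. A `swap_free_pct` of `None` means no swap is configured.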
