VMs Inaccessible due to Underlying Hypervisors Frequently Going Offline

Problem

The VMs are inaccessible via SSH and console as the Hypervisor connectivity is flapping frequently.

$ ssh ubuntu@<VM_IP>
Connection closed by [VM_IP] port 22

Environment

  • Private Cloud Director Virtualization - till v2025.4-497

  • Self-Hosted Private Cloud Director Virtualization - till v2025.4-497

  • Component - Networking

Cause

  • When the DHCP lease expires, the host interface gets reconfigured. During this process, interfaces that are attached to the bridge also receive IP addresses unexpectedly. Because of this, ovn-controller enters a failed state while processing its configuration updates.

  • This is a known issue and is reported internally with ID: PCD-2492.

Diagnostics

neutron logs:
pf9_app.py ERROR - pf9-neutron-ovn-controller:get_config failed: b'' Command: sudo ip route add default via [IP] dev br-phy1 metric 0 failed. 2:b'RTNETLINK answers: File exists\\n' 
session.py ERROR - Bad message, app config or reading current app config. Message : {'opcode': 'heartbeat'} Traceback (most recent call last):   
File "_opt_pf9_hostagent_lib_python3.9_site-packages_bbslave_session.py", line 625, in handle_msg     current_config = get_current_config()  
File "_opt_pf9_hostagent_lib_python3.9_site-packages_bbslave_session.py", line 262, in get_current_config     config = app_db.get_current_config()   
File "_opt_pf9_hostagent_lib_python3.9_site-packages_pf9app_app_db.py", line 80, in get_current_config     'config': app.get_config(),   
File "_opt_pf9_hostagent_lib_python3.9_site-packages_pf9app_pf9_app.py", line 311, in get_config     raise ConfigOperationError() pf9app.exceptions.ConfigOperationError

Network related files:

Resolution

The fix for the bug is available in the v2025.6 and above releases.

Workaround

Check if the interface IP is assigned to the bridge and the default route is associated with the bridge.

  • If yes, delete the default route via bondx.x.

  • If not, reboot the host and check again.

Additional Information

Last updated