GPU VM Cold Migration / Resize Fails: Insufficient Compute Resources
Problem
Environment
Diagnostics
[pci] alias = {"vendor_id": "[VENDOR_ID]", "product_id": "[PRODUCT_ID]", "device_type": "type-PF","name": "[DEVICE_NAME]", "live_migratable": "yes"}INFO nova.compute.claims [instance: [INSTANCE_UUID]] Failed to claim: Claim pci failed ERROR nova.compute.manager [instance: [INSTANCE_UUID]] Setting instance vm_state to ERROR: nova.exception.ComputeResourcesUnavailable: Insufficient compute resources: Claim pci failed.DEBUG nova.pci.stats PCI claim: Starting with 1 devices, request count: 1 DEBUG nova.pci.stats PCI claim: Request spec: [{'vendor_id': '[VENDOR_ID]', 'product_id': '[PRODUCT_ID]','dev_type': 'type-PF', 'live_migratable': 'true'}] DEBUG nova.pci.utils PCI attribute check: key=live_migratable, spec_value=true, device_value=None DEBUG nova.pci.utils PCI attribute mismatch: live_migratable spec="true" != device="None" DEBUG nova.pci.stats Dropped 1 device(s) due to mismatched PCI attribute(s) DEBUG nova.pci.stats Not enough PCI devices left to satisfy request after spec filtering DEBUG nova.pci.stats PCI claim: Available: 0, Required: 1
Cause
Resolution
Workaround
1
2
3
4
5
6
Validation
Additional Information
PreviousHow to Enable PCI Resource Class Based Placement Reporting in NovaNextGPU VM Resize and Cold Migration Fail with ReshapeFailed
Last updated
