> For the complete documentation index, see [llms.txt](https://platform9.com/kb/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://platform9.com/kb/pcd-ts/vm-deployment/failed-to-deploy-virtual-machine.md).

# Virtual Machine Deployment Issues

## Problem

This guide provides step-by-step instructions for troubleshooting and resolving issues when creating a virtual machine (VM) fails in <code class="expression">space.vars.product\_name</code>.

## Environment

* Private Cloud Director Virtualization - v2025.4 and Higher
* Self-Hosted Private Cloud Director Virtualization - v2025.4 and Higher

## Various VM deployment Methods

* Launch an instance from an Image to quickly deploy a pre-configured environment.
* Launch an instance from a New Volume to create a fresh setup with dedicated storage.
* Launch an instance from an Existing Volume to utilize previously used storage for seamless continuation.
* Launch an instance from a VM snapshot to capture current state and restore it precisely as it was.
* Launch an instance from a Volume Snapshot to ensure data integrity by reverting to a specific point in time.

## Deep Dive

The <code class="expression">space.vars.product\_name</code> VM creation process is similar for all VM deployment methods mentioned earlier, and the workflow is orchestrated primarily by the **Compute service** (Nova). This flow involves a series of steps with critical validations at each stage to ensure the request is valid, resources are available, and the VM is provisioned correctly.

{% hint style="info" %}
**NOTE**

The Deep Dive section is for the Self-Hosted Private Cloud Director customer. For the SaaS customers, refer to the **Procedure** section.
{% endhint %}

### Step 1: User Request & API Validation

This is the initial stage where the user's request is received and authenticated.

1. **User Request:** A user submits a request to create a VM (also called an instance) via the OpenStack CLI, <code class="expression">space.vars.product\_name</code> dashboard, or direct API call. Key parameters are specified, including the **image**, **flavor**, **network**, **security group**, and **key pair**.
2. **Keystone Authentication:** The request is sent to the `nova-api-osapi` Pod, which immediately validates the user's authentication token with the **Identity service** (keystone). This ensures the user is who they claim to be. The output below shows the initial VM creation request was successfully received by the Nova API and was accepted with a 202 status code.

   <pre class="language-bash" data-title="Sample Logs" data-overflow="wrap"><code class="lang-bash">$ kubectl logs deployment/nova-api-osapi -n &#x3C;WORKLOAD_REGION> | grep "POST /v2.1"
   INFO nova.osapi_compute.wsgi.server [None [REQ_ID] [USER_ID] [TENANT_ID] - - default default] [IP] "POST /v2.1/[tenant_id]/servers HTTP/1.1" status: 202 len: [.] time: [.]
   </code></pre>

{% hint style="info" %}
**IMPORTANT**

Here a unique `REQ_ID` will be generated, which will be further used for tracking the request in other component logs.
{% endhint %}

3. **Authorization & Quota Checks:** The Nova API performs two key validations:
   * **Authorization:** It verifies that the user has the necessary permissions to create a VM within the specified project.
   * **Quota Check:** It confirms the project has enough available resources (vCPUs, RAM, instances, etc.) to fulfil the request based on the chosen flavor.
4. **Initial Database Entry:** The database name is `nova`. The `nova-conductor` service is the only service that writes to the database. The other Compute services access the database through the `nova-conductor` service. If all checks pass, `nova-conductor` creates a database record for the new VM and sets its status to '**BUILDING(None)**'.

### Step 2: Scheduling & Resource Selection

After the initial validation, the request is sent to the Nova Scheduler, which decides where to place the VM.

1. **Message Queue:** The Nova API sends a message to the **Nova Scheduler** via a message queue (RabbitMQ), containing all the VM's requirements.
2. The **Nova scheduler** queries the **Placement API** to find a suitable **resource provider (compute node)** that has enough resources based on below host filters and host weighing.
3. **Host Filtering:** The `Nova-scheduler` begins by filtering out unsuitable hosts. This process checks for:
   * **Resource availability:** It ensures the host has sufficient free RAM, disk space, and vCPUs.
   * **Compatibility:** It verifies the host is compatible with the image properties and any specific requirements.
   * **Availability Zones:** It confirms the host is in the requested availability zone.
   * **Image Metadata:** It checks the image metadata if there is a specific metadata filter for the image. E.g. Images with metadata SRIOV, vTPM, etc.
   * Many more other filters.

{% hint style="info" %}
**Other filters:**

Details on Nova filters are available on [Scheduler filters](https://docs.openstack.org/nova/rocky/user/filter-scheduler.html).
{% endhint %}

4. **Host Weighing:** The remaining hosts are then ranked based on a weighting system. This can be configured to prioritise hosts with the least load or those that have been least recently used to ensure balanced resource distribution.

{% hint style="warning" %}
**Warning!**

At this stage, if the scheduler doesn’t find any suitable host to deploy an instance, it gives a **“No Valid Host Found”** error.
{% endhint %}

5. **Placement Reservation:** The `nova-scheduler` service queries `placement API`to fetch eligible compute nodes. Once a host is selected, the scheduler **makes a provisional allocation** by creating a **"claim"** via: `PUT /allocations/[VM_UUID]`. Placement API `PUT` requests will have VM allocation ID logs that look like the following:

   <pre class="language-bash" data-title="Sample Logs" data-overflow="wrap"><code class="lang-bash">$ kubectl logs deployment/placement-api -n &#x3C;WORKLOAD_REGION> | grep &#x3C;REQ_ID>
   INFO placement.requestlog [[REQ_ID] [REQ_ID] [USER_ID] [TENANT_ID] - - default default] [IP] "PUT /allocations/[VM_UUID]" status: 204 len: 0 microversion: 1.36
   </code></pre>

The `nova-scheduler` pod logs can be reviewed against the request ID captured from `nova-api-osapi` the pod. In the snippet below, VM requests will verify a suitable host for VM deployment.

{% code title="Sample Logs" overflow="wrap" %}

```bash
$ kubectl logs deployment/nova-scheduler -n <WORKLOAD_REGION> | grep <REQ_ID>
WARNING nova.scheduler.filters.aggregate_image_properties_isolation [None [REQ_ID] [USER_ID] [TENANT_ID] - - default default] Host '[HOST_UUID]' has a metadata key 'availability_zone' that is not present in the image metadata.
```

{% endcode %}

6. The **Nova-scheduler** send the database update request with the host information to **Nova-conductor** which further updates the database and sets VM status to '**BUILDING (Scheduling)**'. Then the request is passed to the **Nova-compute** service.

### Step 3: Compute & Final Service-Level Validation

The **Nova-compute** service on the selected host performs the final provisioning steps.

1. **Resource Allocation:** The **Nova Compute** service receives the scheduling decision and begins allocating resources. It interacts with:
   * **Image Service:** It requests the VM image. **Validation occurs here** as `glance-api` pod can perform a signature check to ensure the image's integrity. If an image is not available, then it errors out. Below is an example of an `GET` image request.

```dart
INFO eventlet.wsgi.server [None [REQ_ID] [USER_ID] [TENANT_ID] - - default default] 127.0.0.1 - - [...] "GET /v2/images/[IMAGE_UUID] HTTP/1.0" 200 1132 0.032435
```

{% hint style="info" %}

* **Neutron:** It requests network resources, and `neutron-server` pod **validates** that the specified network and security groups exist and are accessible to the user. It then allocates a virtual network interface and an IP address. Below example shows the IP and network interface port information. **Nova-conductor** which further updates the database and sets VM status to ‘**BUILDING(Networking)**’.
  {% endhint %}

{% code title="Sample Logs" overflow="wrap" %}

```bash
$ kubectl logs deployment/neutron-server -n <WORKLOAD_REGION>
INFO neutron.wsgi [[REQ_ID] [REQ_ID] [USER_ID] [TENANT_ID] - - default default] 127.0.0.1 "GET /v2.0/floatingips?fixed_ip_address=[VM_IP_Address]&port_id=[VM_Interface_Port_ID] HTTP/1.1" status: 200  len: [.] time: [.]
```

{% endcode %}

{% hint style="info" %}

* **Cinder (if applicable):** If a persistent boot volume is requested, **Cinder validates** that the volume is available and attaches it to the VM. **Nova-conductor** which further updates the database and sets VM status to ‘**BUILDING(Block\_Device\_Mapping)**’.
  {% endhint %}

2\. **Hypervisor Instruction:** Once all resources are confirmed, `nova-compute` instructs the `pf9-ostackhost` service on the hypervisor (Libvirtd KVM) to create the VM using the image, flavor, and other parameters. The VM then boots. The `pf9-ostackhost` logs looks like the below, which outlines the details like claim successful, device path, network information, time elapsed to spawn an instance, etc.

{% code title="Sample Logs of pf9-ostackhost.log:" overflow="wrap" %}

```bash
INFO nova.compute.claims [[REQ_ID] [USERNAME] service] [instance: [VM_UUID]] Claim successful on node [SELECTED_NODE_NAME]
..
INFO os_vif [[REQ_ID] [USERNAME] service] Successfully plugged vif VIFOpenVSwitch(active=False,address=[MAC_ADDRESS],bridge_name='br-int',has_traffic_filtering=True,id=[INTERFACE_ID],network=Network([NETWORK_ID]),plugin='ovs',port_profile=VIFPortProfileOpenVSwitch,preserve_on_delete=False,vif_name='[VM_TAP_INTERFACE]')
..
INFO nova.compute.manager [[REQ_ID] [USERNAME] service] [instance: [VM_UUID]] Took 3.74 seconds to spawn the instance on the hypervisor.
```

{% endcode %}

### Step 4: VM Configuration & Finalization

The final step involves configuring the guest OS and updating the status.

1. **Cloud-init:** As the VM boots, **Cloud-init** runs with `169.254.169.254` IP address and retrieves metadata from Nova. The cloud-init logs are available within the VM. It performs validations on this metadata before:
   * Injecting the SSH key.
   * Configuring networking and the hostname.
   * Executing any custom user data scripts.
2. **Status Update:** The `nova-compute` service updates the VM's status in the database to '**ACTIVE**', indicating a successful creation. The VM is now ready for the user to access.

## Procedure

### 1. Get the VM status

Use the PCD UI or CLI to check the error message.

{% hint style="info" %}
**Note**

OpenStack CLI references virtual machines as '`server`'. &#x20;

Look for `status` and `fault` fields to understand the issue.
{% endhint %}

{% code overflow="wrap" %}

```bash
$ openstack server show <VM_UUID>
```

{% endcode %}

### 2. Validate Compute Service Status

Get the Compute Service state and ensure it is `up` and status is `enabled`.

{% code overflow="wrap" %}

```bash
$ openstack compute service list
```

{% endcode %}

### 3. Trace the VM Events:

Retrieve the Request ID i.e `REQ_ID` from the server event list details, which uniquely identifies the request. This `REQ_ID` is displayed in the first column of the server events list command and helps track request failures.&#x20;

{% hint style="warning" %}
**Note:**

This `REQ_ID` is crucial for troubleshooting the VM creation issues.
{% endhint %}

{% code overflow="wrap" %}

```bash
$ openstack server event list <VM_UUID>
$ openstack server event show <VM_UUID> <REQ_ID>
```

{% endcode %}

### 4. Review the Pods and its logs on the Management plane

{% hint style="info" %}
**NOTE**

Step 4 is applicable only for the Self-Hosted Private Cloud Director users.
{% endhint %}

Management plane have Pods like `Nova-api-osapi`, `Nova-scheduler` and `Nova-conductor`**.** Review all these pods:

* Check if they are in "`CrashLoopBackOff/OOMkilled/Pending/Error/Init`" state.
* Also, verify if all containers in the pods are Running.
* See the events section in pod describe output.
* Review pods logs using `REQ_ID` or `VM_UUID` for relevant details.

  <pre class="language-bash" data-overflow="wrap"><code class="lang-bash">$ kubectl get pods -o wide -n &#x3C;WORKLOAD_REGION> | grep -i "nova"

  $ kubectl describe -n &#x3C;WORKLOAD_REGION> &#x3C;NOVA_API_OSAPI_POD>
  $ kubectl describe -n &#x3C;WORKLOAD_REGION> &#x3C;NOVA_SCHEDULER_POD>
  $ kubectl describe -n &#x3C;WORKLOAD_REGION> &#x3C;NOVA_CONDUCTOR_POD>

  $ kubectl logs -n &#x3C;WORKLOAD_REGION> &#x3C;NOVA_API_OSAPI_POD>
  $ kubectl logs -n &#x3C;WORKLOAD_REGION> &#x3C;NOVA_SCHEDULER_POD>
  $ kubectl logs -n &#x3C;WORKLOAD_REGION> &#x3C;NOVA_CONDUCTOR_POD>
  </code></pre>

### 5. Validate the Image and Flavor.

Check if the image is available and not corrupted. Resource requested in flavor is available on the underlying hosts.

{% code overflow="wrap" %}

```bash
$ openstack image show <IMAGE_ID>
$ openstack flavor show <FLAVOR_ID>
$ openstack hypervisor stats show
$ openstack hypervisor show <HYPERVISOR_NAME>
```

{% endcode %}

### 6. Validate the service status on the affected VM's underlying hypervisor

Validate if services listed below are running on the Underlying Hypervisor

{% code title="Host" overflow="wrap" %}

```bash
$ sudo systemctl status pf9-hostagent
$ sudo systemctl status pf9-ostackhost 
$ sudo systemctl status pf9-cindervolume-base
$ sudo systemctl status pf9-neutron-ovn-metadata-agent
```

{% endcode %}

### 7. Check the logs on the affected VM's hypervisor

**Compute Host**: `Ostackhost` logs are responsible for provisioning the compute resource required by the VM. Review the latest logs and search for `REQ_ID` or `VM_UUID`.

{% code title="Host" overflow="wrap" %}

```bash
$ less /var/log/pf9/ostackhost.log
```

{% endcode %}

**Storage Host**: `cindervolume-base` logs are responsible for provisioning the storage resources required by the VM. Review the latest logs and search for `REQ_ID` or `VM_UUID`.

{% code title="Host" overflow="wrap" %}

```bash
$ less /var/log/pf9/cindervolume-base.log
```

{% endcode %}

**Network Host**: `pf9-neutron-ovn-metadata-agent` logs are responsible for provisioning the connectivity and networking resources required by the VM. Review the latest logs and search for `REQ_ID` or `VM_UUID`.

{% code title="Host" overflow="wrap" %}

```bash
$ less /var/log/pf9/pf9-neutron-ovn-metadata-agent.log
```

{% endcode %}

8. If these steps prove insufficient to resolve the issue, kindly reach out to the [Platform9 Support Team](https://support.platform9.com/) for additional assistance.

## Most common causes

* Insufficient resources (CPU, RAM, storage).
* Incorrect network configurations or security groups.
* Unavailable or corrupted images.
* Issues with the scheduler or compute nodes.
* Permission or quota restrictions.
* Virtualised cluster mismatches.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://platform9.com/kb/pcd-ts/vm-deployment/failed-to-deploy-virtual-machine.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.