By William Parks, Luke Congdon, Aashica Amrith
The promise of AI is massive, but the infrastructure required to power it often forces a difficult choice: do you prioritize the raw speed of bare-metal hardware or the security and management benefits of a virtualized environment? For too long, organizations have been forced to choose. Now, Nutanix is addressing this trade-off, delivering the high-performance AI compute you need without sacrificing the control and security your enterprise demands.
How do we do this? We power GPU-dense platforms, such as NVIDIA HGX systems, with the Nutanix AHV hypervisor, leveraging Topology-Aware Scheduling together with AI-optimized, System-Defined VM profiles. Topology-aware scheduling improves AI workload performance by ensuring that virtualized resources (vCPUs, memory) are physically aligned with the hardware accelerators (GPUs, NICs) they use. AI-optimized VM profiles improve performance by automating complex, hardware-specific tuning that would otherwise require deep manual expertise.
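To make the placement idea concrete, here is a minimal sketch of NUMA-aligned scheduling. This is illustrative only, not Nutanix's actual scheduler: the host names, data structures, and `place` function are assumptions invented for the example. The point it shows is that keeping a VM's vCPUs, memory, and GPUs on the same NUMA node avoids data crossing the socket interconnect on its way to the accelerator.

```python
# Illustrative sketch (not the AHV scheduler): prefer placements where
# the requested vCPUs, memory, and GPUs all fit on a single NUMA node,
# so GPU DMA traffic never crosses the socket interconnect.
from dataclasses import dataclass

@dataclass
class NumaNode:
    node_id: int
    free_vcpus: int
    free_mem_gb: int
    free_gpus: int

@dataclass
class Host:
    name: str
    numa_nodes: list

def place(hosts, vcpus, mem_gb, gpus):
    """Return (host, numa_node) where the whole request fits on one NUMA node."""
    for host in hosts:
        for node in host.numa_nodes:
            if (node.free_vcpus >= vcpus
                    and node.free_mem_gb >= mem_gb
                    and node.free_gpus >= gpus):
                return host.name, node.node_id
    return None  # no aligned placement; a real scheduler would fall back

# Hypothetical inventory: two hosts, two NUMA nodes each.
hosts = [
    Host("hgx-node-1", [NumaNode(0, 16, 256, 0), NumaNode(1, 64, 512, 4)]),
    Host("hgx-node-2", [NumaNode(0, 64, 512, 4), NumaNode(1, 64, 512, 4)]),
]
print(place(hosts, vcpus=32, mem_gb=384, gpus=2))  # -> ('hgx-node-1', 1)
```

The first host's node 0 has the vCPUs' neighbor GPUs exhausted, so the request lands on node 1, where compute, memory, and GPUs are co-located.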
The integration between Nutanix AHV, Nutanix Flow, and NKP allows platform engineers to automate the deployment of AI workloads on complex, GPU-dense passthrough servers, using these AI-optimized, profile-based virtual machines as GPU-dense Kubernetes worker nodes.
Capable of operating in fully air-gapped FIPS-compliant environments, this solution is designed to deliver the multitenancy and security of virtualization. It also provides the throughput and performance required for massive LLM training, transforming the Nutanix Cloud Platform into a foundational engine for Enterprise Sovereign AI.
We’ve optimized Nutanix AHV so the hypervisor delivers highly performant placement of VMs for your AI workloads, and its integration with NKP lets platform engineers automate the deployment of complex, GPU-dense environments with ease.
Agentic AI demands a high-performance, zero-trust foundation that maximizes throughput and secures data flow across the AI Factory.
| Feature | Impact on AI Workloads |
|---|---|
| Traffic Segmentation | Nutanix Flow segments traffic to maximize throughput, dedicating InfiniBand/Ethernet to GPU training while BlueField-3 DPUs handle data ingest. |
| GPU Direct RDMA | Allows GPUs across different hosts to exchange data directly over the network, bypassing the host CPU to significantly reduce latency. |
| Infrastructure Offload | Offloading the network dataplane to BlueField-3 DPUs frees host CPU and memory for AI processing, delivering higher performance. |
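A back-of-envelope model illustrates the GPU Direct RDMA row above. The bandwidth figures are assumptions for illustration (roughly PCIe Gen4 x16 and a 400 Gb/s InfiniBand link), not measured values: a staged transfer crosses PCIe twice (GPU to host RAM, host RAM to NIC) before hitting the wire, while GPUDirect moves data from GPU to NIC in a single DMA pass.

```python
# Illustrative model only: assumed link speeds, serialized hops,
# no overlap or protocol overhead. Shows why skipping the host-memory
# bounce buffer shortens the end-to-end transfer.
def transfer_time_s(size_gb, hops_gbps):
    """Total time when the payload traverses each hop in sequence."""
    return sum(size_gb / bw for bw in hops_gbps)

SIZE_GB = 4        # e.g. one gradient bucket (assumed)
PCIE_GBPS = 32     # ~PCIe Gen4 x16 effective bandwidth (assumed)
NIC_GBPS = 50      # ~400 Gb/s InfiniBand in GB/s (assumed)

staged = transfer_time_s(SIZE_GB, [PCIE_GBPS, PCIE_GBPS, NIC_GBPS])  # GPU->RAM->NIC->wire
direct = transfer_time_s(SIZE_GB, [PCIE_GBPS, NIC_GBPS])             # GPU->NIC->wire

print(f"staged copy: {staged*1e3:.0f} ms, GPUDirect: {direct*1e3:.0f} ms")
# staged copy: 330 ms, GPUDirect: 205 ms
```

Even this naive serialized model shows the direct path saving an entire PCIe traversal per transfer; in practice the saving compounds across every collective operation in a training step.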
Nutanix Agentic AI delivers the foundational platform for AI. At its core, Nutanix AHV delivers high performance by leveraging Topology-Aware Scheduling and 8-GPU passthrough, so massive models get the raw compute they need without sacrificing the strict multi-tenancy, security, and lifecycle governance of a virtualized environment.
Working in tandem with AHV to address bottlenecks, Nutanix Flow provides advanced software-defined networking and security. Flow intelligently segments network traffic into high-performance lanes, dedicating InfiniBand (via ConnectX-7) strictly to East-West GPU-to-GPU training communication while routing North-South data ingest through NVIDIA BlueField-3 DPUs. Combined with NKP's orchestration of AI workloads, organizations building Enterprise AI Factories or Sovereign Neoclouds no longer need to compromise between bare-metal speed and secure control.
With this seamlessly integrated stack, platform engineering teams are fully equipped to deploy, secure, and scale the next generation of intelligent, compliant applications without sacrificing performance.
©2026 Nutanix, Inc. All rights reserved. Nutanix, the Nutanix logo and all Nutanix product and service names mentioned are registered trademarks or trademarks of Nutanix, Inc. in the United States and other countries. All other brand names mentioned are for identification purposes only and may be the trademarks of their respective holder(s).