What is the Nutanix Agentic AI solution?

The Nutanix Agentic AI solution abstracts complexity and creates a seamless bridge from the agentic AI builders to the AI factory operators. This full-stack solution offers a cloud operating model for AI factory operators by simplifying operations, maximizing performance and security, and optimizing token costs. At the same time, it enables the agentic AI builders to focus on innovation, model management, and rapid inference scaling.

What is the core challenge the Nutanix Agentic AI solution solves?

Agentic AI builders face a high degree of "innovation friction" as they navigate a fragmented landscape of models, tools, and data silos instead of focusing on building intelligence. Developers lack a unified, secure path to leverage diverse LLMs and open-source tools to rapidly evolving applications from simple chat interfaces into sophisticated agentic AI capable of driving real business outcomes. For AI factory operators, the biggest challenge is delivering business value measured in terms of time to tokens and cost per token due to operations complexity in AI factories such as: Complexity in managing diverse and rapidly evolving AI hardware (GPUs, networking, storage), Complexity of providing shared access to critical AI infrastructure while ensuring secure access to model and data, and complying with sovereignty requirements Complexity of consistently delivering maximum performance, while optimizing resource utilization across the full AI factory. Complexity of managing the lifecycle of fragmented, bespoke point solutions supporting AI factory operations

What is the "Cloud Operating Model" for Agentic AI?

The Cloud Operating Model is Nutanix’s approach to bridging the gap between AI developers and infrastructure teams. Instead of managing fragmented point solutions or complex bare-metal clusters, this model provides a unified, full-stack environment. It allows operators to govern AI infrastructure (GPUs, DPUs, and storage) with the same ease as a cloud service, while giving developers instant, secure access to the tools and models they need to scale thousands of intelligent agents.

How does Nutanix help reduce the "cost per token"?

Nutanix optimizes token economics through several integrated efficiencies: Topology-Aware Optimization: The AHV hypervisor automatically places workloads across GPU-dense servers to maximize hardware alignment. Resource Offloading: Using DPUs (Data Processing Units) to handle networking and security tasks frees up GPU cycles specifically for inference. Smart Storage: Nutanix Unified Storage provides a high-capacity tier for KV Cache offloading, which saves expensive GPU memory and allows for larger context windows without a performance penalty.

Why does Nutanix recommend virtual machines (VMs) over bare-metal Kubernetes for AI?

While bare-metal was the standard for initial model training, it often lacks the security and isolation required for scaling agents in an enterprise. Nutanix uses VM-based Kubernetes infrastructure to provide: Superior Isolation: Stronger multi-tenancy and security boundaries between different AI workloads. Management at Scale: Easier lifecycle management and resource allocation. Bare-Metal Performance: By leveraging DPU acceleration and topology awareness, Nutanix delivers the speed of bare metal with the governance of a virtualized environment.

What is the Nutanix Enterprise AI (NAI) Gateway?

The NAI Gateway acts as a secure "front door" for all AI models. It provides a unified inference endpoint that allows enterprises to manage cloud-hosted models and private LLMs in one place. Key features include: Governance: Token-based rate limiting to prevent "bill shock." Observability: Full visibility into who is consuming resources and how. Connectivity: Support for the Model Context Protocol (MCP), which allows agents to securely connect to private enterprise data and tools.

How does this solution accelerate the work of Agentic AI builders?

The solution reduces "innovation friction" by providing a developer-centric environment where they can bypass infrastructure setup. Through the Nutanix Kubernetes Platform (NKP), builders gain access to a rich AI catalog including: Pre-built open-source tools (Notebooks, Vector Databases, MLOps engines). Instant deployment of NVIDIA NIMs and the NVIDIA Nemotron family of models. 1-click secure inference endpoints and turnkey access to fine-tuning services.

How does Nutanix Unified Storage (NUS) support AI and next-generation applications?

Nutanix Unified Storage provides a scalable, high-performance data platform purpose-built for modern workloads like AI and next-gen apps. Key capabilities include: Ultra-fast read throughput and dense all-NVMe capacity to handle massive datasets for AI pipelines, including Inferencing and Retrieval-Augmented Generation (RAG) . Integration with Nutanix Kubernetes Platform , enabling seamless deployment of containerized AI/ML pipelines and cloud-native applications. Multi-protocol data access, simplifying storage for diverse workloads and accelerating innovation.

代理型 AI

您的 AI 工廠需要真正有效的基礎架構

大多數 AI 平台都承諾提供大規模 AI 服務，但實際上卻帶來複雜性。Nutanix Agentic AI 是一款全堆疊軟體解決方案，提供雲端作業模型，協助組織建立、運作和管理 AI 工廠。透過與 NVIDIA 加速運算生態系統的整合，該解決方案可簡化運作、達到最高效能和安全性，並最佳化 GPU 利用率和詞元成本。

閱讀部落格文章

適用於 AI 工廠的雲端作業模型

Nutanix 提供專為在 AI 工廠上執行的 AI 同事時代所設計的雲端作業模式。透過抽象化複雜性，並幫助 IT 決策者平衡效能、安全性和成本，Nutanix Agentic AI 解決方案不僅簡化營運；更從根本上最佳化 AI 的經濟效益。

將每個詞元的成本降到最低

全堆疊解決方案提供智慧路由、推論擴展、拓撲感知資源配置和最佳化的 GPU 消耗，進而降低每個詞元的成本。

確保企業級的安全與控制

Nutanix Enterprise AI 為您的大型語言模型端點提供安全部署和企業控制，同時對 Nutanix AHV（虛擬化）和 Nutanix Flow（網路和安全）的增強功能，帶來更優越的隔離和安全性。

加速開發人員的效率

不會造成基礎架構延遲，即可從概念階段快速過渡至生產環境。Nutanix Enterprise AI 透過提供智慧模型路由和一鍵式安全推論端點，以及安全且統包的 Model Context Protocol 伺服器存取，讓 AI 工具在使用上更加無縫。Nutanix Kubernetes 平台為代理型 AI 和開發人員提供快速入門環境，其中包含豐富的 AI 服務目錄和內建的私人資料存取權限。

全面落實代理型 AI

Nutanix Agentic AI 解決方案專為與 NVIDIA 認證的 AI 工廠無縫整合與相輔相成而設計，憑藉深厚的合作夥伴關係，提供包括 Cisco、Dell 和 Supermicro 等主要 OEM 硬體製造商的完整解決方案。

關鍵整合元件

AI 服務和 Kubernetes 平台

這個以開發人員為中心的雲端原生環境，可讓團隊繞過基礎架構的設定，立即擴充生產環境等級的代理型 AI 應用程式，實現可預測的詞元經濟效益。

基礎架構最佳化與安全

透過虛擬機器形式協調加速運算的強大力量，提供最高的效能和安全性，進而降低每個詞元的成本。

適用於 AI 的基礎資料服務

提供高效能資料結構，透過持續且由 GPU 加速的轉換，直接於儲存叢集內橋接訓練和推論。

AI 服務和 Kubernetes 平台

先進的 AI Gateway 和推論服務

統一、安全的推論端點可讓企業在使用私有大型語言模型的同時，使用雲端託管的模型（和詞元額度），並提供一致的驗證、可觀測性和基於詞元的速率限制。

模型上下文協定支援與微調

Nutanix Enterprise AI 擴展其現有強大的模型即服務（MaaS）功能，使代理能夠安全地連接企業工具與資料來源。

開放式 Kubernetes 平台搭載豐富的 AI 目錄

使用預先驗證的開放原始碼 AI 服務目錄，包括筆記本、向量資料庫和 MLOps 引擎，將 Agentic 應用程式從概念階段快速部署至生產階段，而不會造成基礎架構延遲。該解決方案與 NVIDIA AI Enterprise 原生整合，讓開發人員可立即部署 NVIDIA NIM（包括 Nemotron），加速生產開發中的高效能 AI 應用程式。

基礎架構最佳化與安全

拓樸感知最佳化

Nutanix AHV 虛擬機器監控程式透過自動最佳化 GPU 密集伺服器的工作負載配置，確保硬體嚴格一致，無需手動調整基礎架構，實現最高效能、安全性與資源利用率。

DPU 加速的零信任網路

利用 Nutanix Flow 與全新的 DPU 卸載功能，提供裸機的原始速度，兼具虛擬化環境的精密隔離功能，搭配最大化輸送量的高效能零信任網路基礎，同時確保資料在 AI 工廠間的安全且可靠地流動。

實體隔離生命週期管理

該解決方案支援整個 NKP 平台、NVIDIA GPU 和網路營運商的完全離線安裝，可讓高度管制或國防部門環境自動化驅動程式更新和網路最佳化，且無需暴露叢集於網際網路。

適用於 AI 的基礎資料服務

線性可擴展性

作為 NVIDIA 企業級認證的 AI 資料平台，Nutanix 統一儲存可以在數以千計的 GPU 用戶端上提供高速讀寫效能，確保資料可用性和運算速度同步擴展。

進階輸送量

透過利用 NFS over RDMA 和即將推出的 S3 over RDMA 提供低延遲的資料路徑，確保 GPU 永遠不會「資料匱乏」。

成本最佳化

透過提供高容量層級的 KV 快取卸載，降低每個詞元的總成本，並釋放關鍵 GPU 記憶體，讓您能處理更大範圍的上下文窗口和更多的並行使用者，且不會影響效能。

Nutanix 備受以下企業的信賴

案例研究

坎培拉大學

「我們正在使用 Nutanix 調整我們的 IT，以支援全校（包括我們的研究中心）的 AI 和機器學習。它也有助於遠端提供學生和教師所需的應用程式。」

— 坎培拉大學資訊長 Matt Carmichael

Nutanix 雲端基礎架構(NCI, Nutanix Cloud Infrastructure):AHV 虛擬化, AOS 儲存
Nutanix 雲端管理器(NCM，Nutanix Cloud Manager):Xi Beam
Region:APAC
Use Cases:AI ML, 大數據, 私有雲與混合雲, 終端使用者運算（EUC）, 資料庫和資料庫即服務
產品:Nutanix 雲端基礎架構(NCI, Nutanix Cloud Infrastructure), Nutanix 雲端管理器(NCM，Nutanix Cloud Manager)
產業:教育領域
資源:案例研究

2024年2月7日

Case Study

IndianOil

「得益於 Nutanix 對 AI 工作負載的支援，研發的工作效率至少提升 20%。」

- 印度石油研發中心資訊系統總經理 N.K. Malik

Industries:Government, Oil & Gas
Key Play:VMware Alternative Broadcom Compete
Nutanix Central:Prism
Nutanix Cloud Infrastructure (NCI):AHV Virtualization, Flow Network Security
Products:Nutanix Central, Nutanix Cloud Infrastructure (NCI)
Region:APAC
Resource Type:Case Study
Use Cases:AI ML, Business Continuity & Disaster Recovery, Private Cloud, Sustainability & IT

2026年1月12日

案例研究

印尼選舉委員會（KPU）

「AI 將我們的法律研究時間縮短至不到 3 分鐘，大幅提升團隊滿意度，並確保即時存取準確的選舉記錄。」

- Andre Putra Hermawan，Kepala Divisi Pusat Data dan Teknologi Informasi（PUSDATIN）

Use Cases:AI ML, Private Cloud
產品:Nutanix Enterprise AI (NAI), Nutanix Kubernetes 平台（NKP), Nutanix 統一儲存管理, Nutanix 資料庫服務, Nutanix 雲端基礎架構(NCI, Nutanix Cloud Infrastructure)
產業:政府
資源:案例研究

2025年12月1日

更多客戶案例

瞭解熱門資源

Nutanix 推出 Nutanix Agentic AI 全堆疊軟體解決方案，充分發揮企業級 AI 工廠的潛力

Nutanix Agentic AI 是一款全堆疊軟體解決方案，專為協助客戶加速採用 Agentic AI 實現企業轉型而打造。

Nutanix:Press Releases
Use Cases:AI ML
Years:2026

2026年3月16日

像執行其他工作負載一樣執行 AI

為了輕鬆自信地部署、擴展和管理 AI 工作負載，企業可透過專注於關鍵成功要素，並運用現有的 IT 基礎和技能，將基礎架構的複雜性降至最低。

Blog Post

最佳化 AI 工作負載的網路效能：Nutanix 和 NVIDIA 的合作方法

AI 工作負載需要高效能、安全可靠的網路基礎架構才能有效運作。Nutanix 提供高度最佳化的網路功能，專為滿足這些需求而設計，為 AI 應用提供強大的基礎。

Products:Nutanix Cloud Platform (NCP)
Resource Type:Blog Post
Use Cases:AI ML

2025年10月28日

開始產品試用

使用 Nutanix Enterprise AI 大規模執行 AI 推論

開始試用 AI

準備好觀看展示了嗎？

歡迎與專家洽詢，以瞭解 Nutanix
如何幫助你在混合多雲端環境中擴展 AI。

常見問答

Nutanix Agentic AI 解決方案抽象化複雜度，並架起代理型 AI 建構者和 AI 工廠營運商之間的無縫橋樑。此全堆疊解決方案透過簡化運作、最大化效能和安全性，以及最佳化詞元成本，為 AI 工廠營運商提供雲端作業模式。同時，它讓代理型 AI 建構者能專注於創新、模型管理和快速推論擴展。

Agentic AI 建構者面臨高度的「創新摩擦」，因為他們需要在模型、工具和資料孤島等碎片化環境中不斷摸索，而非專注於建立智慧。開發人員則缺乏統一且安全的路徑，無法利用多元大型語言模型和開放原始碼工具，將應用程式從簡單的聊天介面快速演進為精密的代理型 AI，以推動實際的業務成果。

對於 AI 工廠營運者而言，最大的挑戰是如何提供以取得詞元的時間和每個詞元的成本來衡量的商業價值，這是由於 AI 工廠本身的運作複雜性，例如：

管理多樣且快速演進的 AI 硬體（GPU、網路、儲存）的複雜性；
提供共享存取至關鍵 AI 基礎架構的複雜性，同時確保安全存取模型和資料，並遵守主權要求
持續提供最高效能的複雜性，同時最佳化整個 AI 工廠的資源利用。
管理支援 AI 工廠營運的碎片化、客製化單點解決方案生命週期的複雜性

雲端作業模型是 Nutanix 用來彌合 AI 開發者與基礎架構團隊之間鴻溝的方法。此模型不再管理分散的單點解決方案或複雜的裸機叢集，而是提供統一的全端環境。它可讓營運商像管理雲端服務一樣，輕鬆管理 AI 基礎架構（GPU、DPU 和儲存設備），同時讓開發人員即時、安全地存取擴充數千個智慧型代理所需的工具和模型。

Nutanix 透過多項整合效率最佳化詞元經濟效益：

拓樸感知最佳化：AHV 虛擬機器管理程式會自動將工作負載分配至 GPU 密集的伺服器上，以最大化硬體資源配置。
資源卸載：使用 DPU（資料處理單元）處理網路和安全任務，可釋放 GPU 的週期，專門用於推論。
智慧儲存：Nutanix 統一儲存提供高容量的 KV 快取卸載層，節省昂貴的 GPU 記憶體，並允許更大的上下文窗口，而不會影響效能。

雖然裸機為初期模型訓練的標準，但其通常缺乏企業中代理擴展所需的安全性和隔離性。Nutanix 使用基於虛擬機器的 Kubernetes 基礎架構提供：

卓越的隔離：不同 AI 工作負載之間擁有更強大的多租戶和安全性界限。
規模化管理：更輕鬆的生命週期管理和資源分配。
裸機效能：透過利用 DPU 加速和拓撲感知技術，Nutanix 可於虛擬化環境中提供裸機的速度。

NAI Gateway 可作為所有 AI 模型的安全「前門」。它提供統一的推論端點，讓企業可以在同一處管理雲端託管模型和私有大型語言模型。關鍵功能包括：

治理：基於詞元的速率限制，以防止「高昂的帳單」。
可觀測性：全面了解誰在消耗資源及其消耗方式。
連接性：支援模型上下文協定（MCP），讓代理程式安全地連接至私有企業資料和工具。

該解決方案提供以開發人員為中心的環境來減少「創新摩擦」，讓他們可繞過基礎架構設定。透過 Nutanix Kubernetes 平台（NKP），建構者可存取豐富的 AI 目錄，包括：

預先建構的開放原始碼工具（筆記本、向量資料庫、MLOps 引擎）。
即時部署 NVIDIA NIM 及 NVIDIA Nemotron 系列模型。
一鍵式安全推論端點與開箱即用的微調服務。

Nutanix 統一儲存提供可擴展、高效能的資料平台，專為 AI 和新一代應用程式等現代工作負載而打造。關鍵功能包括：

超快的讀取輸送量和密集的全 NVMe 容量，可處理 AI 管道的大量資料集，包括推論和檢索增強生成（RAG）。
整合 Nutanix Kubernetes 平台，實現容器化 AI/ML 管線及雲端原生應用程式的無縫部署。
多協定資料存取，簡化多元工作負載的儲存，並加速創新。

您的 AI 工廠需要真正有效的基礎架構

適用於 AI 工廠的雲端作業模型

將每個詞元的成本降到最低

確保企業級的安全與控制

加速開發人員的效率

全面落實代理型 AI

關鍵整合元件

AI 服務和 Kubernetes 平台

先進的 AI Gateway 和推論服務

模型上下文協定支援與微調

開放式 Kubernetes 平台搭載豐富的 AI 目錄

基礎架構最佳化與安全

拓樸感知最佳化

DPU 加速的零信任網路

實體隔離生命週期管理

適用於 AI 的基礎資料服務

線性可擴展性

進階輸送量

成本最佳化

Nutanix 備受以下企業的信賴

坎培拉大學

IndianOil

印尼選舉委員會（KPU）

相關產品

瞭解熱門資源

開始產品試用

準備好觀看展示了嗎？

常見問答

什麼是 Nutanix Agentic AI 解決方案？

Nutanix Agentic AI 解決方案解決的核心挑戰是什麼？

什麼是適用於 Agentic AI 的「雲端作業模型」？

Nutanix 如何幫助降低「每個詞元的成本」？

為什麼 Nutanix 建議在 AI 應用中使用虛擬機器（VM）而不是裸機 Kubernetes？

什麼是 Nutanix Enterprise AI（NAI）閘道？

此解決方案如何加速 Agentic AI 開發人員的工作流程？

Nutanix 統一儲存（NUS）如何支援人工智慧和下一代應用程式？