Intel Xeon 6 enters NVIDIA DGX Rubin: A milestone in the AI inference era redefining CPU value

March 17, 2026, NVIDIA GTC 2026 Conference. Intel officially announced that the Intel Xeon 6 processor would serve as the main control processor in the NVIDIA DGX Rubin NVL8 system. At the time, this news was widely interpreted as “Intel breaking into NVIDIA’s supply chain,” but the deeper industry implications go far beyond that.

The DGX Rubin NVL8 is NVIDIA’s next-generation AI supercomputing system following the Blackwell platform, equipped with eight NVIDIA Rubin GPUs, delivering 400 petaFLOPS of NVFP4 performance. In this system, the Xeon 6 is not a supporting role—it is the system’s main control CPU, responsible for task orchestration, memory management, model security, and data throughput.

The significance of this collaboration must be understood within the context of the structural shift in the AI industry from “large-scale training” to “large-scale real-time inference.” As Jeff McVeigh, General Manager of Intel’s Data Center Strategy Program, stated: “Currently, AI development is transitioning from the accelerated phase of large-scale training to a new stage driven by intelligent AI and inference systems for comprehensive real-time reasoning.”

Meanwhile, Intel CEO Lip-Bu Tan proposed the strategic direction of “AI Next Wave bringing intelligence closer to end users” at Computex 2026. The Xeon 6 entering the DGX Rubin ecosystem is a concrete implementation of this strategy at the data center level. Analyzing from four dimensions—technological logic, strategic intent, process support, and market performance—the true significance of Intel Xeon 6 entering NVIDIA’s DGX Rubin ecosystem is unpacked.

Why did DGX Rubin choose Xeon 6?

In the early years when GPU-led narratives dominated AI computing power, the role of CPUs in AI systems was severely underestimated. But the structural changes in AI workloads are reshaping this landscape.

As enterprises shift from model training to large-scale real-time inference deployment, the metrics for AI systems are no longer solely about GPU raw throughput. The main control CPU’s core functions—memory management, task orchestration, workload distribution—directly impact the efficiency and total cost of ownership (TCO) of the entire cluster.

Specifically, for the DGX Rubin NVL8 system, the reasons for selecting Xeon 6 can be understood from several technical dimensions:

Memory Capacity and Bandwidth. Xeon 6 platforms support up to 8 TB of system memory, sufficient for large models and expanding key-value caches (KV Cache). Coupled with MRDIMM technology, memory bandwidth is tripled compared to the previous generation. In inference scenarios, the demands for model weights and context window size grow exponentially, and Xeon 6 has a clear advantage here.

PCIe and I/O Capabilities. Xeon 6 offers industry-leading PCIe 5.0 channels, supporting high-bandwidth, low-latency I/O connections, capable of scheduling multiple AI accelerators and high-speed network devices simultaneously. In GPU-dense systems like DGX Rubin, the data path bandwidth between CPU and GPU directly determines GPU utilization.

Continuity of x86 Software Ecosystem. The DGX Rubin NVL8 system continues the architecture foundation established by the Blackwell platform (including DGX B300 systems) with Intel Xeon 6776P. Enterprise customers can directly migrate existing performance optimizations and system-level experience to the new generation of AI hardware, significantly reducing deployment costs and technical risks.

Security and Confidential Computing. As AI inference scales up, end-to-end confidential computing from CPU to GPU data links becomes critical. Intel TDX (Trust Domain Extensions) provides stronger security guarantees through hardware-level isolation and remote attestation.

Priority Core Turbo and Single-Threaded Performance. Intel’s Priority Core Turbo and other technologies ensure data can be transmitted rapidly to GPUs. Strong single-threaded performance handles scheduling, orchestration, and data migration. Even as inference workloads become more complex, Xeon can keep the system running smoothly.

From these technical perspectives, it’s clear that Xeon 6’s selection was not accidental. It’s not because it’s “cheap” or “well-connected,” but because in large-scale real-time inference scenarios, the system-level value of the main control CPU is being rediscovered and revalued.

“AI Next Wave”: Lip-Bu Tan’s strategic implementation

In June 2026, at Computex Taipei, Intel CEO Lip-Bu Tan took the stage for the keynote, showcasing the innovative achievements Intel and the global ecosystem have co-created over the past decade or more. But what’s truly noteworthy isn’t just the presentation itself, but his post-speech dialogue with the media.

According to reports, Lip-Bu Tan candidly stated during Computex that the era of agentic AI “handed the CPU back its crown”—restoring the CPU’s central role. More critically, Intel is experiencing a shortage in CPU supply—market demand has outpaced Intel’s current capacity.

This statement reveals two key points:

First, the explosive growth of inference and agentic AI is creating a structural incremental demand for CPUs. Agentic AI requires CPUs to coordinate tasks, retrieve information, and manage multi-turn dialogue contexts. These tasks cannot be handled solely by GPUs—while GPUs excel at parallel computation, task orchestration, logical judgment, and state management remain CPU’s domain.

Second, Intel’s CPU business is in a favorable position amid supply-demand imbalance. UBS analysis shows that in Q1 2026, server CPU shipments increased by about 6% quarter-over-quarter and approximately 19% year-over-year, far exceeding typical seasonal declines. Ongoing procurement by large-scale cloud providers is absorbing Intel’s capacity.

Lip-Bu Tan further elaborated on this strategic direction at Vista Equity Partners’ annual meeting. Since taking office 13 months ago, he has been restructuring Intel’s roadmap around the priorities of the AI new era. The core logic is: the value creation of AI is shifting from “training compute” to “inference intelligence,” and large-scale deployment of inference requires collaboration between CPUs and GPUs—precisely Intel’s core strength.

The entry of Xeon 6 into DGX Rubin is a concrete reflection of this strategy at the hardware ecosystem level. It’s not a one-off “design win,” but a landmark event in Intel’s re-establishment of CPU system-level value in the AI inference era.

18A process: supporting AI’s manufacturing backbone

If Xeon 6 entering DGX Rubin is Intel’s breakthrough at the “front end” of the AI market, then the advancement of the 18A process is the “back end” manufacturing support. Together, they form a complete logical chain for Intel’s AI counteroffensive.

On June 16, 2026, at the VLSI International Symposium, Intel announced that the first performance-enhanced version of the 18A family—Intel 18A-P—has entered risk trial production. This is a highly watched milestone: the trial production of 18A-P indicates that Intel’s advanced process roadmap is progressing on schedule, not delayed.

In terms of technical parameters, 18A-P offers a 9% performance boost at the same power consumption, or an 18% reduction in power at the same performance. Thermal performance improves by 20-40%, and it is fully compatible with 18A design rules, allowing customers to reuse existing IP and design flows. The 18A process itself employs GAA (Gate-All-Around) transistors and backside power delivery technology, reducing wiring area by 11% compared to similar front-interconnect techniques, with a tenfold reduction in dynamic voltage drop.

From an industry perspective, the importance of 18A lies not just in the performance numbers but in whether Intel can deliver on its promised schedule. Over the past year, 18A has completed critical stages including design finalization, customer tapeouts, and internal product integration. For foundry clients, the trial production schedule is more crucial than transistor performance—this is the foundation for establishing Intel as a “trusted second supplier of advanced process technology.”

Public information indicates that Intel has secured chip manufacturing orders based on the 18A process from Microsoft. Nvidia, Broadcom, and Apple are also in testing and evaluation phases. Reports suggest Google has already ordered over 3 million TPUs manufactured on 18A, with production planned to start in 2028; NVIDIA remains in testing, evaluating whether Intel’s process meets their needs.

For Xeon 6, the significance of the 18A process is that it provides a predictable manufacturing upgrade path for future generations of Xeon processors. If 18A and 18A-P can be mass-produced on schedule and attract external customers, Intel’s competitive position in AI server CPUs will be further strengthened.

Market validation: data confirming Intel’s 2026 revival

All strategic narratives ultimately require market data for validation. Since 2026, Intel’s market performance has provided quantifiable support for the above analysis.

As of June 22, 2026, INTC traded at $133.99, with a market cap exceeding $670 billion. This price is close to the mid-2025 low of nearly $19, with a cumulative increase of over 600%. Since the start of 2026, INTC has risen over 260%. On Friday, June 19, INTC closed at $133.79, with a 10.5% intraday gain, hitting new all-time highs.

The stock price rally is backed by verifiable fundamentals. In Q1 2026, Intel reported $13.6 billion in revenue, up 7% year-over-year, marking the sixth consecutive quarter beating market expectations. Non-GAAP EPS was $0.29, close to breakeven market consensus.

Most notably, the business structure is shifting. Data center and AI revenue reached $5.1 billion, up 22% YoY, becoming the fastest-growing segment. In contrast, Client Computing Group revenue was $7.7 billion, up only 1% YoY. This divergence clearly indicates that Intel’s valuation is no longer anchored solely to the PC cycle but increasingly tied to AI infrastructure expansion.

Analysts’ outlooks have also shifted. Bank of America raised Intel’s rating to “Buy” on June 6, with a target price of $135, and upwardly revised 2026–2028 non-GAAP EPS estimates to $1.06, $1.72, and $2.53 respectively. Wells Fargo raised its target from $85 to $110 on June 1. Mizuho increased its target from $124 to $128. Bloomberg consensus estimates for 2026–2028 EPS are $1.10, $1.57, and $2.37.

Of course, risks remain. Under GAAP, Intel still reported a net loss of $3.7 billion in Q1, mainly due to restructuring costs and impairments related to Mobileye. Operating cash flow was $1.1 billion, capital expenditures $3.6 billion, resulting in negative free cash flow. Foundry revenue in Q1 2026 was $5.4 billion, up 16% YoY, but still recorded a $2.4 billion operating loss.

But from a market valuation perspective, investors clearly focus more on Intel’s structural opportunities in AI inference rather than short-term accounting losses. The entry of Xeon 6 into DGX Rubin, the risk trial of 18A-P, and the continuous growth of data center AI revenue form the core narrative driving Intel’s valuation re-rating.

Conclusion: CPUs regaining center stage in AI narratives

Intel Xeon 6 entering NVIDIA’s DGX Rubin NVL8 system appears as a product-level win, but in reality, it reflects a fundamental structural shift in the AI industry—from “training era” to “inference era”—as a concrete hardware ecosystem manifestation.

In the training era, GPUs were the undisputed protagonists, with CPUs playing a supporting role. But in the inference era—especially driven by agentic AI—CPUs are back at the center. They handle task orchestration, memory management, model security, and system scheduling—functions that determine the efficiency and cost of entire AI clusters. Xeon 6’s selection is not about competing with NVIDIA in GPU dominance; it’s about the rediscovery and revaluation of CPU’s system-level value in AI inference.

Meanwhile, the risk trial of the 18A process provides Intel with manufacturing support for future generations. The 22% YoY growth in data center AI revenue and the rebound of INTC stock from $19 to $133 offer market-level validation of this strategic narrative.

Lip-Bu Tan’s “AI Next Wave bringing intelligence closer to end users” strategy is gradually unfolding through pathways like Xeon 6 entering DGX Rubin, 18A process advancement, and the rising demand for CPUs in agentic AI. For the crypto industry and broader tech investors, understanding this logical chain may be more valuable than chasing short-term stock swings.

The story of AI inference has just begun, and the CPU—long considered “obsolete”—is writing its new chapter.

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned