2026-01-06 03:29:30

Last year at CES, Jensen Huang took the stage wearing his signature leather jacket and told a grand story — the era of physical AI is here. Blackwell chips, million-token long context, agent processing, autonomous driving decisions… all sound promising.

But this time is different. This year, NVIDIA is seriously tackling a real problem: how to truly reduce the cost of inference.

This is crucial. Because since open-source inference models like DeepSeek R1 became popular, the entire industry has realized — when collaboration is truly unleashed, technology diffusion happens at an incredible speed. Although open-source models are still about half a year slower, every six months they can catch up, with downloads and usage skyrocketing.

Reality is forcing NVIDIA to change its strategy. Having chips alone is not enough; the entire infrastructure for inference — computing power, networks, storage — must be completed to lower inference costs from the source. This isn’t just about training scores for super-large models, but about truly embedding AI capabilities into real-world scenarios like autonomous driving and robotics.

This means the industry landscape is quietly changing. The significant reduction in inference costs will directly impact hardware demand, and the level of infrastructure development will become a key factor in determining who can deploy applications quickly.

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

16 Likes

Reward
16
6
Repost
Share

Comment

0/400

ImpermanentPhobia

· 01-08 15:39

Open-source models get a funding round every half year; it's true that Mr. Huang is panicking.

View OriginalReply0

NotGonnaMakeIt

· 01-08 13:51

DeepSeek has directly awakened NVIDIA; even the most advanced chips must cut costs through real effort.

View OriginalReply0

Token_Sherpa

· 01-06 03:59

lmao nvidia finally realizing the moat was never the chip, it's the stack... deepsee proved what we all knew—open source moves different when incentives align. infrastructure velocity > raw compute now, that's the actual play here

Reply0

Rugman_Walking