Last year at CES, Jensen Huang took the stage wearing his signature leather jacket and told a grand story — the era of physical AI is here. Blackwell chips, million-token long context, agent processing, autonomous driving decisions… all sound promising.



But this time is different. This year, NVIDIA is seriously tackling a real problem: how to truly reduce the cost of inference.

This is crucial. Because since open-source inference models like DeepSeek R1 became popular, the entire industry has realized — when collaboration is truly unleashed, technology diffusion happens at an incredible speed. Although open-source models are still about half a year slower, every six months they can catch up, with downloads and usage skyrocketing.

Reality is forcing NVIDIA to change its strategy. Having chips alone is not enough; the entire infrastructure for inference — computing power, networks, storage — must be completed to lower inference costs from the source. This isn’t just about training scores for super-large models, but about truly embedding AI capabilities into real-world scenarios like autonomous driving and robotics.

This means the industry landscape is quietly changing. The significant reduction in inference costs will directly impact hardware demand, and the level of infrastructure development will become a key factor in determining who can deploy applications quickly.
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • 6
  • Repost
  • Share
Comment
0/400
ImpermanentPhobiavip
· 01-08 15:39
Open-source models get a funding round every half year; it's true that Mr. Huang is panicking.
View OriginalReply0
NotGonnaMakeItvip
· 01-08 13:51
DeepSeek has directly awakened NVIDIA; even the most advanced chips must cut costs through real effort.
View OriginalReply0
Token_Sherpavip
· 01-06 03:59
lmao nvidia finally realizing the moat was never the chip, it's the stack... deepsee proved what we all knew—open source moves different when incentives align. infrastructure velocity > raw compute now, that's the actual play here
Reply0
Rugman_Walkingvip
· 01-06 03:57
Haha, DeepSeek really cornered Boss Huang. Now it's time to get serious.
View OriginalReply0
MetaverseLandlordvip
· 01-06 03:53
It's rolled up now, Huang Renxun is really scared this time. DeepSeek has cornered him.
View OriginalReply0
Ser_Liquidatedvip
· 01-06 03:41
DeepSeek really stirred the waters; nothing is more real than the pressure of costs.
View OriginalReply0
  • Pin