Fei-Fei Li has now broken the world model into a renderer, a simulator, and a planner.


Marble can already generate both visuals and collision meshes from a single model.
This “boundary fusion” direction is even more tantalizing—and more fun—than an end-to-end black box.
CoinNetwork
Fei-Fei Li redefines the world model: physical simulation is the ultimate goal of spatial intelligence
In a Substack post, Fei-Fei Li laid out for the first time a physical framework and approach for world models, stressing that such a model must learn the statistical structure of space and time rather than rely on text alone. The framework splits the world model into three components: a renderer, a simulator, and a planner. She argues that a simulator capable of predicting physical feedback is the bridge between perception and action, and she expects the boundaries among the three to blur over time into a single unified world model. Marble is an early example: a single model outputs both rendered images and collision meshes, which demonstrates this boundary fusion.
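
To make the three-way split concrete, here is a minimal toy sketch in Python. It is not taken from Fei-Fei Li's post or from Marble: every name in it (`State`, `PointRenderer`, `PointSimulator`, `GreedyPlanner`) is hypothetical, and the "world" is just a point moving along a line. It only illustrates the idea that the planner reaches its decision by asking the simulator what would happen, while the renderer handles the observable side.

```python
from dataclasses import dataclass
from typing import Protocol, Sequence


@dataclass(frozen=True)
class State:
    """Toy 1-D world state (hypothetical): a point's position and velocity."""
    position: float
    velocity: float


class Simulator(Protocol):
    """Anything that can predict the next state for a given action."""
    def step(self, state: State, action: float, dt: float) -> State: ...


class PointRenderer:
    """Perception side: turns a state into something observable."""
    def render(self, state: State) -> str:
        return f"point at x={state.position:.2f}"


class PointSimulator:
    """Predicts physical feedback: how the state changes under an action."""
    def step(self, state: State, action: float, dt: float) -> State:
        velocity = state.velocity + action * dt    # action is an acceleration
        position = state.position + velocity * dt  # semi-implicit Euler update
        return State(position, velocity)


class GreedyPlanner:
    """Action side: picks the action whose simulated outcome lands nearest the goal."""
    def __init__(self, simulator: Simulator, actions: Sequence[float]):
        self.simulator = simulator
        self.actions = actions

    def plan(self, state: State, goal: float, dt: float) -> float:
        return min(
            self.actions,
            key=lambda a: abs(self.simulator.step(state, a, dt).position - goal),
        )


simulator = PointSimulator()
planner = GreedyPlanner(simulator, actions=[-1.0, 0.0, 1.0])
state = State(position=0.0, velocity=0.0)

action = planner.plan(state, goal=1.0, dt=1.0)
next_state = simulator.step(state, action, dt=1.0)
print(action, PointRenderer().render(next_state))
```

In this sketch the three roles are separate classes with explicit interfaces. The "boundary fusion" described above would correspond to a single learned model serving all three roles at once, which this toy does not attempt.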