🔥NVIDIA open-sources Lyra 2.0, supporting the generation of explorable 3D worlds from a single photo and importing into robot simulators


NVIDIA releases Lyra 2.0, an open-source framework that generates explorable 3D worlds from a single image. After a user inputs a photo, Lyra 2.0 first creates a roaming video controlled by camera trajectory, then reconstructs the video into 3D Gaussian splats and mesh models, which can be directly imported into game engines and simulators for real-time rendering. The model weights and code are open-sourced under the Apache 2.0 license on Hugging Face and GitHub, allowing commercial use. The core technological breakthrough addresses two degradation issues in long-distance roaming: one is spatial forgetting, achieved by maintaining 3D geometric information for each frame and retrieving historical frames to establish correspondences; the second…
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pin