ASSET Seminar: “Beyond Photorealism: 3D Reconstruction and Generation with Multimodal and Physical Grounding”
April 22 at 12:00 PM - 1:15 PM
Organizer
Progress in 3D reconstruction and generation has accelerated rapidly, producing increasingly detailed geometry and photorealistic rendering. However, moving beyond photorealism requires models that not only look correct, but are also semantically grounded and physically plausible. This talk focuses on two complementary directions. First, multimodal grounding: 3D representations should align naturally with language and images to support cross-modal understanding, controllable generation, and intuitive editing. Second, physical grounding: reconstructions and synthesized scenes should respect constraints such as support, contact, and feasible motion, enabling reliable interaction and deployment in embodied settings, including robotics. I will present our recent work toward multimodally aligned and physics-aware 3D reconstruction and generation, and discuss the challenges that lie ahead.

