Loading Events

« All Events

Fall 2024 GRASP Seminar: Yunzhu Li, Columbia University, “Foundation Models for Robotic Manipulation: Opportunities and Challenges”

November 18 at 11:00 AM - 12:00 PM

*This seminar will be held in-person in Levine 307 as well as virtually via Zoom.

ABSTRACT

Foundation models, such as GPT-4 Vision, have marked significant achievements in the fields of natural language and vision, demonstrating exceptional abilities to adapt to new tasks and scenarios. However, physical interaction—such as cooking, cleaning, or caregiving—remains a frontier where foundation models and robotic systems have yet to achieve the desired level of adaptability and generalization. In this talk, I will discuss the opportunities for incorporating foundation models into classic robotic pipelines to endow robots with capabilities beyond those achievable with traditional robotic tools. The talk will focus on three key improvements in (1) task specification, (2) low-level, and (3) high-level scene modeling. The central idea behind this research is to translate the commonsense knowledge embedded in foundation models into structural priors that can be integrated into robot learning systems. This approach leverages the strengths of different modules (e.g., VLM for task interpretation and constrained optimization for motion planning), achieving the best of both worlds. I will demonstrate how such integration enables robots to interpret instructions provided in free-form natural language, and how foundation models can be augmented with additional memory mechanisms, such as an action-conditioned scene graph, to handle a wide range of real-world manipulation tasks. Toward the end of the talk, I will discuss the limitations of the current foundation models, challenges that still lie ahead, and potential avenues to address these challenges.

Yunzhu Li

Columbia University

Yunzhu Li is an Assistant Professor of Computer Science at Columbia University. Before joining Columbia, he was an Assistant Professor at UIUC CS and spent time as a Postdoc at Stanford, collaborating with Fei-Fei Li and Jiajun Wu. Yunzhu earned his PhD from MIT under the guidance of Antonio Torralba and Russ Tedrake. His work lies at the intersection of robotics, computer vision, and machine learning, with the goal of helping robots perceive and interact with the physical world as dexterously and effectively as humans do. Yunzhu’s work has been recognized with the Best Paper Award at ICRA, the Best Systems Paper Award, and as a Finalist for the Best Paper Award at CoRL. Yunzhu is also the recipient of the Sony Faculty Innovation Award, the Adobe Research Fellowship, and was selected as the First Place Recipient of the Ernst A. Guillemin Master’s Thesis Award in AI and Decision Making at MIT. His research has been published in top journals and conferences, including Nature, Science, RSS, NeurIPS, and CVPR, and featured by major media outlets, including CNN, BBC, The Wall Street Journal. His research is supported by NSF, DARPA, Amazon, Google, Sony, and Toyota.

Details

Date:
November 18
Time:
11:00 AM - 12:00 PM
Event Category:
Website:
https://www.grasp.upenn.edu/events/fall-2024-grasp-seminar-yunzhu-li/

Organizer

General Robotics, Automation, Sensing and Perception (GRASP) Lab
Email
grasplab@seas.upenn.edu
View Organizer Website

Venue

Levine 307
3330 Walnut Street
Philadelphia, PA 19104 United States
+ Google Map