xLab Seminar: “Learning to Control with Vision–Language Models”

Name: xLab Seminar: “Learning to Control with Vision–Language Models”
Start: 2024-06-05T14:00:00-04:00
End: 2024-06-05T15:30:00-04:00
Location: Towne 337

June 5, 2024 at 2:00 PM - 3:30 PM

If learning from data is valuable, can learning from big data be very valuable? It has been, so far, in vision and language, for which foundation models can be trained on web-scale data to support a plethora of downstream tasks; not so much in control, for which scalable learning remains elusive. Can information encoded in vision and language models guide reinforcement learning of control policies? In this talk, I will discuss several ways for foundation models to help agents to learn to behave. Language models can provide better context for decision-making: we will see how they can succinctly describe the world state to focus the agent on relevant features; and how they can form generalizable skills that identify key subgoals. Vision and vision–language models can help the agent to model the world: we will see how they can block visual distractions to keep state representations task-relevant; and how they can hypothesize about abstract world models that guide exploration and planning.

Roy Fox

Assistant Professor of Computer Science at UC Irvine

Roy Fox is an Assistant Professor of Computer Science at the University of California, Irvine. His research interests include theory and applications of control learning: reinforcement learning (RL), control theory, information theory, and robotics. His current research focuses on structured and model-based RL, language for RL and RL for language, and optimization in deep control learning of virtual and physical agents.

Details

Date: June 5, 2024
Time: 2:00 PM - 3:30 PM
Event Categories: Colloquium, Seminar

Organizer

xLab: Safe Autonomous Systems Lab

Venue

Towne 337