Spring 2024 GRASP Seminar: Yutong Bai, Johns Hopkins University, “Listening to the Data: Visual Learning from the Bottom Up”
Levine 307 3330 Walnut Street, Philadelphia, PA, United States*This seminar will be held in-person in Levine 307 with virtual attendance via Zoom. ABSTRACT We introduce a novel sequential modeling approach which enables learning a Large Vision Model (LVM) without making use of any linguistic data. To do this, we define a common format, "visual sentences", in which we can represent raw images and […]