Spring 2022 GRASP SFI: Georgios Georgakis, University of Pennsylvania, “Cross-modal Map Learning for Vision and Language Navigation”
Levine 512
*This will be a HYBRID Event with in-person attendance in Levine 512 and virtual attendance via Zoom.

We consider the problem of Vision-and-Language Navigation (VLN) in previously unseen realistic indoor environments. Arguably, the biggest challenge in VLN is grounding the natural language to the visual input. The majority of current methods for VLN are trained end-to-end […]