ASSET Seminar: “How do LLMs generalize on out-of-distribution tasks? Insights from model’s internal representations”
Amy Gutmann Hall, Room 414, 3333 Chestnut Street, Philadelphia, United States

A mystery of large language models (LLMs) is their ability to solve novel tasks, notably through a few demonstrations in the prompt (in-context learning). Such tasks often require the model […]