ASSET Seminar: “Towards Pluralistic Alignment: Foundations for Learning from Diverse Human Preferences”
Raisler Lounge (Room 225), Towne Building, 220 South 33rd Street, Philadelphia
Abstract: Large pre-trained models trained on internet-scale data are often not ready for safe deployment out-of-the-box. They are heavily fine-tuned and aligned using large quantities of human preference data, usually […]