Spring 2022 GRASP SFI: Jason Ma, University of Pennsylvania, “Beyond Expected Reward in Offline Reinforcement Learning”
Levine 512
*This will be a HYBRID event, with in-person attendance in Levine 512 and virtual attendance via Zoom.
Offline reinforcement learning (RL), which uses pre-collected, reusable offline data without further environment interactions, […]