ESE Guest Seminar – “Safe Offline RL for Constrained Markov Decision Process: Theory and Practice”
Many constrained sequential decision-making processes such as safe AV navigation, wireless network control, caching, cloud computing, etc., can be cast as Constrained Markov Decision Processes (CMDP). Reinforcement Learning (RL) algorithms […]