Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies
Standard reinforcement learning methods aim to master one way of solving...
Identifying Distinct, Effective Treatments for Acute Hypotension with SODA-RL: Safely Optimized Diverse Accurate Reinforcement Learning
Hypotension in critical care settings is a life-threatening emergency th...
Muhammad A. Masoodis this you? claim profile