research
∙
11/07/2022
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
In the field of reinforcement learning, because of the high cost and ris...
research
∙
04/13/2021