Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry

02/23/2021
by   Siddharth Mysore, et al.
3

Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures. This case study explores the performance impact of network sizes when considering actor and critic architectures independently. By relaxing the assumption of architectural symmetry, it is often possible for smaller actors to achieve comparable policy performance to their symmetric counterparts. Our experiments show up to 97 reduction of 64 practical benefits of reducing actor complexity, we believe configurations of actors and critics are aspects of actor-critic design that deserve to be considered independently.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset