Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

06/04/2017
by Tomoki Nishi, et al.

In many robotic applications, some aspects of the system dynamics can be modeled accurately while others are difficult to obtain or model. We present a novel reinforcement learning (RL) method for continuous state and action spaces that learns with partial knowledge of the system and without active exploration. It solves linearly-solvable Markov decision processes (L-MDPs), which are well suited for continuous state and action spaces, based on an actor-critic architecture. Compared to previous RL methods for L-MDPs and path integral methods, which are model-based, the actor-critic learning does not need a model of the uncontrolled dynamics or, importantly, of the transition noise levels; however, it does require knowledge of the control dynamics for the problem. We evaluate our method on two synthetic test problems and on one real-world problem, evaluated in simulation and using real traffic data. Our experiments demonstrate improved learning and policy performance.

research
04/22/2022

TASAC: a twin-actor reinforcement learning framework with stochastic policy for batch process control

Due to their complex nonlinear dynamics and batch-to-batch variability, ...
research
11/11/2019

Real-Time Reinforcement Learning

Markov Decision Processes (MDPs), the mathematical framework underlying ...
research
06/10/2016

Policy Networks with Two-Stage Training for Dialogue Systems

In this paper, we propose to use deep policy networks which are trained ...
research
09/11/2019

Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning

Cumulative entropy regularization introduces a regulatory signal to the ...
research
07/14/2017

Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic

Freeway merging in congested traffic is a significant challenge toward f...
research
04/28/2022

Actor-Critic Scheduling for Path-Aware Air-to-Ground Multipath Multimedia Delivery

Reinforcement Learning (RL) has recently found wide applications in netw...
research
02/18/2021

Learning Memory-Dependent Continuous Control from Demonstrations

Efficient exploration has presented a long-standing challenge in reinfor...
