Orthogonal Policy Gradient and Autonomous Driving Application

11/15/2018
by   Mincong Luo, et al.
0

One less addressed issue of deep reinforcement learning is the lack of generalization capability based on new state and new target, for complex tasks, it is necessary to give the correct strategy and evaluate all possible actions for current state. Fortunately, deep reinforcement learning has enabled enormous progress in both subproblems: giving the correct strategy and evaluating all actions based on the state. In this paper we present an approach called orthogonal policy gradient descent(OPGD) that can make agent learn the policy gradient based on the current state and the actions set, by which the agent can learn a policy network with generalization capability. we evaluate the proposed method on the 3D autonomous driving enviroment TORCS compared with the baseline model, detailed analyses of experimental results and proofs are also given.

READ FULL TEXT
research
01/12/2020

Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning

With the implementation of reinforcement learning (RL) algorithms, curre...
research
11/28/2018

Deep Reinforcement Learning for Autonomous Driving

Reinforcement learning has steadily improved and outperform human in lot...
research
05/22/2017

Deep Reinforcement Learning with Relative Entropy Stochastic Search

Many reinforcement learning methods for continuous control tasks are bas...
research
04/15/2021

Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving

Due to the limited smartness and abilities of machine intelligence, curr...
research
06/25/2020

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

In recent years, Deep Reinforcement Learning (DRL) algorithms have achie...
research
10/11/2016

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Autonomous driving is a multi-agent setting where the host vehicle must ...
research
09/09/2019

Adversarial Policy Gradient for Deep Learning Image Augmentation

The use of semantic segmentation for masking and cropping input images h...

Please sign up or login with your details

Forgot password? Click here to reset