Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles

11/29/2019
by   Pin Wang, et al.

Reinforcement learning algorithms have recently been proposed for learning time-sequential control policies in autonomous driving. Directly applying reinforcement learning algorithms with a discrete action space yields unsatisfactory results at the operational level of driving, where continuous control actions are required. In addition, neural network designs often fail to incorporate domain knowledge of the target problem, such as classical control theory in our case. In this paper, we propose a hybrid model that combines Q-learning with a classic PID (Proportional-Integral-Derivative) controller to handle continuous vehicle control problems in dynamic driving environments. In particular, instead of using one large neural network as the Q-function approximator, we design a Quadratic Q-function over actions, built from multiple simple neural networks, for finding optimal values within a continuous action space. We also build an action network based on domain knowledge of the control mechanism of a PID controller to guide the agent to explore optimal actions more efficiently. We test the proposed approach in simulation under two common but challenging driving situations, a lane change scenario and a ramp merge scenario. Results show that the autonomous vehicle agent can successfully learn smooth and efficient driving behavior in both situations.
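The abstract does not give the exact form of the Quadratic Q-function or of the PID-based action network, so the following is only a minimal sketch under stated assumptions. It assumes a scalar action and one common quadratic form, Q(s, a) = V(s) - 0.5 * p(s) * (a - mu(s))^2 with p(s) > 0, whose maximizer over a is mu(s). The names `PID`, `quadratic_q`, `value`, and `curvature`, and all gain values, are hypothetical and chosen for illustration; in the paper these quantities would come from learned networks rather than constants.

```python
from dataclasses import dataclass


@dataclass
class PID:
    """Textbook discrete-time PID controller (hypothetical gains)."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: float = 0.0

    def step(self, error: float, dt: float) -> float:
        # Accumulate the integral term and take a finite-difference derivative.
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def quadratic_q(action: float, mu: float, value: float, curvature: float) -> float:
    """Assumed quadratic form: Q = value - 0.5 * curvature * (action - mu)^2.

    With curvature > 0 the function is concave in the action, so the
    greedy action is simply mu and no search over actions is needed.
    """
    if curvature <= 0:
        raise ValueError("curvature must be positive for a unique maximum")
    return value - 0.5 * curvature * (action - mu) ** 2


# Use the PID output as the action at which Q peaks (illustrative values only).
pid = PID(kp=0.5, ki=0.1, kd=0.05)
mu = pid.step(error=1.0, dt=0.1)
q_at_mu = quadratic_q(mu, mu, value=2.0, curvature=4.0)
q_off_mu = quadratic_q(mu + 0.5, mu, value=2.0, curvature=4.0)
```

The appeal of a quadratic form like this is that the maximizing action is available in closed form, which is what makes Q-learning workable in a continuous action space without discretizing it.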


research
03/25/2018

Maneuver Control based on Reinforcement Learning for Automated Vehicles in An Interactive Environment

Operating a robot safely and efficiently can be considerably challenging...
research
03/25/2018

Autonomous Ramp Merge Maneuver Based on Reinforcement Learning with Continuous Action Space

Ramp merging is a critical maneuver for road safety and traffic efficien...
research
08/15/2020

Autonomous Braking and Throttle System: A Deep Reinforcement Learning Approach for Naturalistic Driving

Autonomous Braking and Throttle control is key in developing safe drivin...
research
06/05/2019

Continuous Control for Automated Lane Change Behavior Based on Deep Deterministic Policy Gradient Algorithm

Lane change is a challenging task which requires delicate actions to ens...
research
08/18/2023

Integrating Expert Guidance for Efficient Learning of Safe Overtaking in Autonomous Driving Using Deep Reinforcement Learning

Overtaking on two-lane roads is a great challenge for autonomous vehicle...
research
09/15/2019

Driving in Dense Traffic with Model-Free Reinforcement Learning

Traditional planning and control methods could fail to find a feasible t...
research
05/19/2022

Image-Based Conditioning for Action Policy Smoothness in Autonomous Miniature Car Racing with Reinforcement Learning

In recent years, deep reinforcement learning has achieved significant re...
