Decision-Making under On-Ramp merge Scenarios by Distributional Soft Actor-Critic Algorithm

03/08/2021
by   Yiting Kong, et al.
0

Merging into the highway from the on-ramp is an essential scenario for automated driving. The decision-making under the scenario needs to balance the safety and efficiency performance to optimize a long-term objective, which is challenging due to the dynamic, stochastic, and adversarial characteristics. The Rule-based methods often lead to conservative driving on this task while the learning-based methods have difficulties meeting the safety requirements. In this paper, we propose an RL-based end-to-end decision-making method under a framework of offline training and online correction, called the Shielded Distributional Soft Actor-critic (SDSAC). The SDSAC adopts the policy evaluation with safety consideration and a safety shield parameterized with the barrier function in its offline training and online correction, respectively. These two measures support each other for better safety while not damaging the efficiency performance severely. We verify the SDSAC on an on-ramp merge scenario in simulation. The results show that the SDSAC has the best safety performance compared to baseline algorithms and achieves efficient driving simultaneously.

READ FULL TEXT

page 1

page 4

research
07/14/2017

Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic

Freeway merging in congested traffic is a significant challenge toward f...
research
09/12/2021

Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios

In this paper, we propose a new reinforcement learning (RL) algorithm, c...
research
02/13/2020

Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic

Reinforcement learning (RL) has achieved remarkable performance in a var...
research
10/19/2021

Watch out for the risky actors: Assessing risk in dynamic environments for safe driving

Driving in a dynamic environment that consists of other actors is inhere...
research
12/19/2020

Model-Based Actor-Critic with Chance Constraint for Stochastic System

Safety constraints are essential for reinforcement learning (RL) applied...
research
03/18/2021

Integrated Decision and Control: Towards Interpretable and Efficient Driving Intelligence

Decision and control are two of the core functionalities of high-level a...
research
09/20/2022

Rethink the Adversarial Scenario-based Safety Testing of Robots: the Comparability and Optimal Aggressiveness

This paper studies the class of scenario-based safety testing algorithms...

Please sign up or login with your details

Forgot password? Click here to reset