Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways

12/06/2020
by Branka Mirchevska, et al.

Well-established optimization-based methods can guarantee an optimal trajectory only for a short optimization horizon, typically no longer than a few seconds. As a result, choosing the optimal trajectory for this short horizon may still result in a sub-optimal long-term solution. At the same time, the resulting short-term trajectories allow for effective, comfortable and provably safe maneuvers in a dynamic traffic environment. In this work, we address the question of how to ensure an optimal long-term driving strategy while keeping the benefits of classical trajectory planning. We introduce a Reinforcement Learning-based approach that, coupled with a trajectory planner, learns an optimal long-term decision-making strategy for driving on highways. By generating locally optimal maneuvers online as actions, we strike a balance between the infinite low-level continuous action space and the limited flexibility of a fixed number of predefined standard lane-change actions. We evaluated our method on realistic scenarios in the open-source traffic simulator SUMO and achieved better performance than the four benchmark approaches we compared against: a random action-selecting agent, a greedy agent, a high-level discrete-action agent and an IDM-based SUMO-controlled agent.
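The idea of choosing among maneuvers generated online can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Maneuver` type, the `propose_maneuvers` stand-in for a trajectory planner, the hand-written `toy_q_value` scoring function and the state keys are all hypothetical names introduced here for illustration. The sketch only shows the selection step, in which a value function is evaluated on a small set of proposed maneuvers rather than on the full continuous action space.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Maneuver:
    """Hypothetical candidate action: a lane offset and a target speed."""
    lane_offset: int      # -1 = left, 0 = keep lane, +1 = right (assumed convention)
    target_speed: float   # metres per second


def propose_maneuvers(current_speed: float) -> List[Maneuver]:
    """Stand-in for a trajectory planner that returns a few candidate maneuvers."""
    speeds = [max(0.0, current_speed + dv) for dv in (-5.0, 0.0, 5.0)]
    return [Maneuver(lane, v) for lane in (-1, 0, 1) for v in speeds]


def toy_q_value(state: dict, action: Maneuver) -> float:
    """Toy stand-in for a learned Q-function (assumption, not a trained model).

    It prefers target speeds close to the desired speed and mildly
    penalizes lane changes.
    """
    speed_error = abs(state["desired_speed"] - action.target_speed)
    return -speed_error - 0.5 * abs(action.lane_offset)


def select_action(
    state: dict,
    proposals: Sequence[Maneuver],
    q_fn: Callable[[dict, Maneuver], float],
    epsilon: float = 0.0,
) -> Maneuver:
    """Epsilon-greedy choice restricted to the proposed maneuvers."""
    if not proposals:
        raise ValueError("at least one proposal is required")
    if random.random() < epsilon:
        return random.choice(list(proposals))
    # Maximize Q over the proposals only, not over all possible actions.
    return max(proposals, key=lambda a: q_fn(state, a))


state = {"speed": 25.0, "desired_speed": 30.0}
proposals = propose_maneuvers(state["speed"])
best = select_action(state, proposals, toy_q_value)
print(best)
```

In this toy setting the greedy choice is the lane-keeping maneuver whose target speed matches the desired speed. In a learned system the scoring function would be a trained network and the proposals would come from an actual planner.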


