Prioritized Experience-based Reinforcement Learning with Human Guidance: Methodology and Application to Autonomous Driving

09/26/2021
by Jingda Wu, et al.

Reinforcement learning requires skillful problem definition and substantial computational effort to solve optimization and control problems, which could limit its prospects. Introducing human guidance into reinforcement learning is a promising way to improve learning performance. In this paper, a comprehensive human guidance-based reinforcement learning framework is established. A novel prioritized experience replay mechanism that adapts to human guidance during the reinforcement learning process is proposed to boost the efficiency and performance of the reinforcement learning algorithm. To relieve the heavy workload on human participants, a behavior model is established based on an incremental online learning method to mimic human actions. We design two challenging autonomous driving tasks for evaluating the proposed algorithm. Experiments are conducted to assess the training and testing performance and the learning mechanism of the proposed algorithm. Comparative results against state-of-the-art methods suggest the advantages of our algorithm in terms of learning efficiency, performance, and robustness.
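
The abstract does not give the priority formula, so the following is only a minimal sketch of the general idea of a replay buffer that favors human-guided experience. It assumes proportional prioritization on the absolute TD error plus a fixed additive bonus for transitions flagged as human-guided. The names `Transition`, `GuidedPrioritizedReplay`, `human_bonus`, `alpha`, and `eps` are hypothetical and are not taken from the paper.

```python
import random
from dataclasses import dataclass, field
from typing import Any, List, Tuple


@dataclass
class Transition:
    state: Any
    action: Any
    reward: float
    next_state: Any
    is_human: bool  # True when the action came from a human (or a human-behavior model)


@dataclass
class GuidedPrioritizedReplay:
    """Illustrative buffer; the bonus scheme is an assumption, not the paper's method."""
    capacity: int = 10_000
    alpha: float = 0.6        # how strongly priorities skew sampling (0 = uniform)
    human_bonus: float = 1.0  # hypothetical extra priority for human-guided samples
    eps: float = 1e-3         # keeps every priority strictly positive
    _data: List[Transition] = field(default_factory=list)
    _prio: List[float] = field(default_factory=list)

    def priority(self, td_error: float, is_human: bool) -> float:
        # |TD error| plus a constant bonus when the sample carries human guidance.
        return abs(td_error) + self.eps + (self.human_bonus if is_human else 0.0)

    def add(self, t: Transition, td_error: float) -> None:
        # Drop the oldest transition once the buffer is full.
        if len(self._data) >= self.capacity:
            self._data.pop(0)
            self._prio.pop(0)
        self._data.append(t)
        self._prio.append(self.priority(td_error, t.is_human))

    def sample(self, batch_size: int, rng: random.Random) -> List[Tuple[int, Transition]]:
        # Sample with replacement, with probability proportional to priority ** alpha.
        weights = [p ** self.alpha for p in self._prio]
        idx = rng.choices(range(len(self._data)), weights=weights, k=batch_size)
        return [(i, self._data[i]) for i in idx]

    def update(self, index: int, td_error: float) -> None:
        # Refresh a priority after the learner recomputes the TD error.
        self._prio[index] = self.priority(td_error, self._data[index].is_human)
```

With equal TD errors, a human-guided transition in this sketch is drawn more often than an agent-generated one, which is the qualitative behavior the abstract describes. A full implementation would typically also apply importance-sampling weights to correct the bias introduced by non-uniform sampling.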

04/15/2021
Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving
Due to the limited smartness and abilities of machine intelligence, curr...

11/29/2019
Simulation-based reinforcement learning for real-world autonomous driving
We use synthetic data and a reinforcement learning algorithm to train a ...

03/20/2021
RLTIR: Activity-based Interactive Person Identification based on Reinforcement Learning Tree
Identity recognition plays an important role in ensuring security in our...

02/18/2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Currently, urban autonomous driving remains challenging because of the c...

03/09/2021
Computational Impact Time Guidance: A Learning-Based Prediction-Correction Approach
This paper investigates the problem of impact-time-control and proposes ...

11/05/2019
A Deep Reinforcement Learning based Approach to Learning Transferable Proof Guidance Strategies
Traditional first-order logic (FOL) reasoning systems usually rely on ma...