Simulation to scaled city: zero-shot policy transfer for traffic control via autonomous vehicles

12/14/2018
by   Kathy Jang, et al.
0

Using deep reinforcement learning, we train control policies for autonomous vehicles leading a platoon of vehicles onto a roundabout. Using Flow, a library for deep reinforcement learning in micro-simulators, we train two policies, one policy with noise injected into the state and action space and one without any injected noise. In simulation, the autonomous vehicle learns an emergent metering behavior for both policies in which it slows to allow for smoother merging. We then directly transfer this policy without any tuning to the University of Delaware Scaled Smart City (UDSSC), a 1:25 scale testbed for connected and automated vehicles. We characterize the performance of both policies on the scaled city. We show that the noise-free policy winds up crashing and only occasionally metering. However, the noise-injected policy consistently performs the metering behavior and remains collision-free, suggesting that the noise helps with the zero-shot policy transfer. Additionally, the transferred, noise-injected policy leads to a 5 average travel time and a reduction of 22 Videos of the controllers can be found at https://sites.google.com/view/iccps-policy-transfer.

READ FULL TEXT

page 4

page 7

page 8

research
12/07/2018

Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control

Although deep reinforcement learning (deep RL) methods have lots of stre...
research
10/16/2017

Flow: Architecture and Benchmarking for Reinforcement Learning in Traffic Control

Flow is a new computational framework, built to support a key need trigg...
research
11/24/2018

Hardware Conditioned Policies for Multi-Robot Transfer Learning

Deep reinforcement learning could be used to learn dexterous robotic pol...
research
03/05/2019

Demonstration of a Time-Efficient Mobility System Using a Scaled Smart City

The implementation of connected and automated vehicle (CAV) technologies...
research
05/22/2021

Sugestões de Rotas Personalizadas para Carrinheiros na Coleta Seletiva de Materiais Recicláveis

Carrinheiros are collectors of recyclable materials that use human-power...
research
04/25/2023

Zero-shot Transfer Learning of Driving Policy via Socially Adversarial Traffic Flow

Acquiring driving policies that can transfer to unseen environments is c...
research
09/18/2023

Zero-Shot Policy Transferability for the Control of a Scale Autonomous Vehicle

We report on a study that employs an in-house developed simulation infra...

Please sign up or login with your details

Forgot password? Click here to reset