Zero-shot Transfer Learning of Driving Policy via Socially Adversarial Traffic Flow

04/25/2023
by   Dongkun Zhang, et al.
0

Acquiring driving policies that can transfer to unseen environments is challenging when driving in dense traffic flows. The design of traffic flow is essential and previous studies are unable to balance interaction and safety-criticism. To tackle this problem, we propose a socially adversarial traffic flow. We propose a Contextual Partially-Observable Stochastic Game to model traffic flow and assign Social Value Orientation (SVO) as context. We then adopt a two-stage framework. In Stage 1, each agent in our socially-aware traffic flow is driven by a hierarchical policy where upper-level policy communicates genuine SVOs of all agents, which the lower-level policy takes as input. In Stage 2, each agent in the socially adversarial traffic flow is driven by the hierarchical policy where upper-level communicates mistaken SVOs, taken by the lower-level policy trained in Stage 1. Driving policy is adversarially trained through a zero-sum game formulation with upper-level policies, resulting in a policy with enhanced zero-shot transfer capability to unseen traffic flows. Comprehensive experiments on cross-validation verify the superior zero-shot transfer performance of our method.

READ FULL TEXT

page 2

page 13

page 14

research
03/13/2021

Error-Aware Policy Learning: Zero-Shot Generalization in Partially Observable Dynamic Environments

Simulation provides a safe and efficient way to generate useful data for...
research
12/03/2021

Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction

In most modern cities, traffic congestion is one of the most salient soc...
research
05/14/2021

Adversarial Learning for Zero-Shot Stance Detection on Social Media

Stance detection on social media can help to identify and understand sla...
research
12/14/2018

Simulation to scaled city: zero-shot policy transfer for traffic control via autonomous vehicles

Using deep reinforcement learning, we train control policies for autonom...
research
12/02/2022

Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula

ML-based motion planning is a promising approach to produce agents that ...
research
05/01/2023

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas

In social psychology, Social Value Orientation (SVO) describes an indivi...
research
04/09/2022

Improve Generalization of Driving Policy at Signalized Intersections with Adversarial Learning

Intersections are quite challenging among various driving scenes wherein...

Please sign up or login with your details

Forgot password? Click here to reset