ARC: Adversarially Robust Control Policies for Autonomous Vehicles

07/09/2021
by   Sampo Kuutti, et al.
0

Deep neural networks have demonstrated their capability to learn control policies for a variety of tasks. However, these neural network-based policies have been shown to be susceptible to exploitation by adversarial agents. Therefore, there is a need to develop techniques to learn control policies that are robust against adversaries. We introduce Adversarially Robust Control (ARC), which trains the protagonist policy and the adversarial policy end-to-end on the same loss. The aim of the protagonist is to maximise this loss, whilst the adversary is attempting to minimise it. We demonstrate the proposed ARC training in a highway driving scenario, where the protagonist controls the follower vehicle whilst the adversary controls the lead vehicle. By training the protagonist against an ensemble of adversaries, it learns a significantly more robust control policy, which generalises to a variety of adversarial strategies. The approach is shown to reduce the amount of collisions against new adversaries by up to 90.25 policy. Moreover, by utilising an auxiliary distillation loss, we show that the fine-tuned control policy shows no drop in performance across its original training distribution.

READ FULL TEXT
research
02/27/2020

Training Adversarial Agents to Exploit Weaknesses in Deep Control Policies

Deep learning has become an increasingly common technique for various co...
research
05/30/2017

Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation

Multisensory polices are known to enhance both state estimation and targ...
research
07/01/2020

Falsification-Based Robust Adversarial Reinforcement Learning

Reinforcement learning (RL) has achieved tremendous progress in solving ...
research
03/08/2019

Improved Robustness and Safety for Autonomous Vehicle Control with Adversarial Reinforcement Learning

To improve efficiency and reduce failures in autonomous vehicles, resear...
research
07/09/2021

Adversarial Mixture Density Networks: Learning to Drive Safely from Collision Data

Imitation learning has been widely used to learn control policies for au...
research
04/02/2015

End-to-End Training of Deep Visuomotor Policies

Policy search methods can allow robots to learn control policies for a w...
research
02/27/2023

PolyScope: Multi-Policy Access Control Analysis to Triage Android Scoped Storage

Android's filesystem access control is a crucial aspect of its system in...

Please sign up or login with your details

Forgot password? Click here to reset