Adversarial Training is Not Ready for Robot Learning

03/15/2021
by   Mathias Lechner, et al.
9

Adversarial training is an effective method to train deep learning models that are resilient to norm-bounded perturbations, with the cost of nominal performance drop. While adversarial training appears to enhance the robustness and safety of a deep model deployed in open-world decision-critical applications, counterintuitively, it induces undesired behaviors in robot learning settings. In this paper, we show theoretically and experimentally that neural controllers obtained via adversarial training are subjected to three types of defects, namely transient, systematic, and conditional errors. We first generalize adversarial training to a safety-domain optimization scheme allowing for more generic specifications. We then prove that such a learning process tends to cause certain error profiles. We support our theoretical results by a thorough experimental safety analysis in a robot-learning task. Our results suggest that adversarial training is not yet ready for robot learning.

READ FULL TEXT

page 1

page 5

page 6

research
02/02/2021

Recent Advances in Adversarial Training for Adversarial Robustness

Adversarial training is one of the most effective approaches defending a...
research
04/10/2018

Adversarial Training Versus Weight Decay

Performance-critical machine learning models should be robust to input p...
research
04/02/2022

Moment-based Adversarial Training for Embodied Language Comprehension

In this paper, we focus on a vision-and-language task in which a robot i...
research
07/01/2022

Efficient Adversarial Training With Data Pruning

Neural networks are susceptible to adversarial examples-small input pert...
research
07/08/2023

Sup-Norm Convergence of Deep Neural Network Estimator for Nonparametric Regression by Adversarial Training

We show the sup-norm convergence of deep neural network estimators with ...
research
04/15/2022

Revisiting the Adversarial Robustness-Accuracy Tradeoff in Robot Learning

Adversarial training (i.e., training on adversarially perturbed input da...
research
02/08/2021

Improving filling level classification with adversarial training

We investigate the problem of classifying - from a single image - the le...

Please sign up or login with your details

Forgot password? Click here to reset