Decoder-free Robustness Disentanglement without (Additional) Supervision

07/02/2020
by Yifei Wang, et al.

Adversarial Training (AT) was proposed to alleviate the adversarial vulnerability of machine learning models by extracting only robust features from the input. However, this inevitably leads to a severe reduction in accuracy, because it discards non-robust yet useful features. This motivates us to preserve both robust and non-robust features and to separate them with disentangled representation learning. Our proposed Adversarial Asymmetric Training (AAT) algorithm can reliably disentangle robust and non-robust representations without additional supervision on robustness. Empirical results show that our method not only preserves accuracy by combining the two representations, but also achieves much better disentanglement than previous work.

research
10/26/2022

Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness

Adversarial vulnerability remains a major obstacle to constructing relia...
research
06/05/2022

Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial Training

Adversarial training has been widely explored for mitigating attacks aga...
research
04/26/2022

On Fragile Features and Batch Normalization in Adversarial Training

Modern deep learning architectures utilize batch normalization (BN) to st...
research
12/08/2021

On visual self-supervision and its effect on model robustness

Recent self-supervision methods have found success in learning feature r...
research
10/08/2022

Robustness of Unsupervised Representation Learning without Labels

Unsupervised representation learning leverages large unlabeled datasets ...
research
05/05/2022

Can collaborative learning be private, robust and scalable?

We investigate the effectiveness of combining differential privacy, mode...
research
09/28/2020

Universal Physiological Representation Learning with Soft-Disentangled Rateless Autoencoders

Human computer interaction (HCI) involves a multidisciplinary fusion of ...
