Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation

10/18/2022
by   Jun-Kun Wang, et al.
0

We consider a setting that a model needs to adapt to a new domain under distribution shifts, given that only unlabeled test samples from the new domain are accessible at test time. A common idea in most of the related works is constructing pseudo-labels for the unlabeled test samples and applying gradient descent (GD) to a loss function with the pseudo-labels. Recently, Goyal et al. (2022) propose conjugate labels, which is a new kind of pseudo-labels for self-training at test time. They empirically show that the conjugate label outperforms other ways of pseudo-labeling on many domain adaptation benchmarks. However, provably showing that GD with conjugate labels learns a good classifier for test-time adaptation remains open. In this work, we aim at theoretically understanding GD with hard and conjugate labels for a binary classification problem. We show that for square loss, GD with conjugate labels converges to a solution that minimizes the testing 0-1 loss under a Gaussian model, while GD with hard pseudo-labels fails in this task. We also analyze them under different loss functions for the update. Our results shed lights on understanding when and why GD with hard labels or conjugate labels works in test-time adaptation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2022

Test-Time Adaptation via Conjugate Pseudo-labels

Test-time adaptation (TTA) refers to adapting neural networks to distrib...
research
01/15/2023

Rethinking Precision of Pseudo Label: Test-Time Adaptation via Complementary Learning

In this work, we propose a novel complementary learning approach to enha...
research
03/07/2023

Guiding Pseudo-labels with Uncertainty Estimation for Test-Time Adaptation

Standard Unsupervised Domain Adaptation (UDA) methods assume the availab...
research
06/15/2022

Test-Time Adaptation for Visual Document Understanding

Self-supervised pretraining has been able to produce transferable repres...
research
03/25/2023

Train/Test-Time Adaptation with Retrieval

We introduce Train/Test-Time Adaptation with Retrieval (T^3AR), a method...
research
04/21/2022

Contrastive Test-Time Adaptation

Test-time adaptation is a special setting of unsupervised domain adaptat...
research
07/08/2023

Learning Variational Neighbor Labels for Test-Time Domain Generalization

This paper strives for domain generalization, where models are trained e...

Please sign up or login with your details

Forgot password? Click here to reset