Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective

03/23/2023
by   Jinjing Zhu, et al.
0

Endeavors have been recently made to leverage the vision transformer (ViT) for the challenging unsupervised domain adaptation (UDA) task. They typically adopt the cross-attention in ViT for direct domain alignment. However, as the performance of cross-attention highly relies on the quality of pseudo labels for targeted samples, it becomes less effective when the domain gap becomes large. We solve this problem from a game theory's perspective with the proposed model dubbed as PMTrans, which bridges source and target domains with an intermediate domain. Specifically, we propose a novel ViT-based module called PatchMix that effectively builds up the intermediate domain, i.e., probability distribution, by learning to sample patches from both domains based on the game-theoretical models. This way, it learns to mix the patches from the source and target domains to maximize the cross entropy (CE), while exploiting two semi-supervised mixup losses in the feature and label spaces to minimize it. As such, we interpret the process of UDA as a min-max CE game with three players, including the feature extractor, classifier, and PatchMix, to find the Nash Equilibria. Moreover, we leverage attention maps from ViT to re-weight the label of each patch by its importance, making it possible to obtain more domain-discriminative feature representations. We conduct extensive experiments on four benchmark datasets, and the results show that PMTrans significantly surpasses the ViT-based and CNN-based SoTA methods by +3.6 +1.4

READ FULL TEXT

page 16

page 19

research
11/21/2019

Improving Unsupervised Domain Adaptation with Variational Information Bottleneck

Domain adaptation aims to leverage the supervision signal of source doma...
research
02/27/2022

Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation

Unsupervised domain adaptation (UDA) aims to learn transferable knowledg...
research
03/20/2020

Domain Adaptation by Class Centroid Matching and Local Manifold Self-Learning

Domain adaptation has been a fundamental technology for transferring kno...
research
02/24/2022

Towards Unsupervised Domain Adaptation via Domain-Transformer

As a vital problem in pattern analysis and machine intelligence, Unsuper...
research
03/09/2022

Dynamic Instance Domain Adaptation

Most existing studies on unsupervised domain adaptation (UDA) assume tha...
research
04/16/2022

Safe Self-Refinement for Transformer-based Domain Adaptation

Unsupervised Domain Adaptation (UDA) aims to leverage a label-rich sourc...
research
02/19/2022

BP-Triplet Net for Unsupervised Domain Adaptation: A Bayesian Perspective

Triplet loss, one of the deep metric learning (DML) methods, is to learn...

Please sign up or login with your details

Forgot password? Click here to reset