Domain Generalisation with Bidirectional Encoder Representations from Vision Transformers

07/16/2023
by   Hamza Riaz, et al.
0

Domain generalisation involves pooling knowledge from source domain(s) into a single model that can generalise to unseen target domain(s). Recent research in domain generalisation has faced challenges when using deep learning models as they interact with data distributions which differ from those they are trained on. Here we perform domain generalisation on out-of-distribution (OOD) vision benchmarks using vision transformers. Initially we examine four vision transformer architectures namely ViT, LeViT, DeiT, and BEIT on out-of-distribution data. As the bidirectional encoder representation from image transformers (BEIT) architecture performs best, we use it in further experiments on three benchmarks PACS, Home-Office and DomainNet. Our results show significant improvements in validation and test accuracy and our implementation significantly overcomes gaps between within-distribution and OOD data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/15/2022

Domain Adaptation via Bidirectional Cross-Attention Transformer

Domain Adaptation (DA) aims to leverage the knowledge learned from a sou...
research
08/30/2023

Learning Diverse Features in Vision Transformers for Improved Generalization

Deep learning models often rely only on a small set of features even whe...
research
08/18/2022

Prompt Vision Transformer for Domain Generalization

Though vision transformers (ViTs) have exhibited impressive ability for ...
research
04/16/2022

Safe Self-Refinement for Transformer-based Domain Adaptation

Unsupervised Domain Adaptation (UDA) aims to leverage a label-rich sourc...
research
04/28/2023

Representation Matters: The Game of Chess Poses a Challenge to Vision Transformers

While transformers have gained the reputation as the "Swiss army knife o...
research
01/31/2022

Learning affective meanings that derives the social behavior using Bidirectional Encoder Representations from Transformers

Predicting the outcome of a process requires modeling the system dynamic...
research
12/05/2022

Solving the Weather4cast Challenge via Visual Transformers for 3D Images

Accurately forecasting the weather is an important task, as many real-wo...

Please sign up or login with your details

Forgot password? Click here to reset