Language-aware Domain Generalization Network for Cross-Scene Hyperspectral Image Classification

09/06/2022
by   Yuxiang Zhang, et al.
12

Text information including extensive prior knowledge about land cover classes has been ignored in hyperspectral image classification (HSI) tasks. It is necessary to explore the effectiveness of linguistic mode in assisting HSI classification. In addition, the large-scale pre-training image-text foundation models have demonstrated great performance in a variety of downstream applications, including zero-shot transfer. However, most domain generalization methods have never addressed mining linguistic modal knowledge to improve the generalization performance of model. To compensate for the inadequacies listed above, a Language-aware Domain Generalization Network (LDGnet) is proposed to learn cross-domain invariant representation from cross-domain shared prior knowledge. The proposed method only trains on the source domain (SD) and then transfers the model to the target domain (TD). The dual-stream architecture including image encoder and text encoder is used to extract visual and linguistic features, in which coarse-grained and fine-grained text representations are designed to extract two levels of linguistic features. Furthermore, linguistic features are used as cross-domain shared semantic space, and visual-linguistic alignment is completed by supervised contrastive learning in semantic space. Extensive experiments on three datasets demonstrate the superiority of the proposed method when compared with state-of-the-art techniques.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

page 9

research
09/04/2022

Single-source Domain Expansion Network for Cross-Scene Hyperspectral Image Classification

Currently, cross-scene hyperspectral image (HSI) classification has draw...
research
11/09/2021

FILIP: Fine-grained Interactive Language-Image Pre-Training

Unsupervised large-scale vision-language pre-training has shown promisin...
research
03/01/2023

Soft Prompt Guided Joint Learning for Cross-Domain Sentiment Analysis

Aspect term extraction is a fundamental task in fine-grained sentiment a...
research
12/19/2022

SrTR: Self-reasoning Transformer with Visual-linguistic Knowledge for Scene Graph Generation

Objects in a scene are not always related. The execution efficiency of t...
research
07/05/2020

Self-Challenging Improves Cross-Domain Generalization

Convolutional Neural Networks (CNN) conduct image classification by acti...
research
08/29/2018

Cross-Domain Collaborative Learning via Cluster Canonical Correlation Analysis and Random Walker for Hyperspectral Image Classification

This paper introduces a novel heterogenous domain adaptation (HDA) metho...
research
07/18/2023

Augmenting CLIP with Improved Visio-Linguistic Reasoning

Image-text contrastive models such as CLIP are useful for a variety of d...

Please sign up or login with your details

Forgot password? Click here to reset