Self-supervised Vision Transformers for Joint SAR-optical Representation Learning

04/11/2022
by   Yi Wang, et al.
10

Self-supervised learning (SSL) has attracted much interest in remote sensing and earth observation due to its ability to learn task-agnostic representations without human annotation. While most of the existing SSL works in remote sensing utilize ConvNet backbones and focus on a single modality, we explore the potential of vision transformers (ViTs) for joint SAR-optical representation learning. Based on DINO, a state-of-the-art SSL algorithm that distills knowledge from two augmented views of an input image, we combine SAR and optical imagery by concatenating all channels to a unified input. Subsequently, we randomly mask out channels of one modality as a data augmentation strategy. While training, the model gets fed optical-only, SAR-only, and SAR-optical image pairs learning both inner- and intra-modality representations. Experimental results employing the BigEarthNet-MM dataset demonstrate the benefits of both, the ViT backbones and the proposed multimodal SSL algorithm DINO-MM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2021

The QXS-SAROPT Dataset for Deep Learning in SAR-Optical Data Fusion

Deep learning techniques have made an increasing impact on the field of ...
research
08/31/2021

Contrastive Multiview Coding with Electro-optics for SAR Semantic Segmentation

In the training of deep learning models, how the model parameters are in...
research
01/08/2019

Translating SAR to Optical Images for Assisted Interpretation

Despite the advantages of all-weather and all-day high-resolution imagin...
research
03/12/2023

DINO-MC: Self-supervised Contrastive Learning for Remote Sensing Imagery with Multi-sized Local Crops

Due to the costly nature of remote sensing image labeling and the large ...
research
05/04/2022

Self-Supervised Learning for Invariant Representations from Multi-Spectral and SAR Images

Self-Supervised learning (SSL) has become the new state-of-art in severa...
research
06/27/2022

A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image Translation Task

This technical report summarizes the analysis and approach on the image-...
research
07/19/2021

Learning a Sensor-invariant Embedding of Satellite Data: A Case Study for Lake Ice Monitoring

Fusing satellite imagery acquired with different sensors has been a long...

Please sign up or login with your details

Forgot password? Click here to reset