MS-Net: A Multi-modal Self-supervised Network for Fine-Grained Classification of Aircraft in SAR Images

08/28/2023
by   Bingying Yue, et al.
0

Synthetic aperture radar (SAR) imaging technology is commonly used to provide 24-hour all-weather earth observation. However, it still has some drawbacks in SAR target classification, especially in fine-grained classification of aircraft: aircrafts in SAR images have large intra-class diversity and inter-class similarity; the number of effective samples is insufficient and it's hard to annotate. To address these issues, this article proposes a novel multi-modal self-supervised network (MS-Net) for fine-grained classification of aircraft. Firstly, in order to entirely exploit the potential of multi-modal information, a two-sided path feature extraction network (TSFE-N) is constructed to enhance the image feature of the target and obtain the domain knowledge feature of text mode. Secondly, a contrastive self-supervised learning (CSSL) framework is employed to effectively learn useful label-independent feature from unbalanced data, a similarity per-ception loss (SPloss) is proposed to avoid network overfitting. Finally, TSFE-N is used as the encoder of CSSL to obtain the classification results. Through a large number of experiments, our MS-Net can effectively reduce the difficulty of classifying similar types of aircrafts. In the case of no label, the proposed algorithm achieves an accuracy of 88.46 classification task, which has pioneering significance in the field of fine-grained classification of aircraft in SAR images.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

page 9

page 11

page 12

research
05/04/2022

Self-Supervised Learning for Invariant Representations from Multi-Spectral and SAR Images

Self-Supervised learning (SSL) has become the new state-of-art in severa...
research
12/14/2022

Multi-Modal Domain Fusion for Multi-modal Aerial View Object Classification

Object detection and classification using aerial images is a challenging...
research
12/22/2021

Fine-grained Multi-Modal Self-Supervised Learning

Multi-Modal Self-Supervised Learning from videos has been shown to impro...
research
04/03/2023

Multi-Modal Representation Learning with Text-Driven Soft Masks

We propose a visual-linguistic representation learning approach within a...
research
05/04/2022

Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial View Object Classification

Multi-modal aerial view object classification (MAVOC) in Automatic targe...
research
11/24/2022

Spatial Mixture-of-Experts

Many data have an underlying dependence on spatial location; it may be w...

Please sign up or login with your details

Forgot password? Click here to reset