Contrastive Transformer: Contrastive Learning Scheme with Transformer innate Patches

03/26/2023
by Sander Riisøen Jyhne, et al.

This paper presents Contrastive Transformer, a contrastive learning scheme that uses the Transformer's innate patches. It enables existing contrastive learning techniques, which are most often used for image classification, to benefit dense downstream prediction tasks such as semantic segmentation. The scheme performs supervised patch-level contrastive learning: patches are selected based on the ground-truth mask and then used for hard-negative and hard-positive sampling. The scheme applies to all vision-transformer architectures, is easy to implement, and adds minimal memory overhead. It also removes the need for huge batch sizes, because each patch is treated as an image. We apply and test Contrastive Transformer on aerial image segmentation, a task known for low-resolution data, large class imbalance, and similar semantic classes. Extensive experiments on the ISPRS Potsdam aerial image segmentation dataset demonstrate the efficacy of the scheme, and applying it to multiple inherently different Transformer architectures shows that it generalizes. The results show a consistent increase in mean IoU across all classes.
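To make the idea of supervised patch-level contrastive learning concrete, the following is a minimal sketch, not the authors' implementation. It assumes each patch takes the majority class of the ground-truth mask pixels it covers, and it uses a generic supervised contrastive (SupCon-style) loss over patch embeddings. The abstract does not specify the loss form or how hard positives and hard negatives are chosen, so that sampling step is omitted here. The function names `patch_labels` and `supervised_patch_contrastive_loss`, the `temperature` parameter, and the toy values are all hypothetical.

```python
import math
from collections import Counter


def patch_labels(mask, patch):
    """Give each patch x patch tile of a 2D label mask its majority class (assumed rule)."""
    h, w = len(mask), len(mask[0])
    labels = []
    for r in range(0, h, patch):
        for c in range(0, w, patch):
            tile = [mask[i][j]
                    for i in range(r, min(r + patch, h))
                    for j in range(c, min(c + patch, w))]
            labels.append(Counter(tile).most_common(1)[0][0])
    return labels


def _normalize(v):
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def supervised_patch_contrastive_loss(embeddings, labels, temperature=0.1):
    """SupCon-style loss: patches sharing a label are positives, all others negatives."""
    z = [_normalize(v) for v in embeddings]
    n = len(z)
    total, anchors = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n) if j != i and labels[j] == labels[i]]
        if not positives:
            continue  # an anchor with no same-class patch contributes nothing
        logits = {j: _dot(z[i], z[j]) / temperature for j in range(n) if j != i}
        m = max(logits.values())  # subtract the max for a stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits.values()))
        total += -sum(logits[j] - log_denom for j in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0


# Toy usage: a 4x4 mask split into four 2x2 patches, with made-up 2-D embeddings.
toy_mask = [[0, 0, 1, 1],
            [0, 0, 1, 1],
            [2, 2, 1, 1],
            [2, 2, 1, 1]]
toy_labels = patch_labels(toy_mask, patch=2)
toy_embeddings = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, 1.0]]
toy_loss = supervised_patch_contrastive_loss(toy_embeddings, toy_labels)
```

Because every patch acts as its own sample, a single image already supplies many anchors, positives, and negatives. For illustration only, a 512×512 image with 16×16 patches would give (512/16)² = 1024 patch embeddings, which is consistent with the claim that huge batch sizes are not needed.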


