Exploring vision transformer layer choosing for semantic segmentation

05/02/2023
by   Fangjian Lin, et al.
0

Extensive work has demonstrated the effectiveness of Vision Transformers. The plain Vision Transformer tends to obtain multi-scale features by selecting fixed layers, or the last layer of features aiming to achieve higher performance in dense prediction tasks. However, this selection is often based on manual operation. And different samples often exhibit different features at different layers (e.g., edge, structure, texture, detail, etc.). This requires us to seek a dynamic adaptive fusion method to filter different layer features. In this paper, unlike previous encoder and decoder work, we design a neck network for adaptive fusion and feature selection, called ViTController. We validate the effectiveness of our method on different datasets and models and surpass previous state-of-the-art methods. Finally, our method can also be used as a plug-in module and inserted into different networks.

READ FULL TEXT
research
05/14/2022

Transformer Scale Gate for Semantic Segmentation

Effectively encoding multi-scale contextual information is crucial for a...
research
03/26/2022

Feature Selective Transformer for Semantic Image Segmentation

Recently, it has attracted more and more attentions to fuse multi-scale ...
research
01/12/2023

Adaptive Context Selection for Polyp Segmentation

Accurate polyp segmentation is of great significance for the diagnosis a...
research
07/28/2022

A Transformer-based Generative Adversarial Network for Brain Tumor Segmentation

Brain tumor segmentation remains a challenge in medical image segmentati...
research
10/10/2022

LAPFormer: A Light and Accurate Polyp Segmentation Transformer

Polyp segmentation is still known as a difficult problem due to the larg...
research
07/01/2019

Global Transformer U-Nets for Label-Free Prediction of Fluorescence Images

Visualizing the details of different cellular structures is of great imp...
research
12/14/2022

ContraFeat: Contrasting Deep Features for Semantic Discovery

StyleGAN has shown strong potential for disentangled semantic control, t...

Please sign up or login with your details

Forgot password? Click here to reset