Sector Patch Embedding: An Embedding Module Conforming to The Distortion Pattern of Fisheye Image

03/26/2023
by   Dianyi Yang, et al.
0

Fisheye cameras suffer from image distortion while having a large field of view(LFOV). And this fact leads to poor performance on some fisheye vision tasks. One of the solutions is to optimize the current vision algorithm for fisheye images. However, most of the CNN-based methods and the Transformer-based methods lack the capability of leveraging distortion information efficiently. In this work, we propose a novel patch embedding method called Sector Patch Embedding(SPE), conforming to the distortion pattern of the fisheye image. Furthermore, we put forward a synthetic fisheye dataset based on the ImageNet-1K and explore the performance of several Transformer models on the dataset. The classification top-1 accuracy of ViT and PVT is improved by 0.75 proposed sector patch embedding method can better perceive distortion and extract features on the fisheye images. Our method can be easily adopted to other Transformer-based models. Source code is at https://github.com/IN2-ViAUn/Sector-Patch-Embedding.

READ FULL TEXT

page 2

page 3

page 5

page 6

page 7

research
02/27/2021

Transformer in Transformer

Transformer is a type of self-attention-based neural networks originally...
research
03/06/2022

A Robust Framework of Chromosome Straightening with ViT-Patch GAN

Chromosomes exhibit non-rigid and non-articulated nature with varying de...
research
06/02/2022

Modeling Image Composition for Complex Scene Generation

We present a method that achieves state-of-the-art results on challengin...
research
11/16/2021

Improved Robustness of Vision Transformer via PreLayerNorm in Patch Embedding

Vision transformers (ViTs) have recently demonstrated state-of-the-art p...
research
08/21/2023

Patch Is Not All You Need

Vision Transformers have achieved great success in computer visions, del...
research
04/13/2023

VISION DIFFMASK: Faithful Interpretation of Vision Transformers with Differentiable Patch Masking

The lack of interpretability of the Vision Transformer may hinder its us...
research
11/07/2022

Image Completion with Heterogeneously Filtered Spectral Hints

Image completion with large-scale free-form missing regions is one of th...

Please sign up or login with your details

Forgot password? Click here to reset