How Well Do Vision Transformers (VTs) Transfer To The Non-Natural Image Domain? An Empirical Study Involving Art Classification

08/09/2022
by Vincent Tonkes, et al.

Vision Transformers (VTs) are becoming a valuable alternative to Convolutional Neural Networks (CNNs) for problems involving high-dimensional, spatially organized inputs such as images. However, their Transfer Learning (TL) properties are not yet well studied, and it is not fully known whether these architectures transfer across domains as well as CNNs do. In this paper, we study whether VTs pre-trained on the popular ImageNet dataset learn representations that transfer to the non-natural image domain. To do so, we consider three well-studied art classification problems and use them as a surrogate for studying the TL potential of four popular VTs. Their performance is extensively compared against that of four common CNNs across several TL experiments. Our results show that VTs exhibit strong generalization properties and that these networks are more powerful feature extractors than CNNs.
