Ensembles of Vision Transformers as a New Paradigm for Automated Classification in Ecology

03/03/2022
by   S. Kyathanahally, et al.
0

Monitoring biodiversity is paramount to manage and protect natural resources, particularly in times of global change. Collecting images of organisms over large temporal or spatial scales is a promising practice to monitor and study biodiversity change of natural ecosystems, providing large amounts of data with minimal interference with the environment. Deep learning models are currently used to automate classification of organisms into taxonomic units. However, imprecision in these classifiers introduce a measurement noise that is difficult to control and can significantly hinder the analysis and interpretation of data. In our study, we show that this limitation can be overcome by ensembles of Data-efficient image Transformers (DeiTs), which significantly outperform the previous state of the art (SOTA). We validate our results on a large number of ecological imaging datasets of diverse origin, and organisms of study ranging from plankton to insects, birds, dog breeds, animals in the wild, and corals. On all the data sets we test, we achieve a new SOTA, with a reduction of the error with respect to the previous SOTA ranging from 18.48 very close to perfect classification. The main reason why ensembles of DeiTs perform better is not due to the single-model performance of DeiTs, but rather to the fact that predictions by independent models have a smaller overlap, and this maximizes the profit gained by ensembling. This positions DeiT ensembles as the best candidate for image classification in biodiversity monitoring.

READ FULL TEXT

page 6

page 7

page 11

page 12

research
11/29/2021

On the Effectiveness of Neural Ensembles for Image Classification with Small Datasets

Deep neural networks represent the gold standard for image classificatio...
research
10/17/2019

Deep Sub-Ensembles for Fast Uncertainty Estimation in Image Classification

Fast estimates of model uncertainty are required for many robust robotic...
research
09/14/2022

Transformers and CNNs both Beat Humans on SBIR

Sketch-based image retrieval (SBIR) is the task of retrieving natural im...
research
10/21/2022

Boosting vision transformers for image retrieval

Vision transformers have achieved remarkable progress in vision tasks su...
research
06/24/2020

Hyperparameter Ensembles for Robustness and Uncertainty Quantification

Ensembles over neural network weights trained from different random init...
research
07/01/2023

More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data

Transformers are very powerful tools for a variety of tasks across domai...
research
05/01/2020

When Ensembling Smaller Models is More Efficient than Single Large Models

Ensembling is a simple and popular technique for boosting evaluation per...

Please sign up or login with your details

Forgot password? Click here to reset