Image-based Automated Species Identification: Can Virtual Data Augmentation Overcome Problems of Insufficient Sampling?

10/18/2020
by   Morris Klasen, et al.
0

Automated species identification and delimitation is challenging, particularly in rare and thus often scarcely sampled species, which do not allow sufficient discrimination of infraspecific versus interspecific variation. Typical problems arising from either low or exaggerated interspecific morphological differentiation are best met by automated methods of machine learning that learn efficient and effective species identification from training samples. However, limited infraspecific sampling remains a key challenge also in machine learning. 1In this study, we assessed whether a two-level data augmentation approach may help to overcome the problem of scarce training data in automated visual species identification. The first level of visual data augmentation applies classic approaches of data augmentation and generation of faked images using a GAN approach. Descriptive feature vectors are derived from bottleneck features of a VGG-16 convolutional neural network (CNN) that are then stepwise reduced in dimensionality using Global Average Pooling and PCA to prevent overfitting. The second level of data augmentation employs synthetic additional sampling in feature space by an oversampling algorithm in vector space (SMOTE). Applied on two challenging datasets of scarab beetles (Coleoptera), our augmentation approach outperformed a non-augmented deep learning baseline approach as well as a traditional 2D morphometric approach (Procrustes analysis).

READ FULL TEXT

page 6

page 14

research
09/28/2016

Understanding data augmentation for classification: when to warp?

In this paper we investigate the benefit of augmenting data with synthet...
research
11/29/2021

Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks

To protect tropical forest biodiversity, we need to be able to detect it...
research
04/11/2018

Attention Cropping: A Novel Data Augmentation Method for Real-world Plant Species Identification

This paper investigates the issue of realistic plant species identificat...
research
06/22/2019

Deep learning approach to description and classification of fungi microscopic images

Diagnosis of fungal infections can rely on microscopic examination, howe...
research
05/24/2020

Deep learning approach to describe and classify fungi microscopic images

Preliminary diagnosis of fungal infections can rely on microscopic exami...
research
11/22/2019

Computational Ceramicology

Field archeologists are called upon to identify potsherds, for which pur...
research
07/16/2021

Recognizing bird species in diverse soundscapes under weak supervision

We present a robust classification approach for avian vocalization in co...

Please sign up or login with your details

Forgot password? Click here to reset