The Birds Need Attention Too: Analysing usage of Self Attention in identifying bird calls in soundscapes

11/14/2022
by   Chandra Kanth Nagesh, et al.
0

Birds are vital parts of ecosystems across the world and are an excellent measure of the quality of life on earth. Many bird species are endangered while others are already extinct. Ecological efforts in understanding and monitoring bird populations are important to conserve their habitat and species, but this mostly relies on manual methods in rough terrains. Recent advances in Machine Learning and Deep Learning have made automatic bird recognition in diverse environments possible. Birdcall recognition till now has been performed using convolutional neural networks. In this work, we try and understand how self-attention can aid in this endeavor. With that we build an pre-trained Attention-based Spectrogram Transformer baseline for BirdCLEF 2022 and compare the results against the pre-trained Convolution-based baseline. Our results show that the transformer models outperformed the convolutional model and we further validate our results by building baselines and analyzing the results for the previous year BirdCLEF 2021 challenge. Source code available at https://github.com/ck090/BirdCLEF-22

READ FULL TEXT

page 4

page 6

research
02/25/2020

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

Pre-trained language models (e.g., BERT (Devlin et al., 2018) and its va...
research
06/01/2022

CellCentroidFormer: Combining Self-attention and Convolution for Cell Detection

Cell detection in microscopy images is important to study how cells move...
research
05/02/2020

DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering

Transformer-based QA models use input-wide self-attention – i.e. across ...
research
06/24/2021

VOLO: Vision Outlooker for Visual Recognition

Visual recognition has been dominated by convolutional neural networks (...
research
07/16/2021

Recognizing bird species in diverse soundscapes under weak supervision

We present a robust classification approach for avian vocalization in co...
research
01/21/2021

DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character Recognition

In this work we tackle the challenging problem of anime character recogn...
research
07/30/2019

Efficient Method for Categorize Animals in the Wild

Automatic species classification in camera traps would greatly help the ...

Please sign up or login with your details

Forgot password? Click here to reset