Attention-gating for improved radio galaxy classification

by Micah Bowles et al.

In this work we introduce attention as a state-of-the-art mechanism for classification of radio galaxies using convolutional neural networks (CNNs). We present an attention-based model that performs on par with previous classifiers while using more than 50% fewer parameters than the next smallest classic CNN application in this field. We demonstrate quantitatively how the selection of normalisation and aggregation methods used in attention-gating can affect the output of individual models, and show that the resulting attention maps can be used to interpret the classification choices made by the model. We observe that the salient regions identified by our model align well with the regions an expert human classifier would attend to when making equivalent classifications. We show that while the selection of normalisation and aggregation may only minimally affect the performance of individual models, it can significantly affect the interpretability of the respective attention maps. By selecting a model which aligns well with how astronomers classify radio sources by eye, a user can employ the model in a more effective manner.
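To make the roles of normalisation and aggregation concrete, the following is a minimal NumPy sketch of an attention gate, not the authors' implementation. The abstract does not name the specific methods compared, so the three normalisations (softmax, sigmoid, range) and two aggregations (mean, concatenation) used here are assumptions chosen for illustration, and all function names are hypothetical.

```python
import numpy as np


def softmax_normalise(scores):
    """Normalise an (H, W) score map so that all pixels sum to 1."""
    shifted = scores - scores.max()  # subtract the maximum for numerical stability
    weights = np.exp(shifted)
    return weights / weights.sum()


def sigmoid_normalise(scores):
    """Squash each pixel independently into (0, 1); pixels do not compete."""
    return 1.0 / (1.0 + np.exp(-scores))


def range_normalise(scores):
    """Rescale linearly so the minimum maps to 0 and the maximum to 1."""
    span = scores.max() - scores.min()
    if span == 0:
        return np.zeros_like(scores)
    return (scores - scores.min()) / span


def gate(features, attention):
    """Weight a (C, H, W) feature map by an (H, W) attention map, then sum over space."""
    return (features * attention[None, :, :]).sum(axis=(1, 2))


def aggregate(gated_vectors, how="mean"):
    """Combine the (C,) vectors produced by several gates (assumed options)."""
    stacked = np.stack(gated_vectors)
    if how == "mean":
        return stacked.mean(axis=0)
    if how == "concat":
        return stacked.reshape(-1)
    raise ValueError(f"unknown aggregation: {how}")


# Toy inputs: random scores and features stand in for learned quantities.
rng = np.random.default_rng(0)
scores = rng.normal(size=(4, 4))
features = rng.normal(size=(3, 4, 4))

attention_maps = {
    "softmax": softmax_normalise(scores),
    "sigmoid": sigmoid_normalise(scores),
    "range": range_normalise(scores),
}
gated = [gate(features, attention_maps["softmax"]),
         gate(features, attention_maps["sigmoid"])]
mean_vector = aggregate(gated, "mean")      # shape (3,)
concat_vector = aggregate(gated, "concat")  # shape (6,)
```

In this sketch the softmax map forces pixels to compete for a fixed attention budget, whereas the sigmoid and range maps score each pixel on its own scale, which is one plausible reason the choice could change how an attention map looks without greatly changing the classifier output.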




