Understanding Character Recognition using Visual Explanations Derived from the Human Visual System and Deep Networks

08/10/2021
by Chetan Ralekar, et al.

Human observers engage in selective information uptake when classifying visual patterns. The same is true of deep neural networks, which currently constitute the best-performing artificial vision systems. Our goal is to examine the congruence, or lack thereof, in the information-gathering strategies of the two systems. We have operationalized our investigation as a character recognition task. We used eye-tracking to assay the spatial distribution of information hotspots for humans via fixation maps, and an activation mapping technique to obtain analogous distributions for deep networks via visualization maps. Qualitative comparison between visualization maps and fixation maps reveals an interesting correlate of congruence: for correctly classified characters, the deep learning model attends to regions similar to those that humans fixate. On the other hand, when the focused regions differ between humans and deep nets, the characters are typically misclassified by the latter. Hence, we propose to use the visual fixation maps obtained from the eye-tracking experiment as a supervisory input to align the model's focus with relevant character regions. We find that such supervision improves the model's performance significantly and does not require any additional parameters. This approach has the potential to find applications in diverse domains, such as medical analysis and surveillance, in which explainability helps to determine system fidelity.
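The abstract does not say how the fixation maps enter training, so the following is only a minimal sketch of one plausible reading: a penalty added to the classification loss that grows as the model's visualization map diverges from the human fixation map. The KL-divergence penalty, the function names (`to_distribution`, `alignment_penalty`, `supervised_loss`), and the `weight` parameter are all assumptions for illustration, not the authors' method. The sketch does reflect one stated property: the extra term introduces no trainable parameters.

```python
import numpy as np

def to_distribution(heatmap, eps=1e-8):
    """Shift a 2-D map to be non-negative and scale it to sum to 1."""
    m = np.asarray(heatmap, dtype=np.float64)
    m = m - m.min()
    return (m + eps) / (m + eps).sum()

def alignment_penalty(model_map, fixation_map):
    """Assumed penalty: KL(fixation || model), zero when the normalized maps coincide."""
    p = to_distribution(fixation_map)
    q = to_distribution(model_map)
    return float(np.sum(p * np.log(p / q)))

def supervised_loss(class_probs, true_label, model_map, fixation_map, weight=1.0):
    """Cross-entropy plus a weighted alignment penalty (hypothetical combination)."""
    cross_entropy = -np.log(class_probs[true_label] + 1e-12)
    return float(cross_entropy + weight * alignment_penalty(model_map, fixation_map))
```

When the two maps agree, the penalty vanishes and the loss reduces to plain cross-entropy. When they disagree, the penalty is positive, which would push a model trained on this loss toward the regions humans fixate.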


