PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture

06/26/2023
by Mohit Vaishnav, et al.

We investigate the role of attention and memory in complex reasoning tasks. We analyze Transformer-based self-attention as a model and extend it with memory. Using the Synthetic Visual Reasoning Test (SVRT), we refine the taxonomy of visual reasoning tasks. By adding self-attention to ResNet50, we enhance its feature maps with feature-based and spatial attention, which lets the network solve challenging visual reasoning tasks efficiently. These findings help characterize the attentional needs of SVRT tasks. Finally, we propose GAMR (Guided Attention Model for (visual) Reasoning), a cognitive architecture that combines attention and memory and is inspired by active vision theory. GAMR outperforms other architectures in sample efficiency, robustness, and compositionality, and shows zero-shot generalization to new reasoning tasks.

research
06/10/2022

GAMR: A Guided Attention Model for (visual) Reasoning

Humans continue to outperform modern AI systems in their ability to flex...
research
11/29/2021

Recurrent Vision Transformer for Solving Visual Reasoning Problems

Although convolutional neural networks (CNNs) showed remarkable results ...
research
11/14/2019

Attention on Abstract Visual Reasoning

Attention mechanisms have been boosting the performance of deep learning...
research
08/08/2021

Understanding the computational demands underlying visual reasoning

Visual understanding requires comprehending complex visual relations bet...
research
03/16/2018

A dataset and architecture for visual reasoning with a working memory

A vexing problem in artificial intelligence is reasoning about events th...
research
06/16/2020

Untangling tradeoffs between recurrence and self-attention in neural networks

Attention and self-attention mechanisms, inspired by cognitive processes...
research
09/02/2021

Studying the Effects of Self-Attention for Medical Image Analysis

When the trained physician interprets medical images, they understand th...
