In-Context Analogical Reasoning with Pre-Trained Language Models

05/28/2023
by Xiaoyang Hu, et al.

Analogical reasoning is a fundamental capacity of human cognition that allows us to reason abstractly about novel situations by relating them to past experiences. While it is thought to be essential for robust reasoning in AI systems, conventional approaches require significant training and/or hard-coding of domain knowledge to be applied to benchmark tasks. Inspired by cognitive science research that has found connections between human language and analogy-making, we explore the use of intuitive language-based abstractions to support analogy in AI systems. Specifically, we apply large pre-trained language models (PLMs) to visual Raven's Progressive Matrices (RPM), a common relational reasoning test. By simply encoding the perceptual features of the problem into language form, we find that PLMs exhibit a striking capacity for zero-shot relational reasoning, exceeding human performance and nearing supervised vision-based methods. We explore different encodings that vary the level of abstraction over task features, finding that higher-level abstractions further strengthen PLMs' analogical reasoning. Our detailed analysis reveals insights on the role of model complexity, in-context learning, and prior knowledge in solving RPM tasks.
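To make the approach concrete, below is a minimal sketch of the general recipe the abstract describes: serialize a Raven's Progressive Matrix panel's perceptual attributes into a sentence, concatenate the rows of the matrix into a prompt, and pick the answer choice whose text completion the pre-trained language model scores as most likely. The attribute vocabulary, prompt format, and use of GPT-2 here are illustrative assumptions for the sketch, not the paper's exact encoding or model.

```python
# Sketch: zero-shot RPM solving by scoring language-encoded candidates
# with a pre-trained LM. Prompt format and attributes are hypothetical.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def panel_to_text(panel):
    # A panel is a dict of perceptual attributes, e.g.
    # {"size": "small", "color": "dark", "shape": "triangle"}.
    # Varying this encoding changes the level of abstraction.
    return f'{panel["size"]} {panel["color"]} {panel["shape"]}'

def total_log_likelihood(text):
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        # .loss is the mean per-token negative log-likelihood over the
        # seq_len - 1 predicted tokens; rescale to a total log-prob.
        loss = model(ids, labels=ids).loss
    return -loss.item() * (ids.size(1) - 1)

def solve_rpm(context_rows, choices):
    # context_rows: the two complete rows plus the incomplete third row
    # of the 3x3 matrix; choices: the candidate panels for the blank.
    prompt = "\n".join(", ".join(panel_to_text(p) for p in row)
                       for row in context_rows)
    scores = [total_log_likelihood(prompt + ", " + panel_to_text(c))
              for c in choices]
    return scores.index(max(scores))
```

In this framing, panel_to_text is the knob the abstract's encoding experiments turn: replacing raw perceptual descriptions with higher-level abstractions of the same features is what the authors report further strengthens the PLM's analogical reasoning.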


Related research

Emergent Analogical Reasoning in Large Language Models (12/19/2022)
The recent advent of large language models - large neural networks train...

Understanding Narratives through Dimensions of Analogy (06/14/2022)
Analogical reasoning is a powerful qualitative reasoning tool that enabl...

Modularity Matters: Learning Invariant Relational Reasoning Tasks (06/18/2018)
We focus on two supervised visual reasoning tasks whose labels encode a ...

Learning to Reason With Relational Abstractions (10/06/2022)
Large language models have recently shown promising progress in mathemat...

Towards Human-AI Collaborative Urban Science Research Enabled by Pre-trained Large Language Models (05/19/2023)
Pre-trained large language models (PLMs) have the potential to support u...

Opening the Black Box: Analyzing Attention Weights and Hidden States in Pre-trained Language Models for Non-language Tasks (06/21/2023)
Investigating deep learning language models has always been a significan...
