Inductive reasoning in humans and large language models

06/11/2023
by   Simon J. Han, et al.
0

The impressive recent performance of large language models has led many to wonder to what extent they can serve as models of general intelligence or are similar to human cognition. We address this issue by applying GPT-3 and GPT-4 to a classic problem in human inductive reasoning known as property induction. Over two experiments, we elicit human judgments on a range of property induction tasks spanning multiple domains. Although GPT-3 struggles to capture many aspects of human behaviour, GPT-4 is much more successful: for the most part, its performance qualitatively matches that of humans, and the only notable exception is its failure to capture the phenomenon of premise non-monotonicity. Overall, this work not only demonstrates that property induction is an interesting skill on which to compare human and machine intelligence, but also provides two large datasets that can serve as suitable benchmarks for future work in this vein.

READ FULL TEXT

page 4

page 10

page 13

page 18

research
11/04/2021

On Semantic Cognition, Inductive Generalization, and Language Models

My doctoral research focuses on understanding semantic knowledge in neur...
research
05/13/2022

A Property Induction Framework for Neural Language Models

To what extent can experience from language contribute to our conceptual...
research
12/20/2022

Towards Reasoning in Large Language Models: A Survey

Reasoning is a fundamental aspect of human intelligence that plays a cru...
research
08/09/2022

Limitations of Language Models in Arithmetic and Symbolic Induction

Recent work has shown that large pretrained Language Models (LMs) can no...
research
05/04/2020

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

Large-scale pretrained language models are the major driving force behin...
research
02/16/2023

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

Intuitive psychology is a pillar of common-sense reasoning. The replicat...
research
03/10/2021

Fast and flexible: Human program induction in abstract reasoning tasks

The Abstraction and Reasoning Corpus (ARC) is a challenging program indu...

Please sign up or login with your details

Forgot password? Click here to reset