Evaluating the Deductive Competence of Large Language Models

09/11/2023
by S. M. Seals, et al.

The development of highly fluent large language models (LLMs) has prompted increased interest in assessing their reasoning and problem-solving capabilities. We investigate whether several LLMs can solve a classic type of deductive reasoning problem from the cognitive science literature. The tested LLMs have limited ability to solve these problems in their conventional form. We performed follow-up experiments to investigate whether changes to the presentation format and content improve model performance. We find performance differences between conditions; however, these changes do not improve overall performance. Moreover, performance interacts with presentation format and content in unexpected ways that differ from human performance. Overall, our results suggest that LLMs have unique reasoning biases that are only partially predicted from human reasoning performance.

research
07/20/2023

LLM Cognitive Judgements Differ From Human

Large Language Models (LLMs) have lately been on the spotlight of resear...
research
06/21/2023

Evaluating Large Language Models with NeuBAROCO: Syllogistic Reasoning Ability and Human-like Biases

This paper investigates whether current large language models exhibit bi...
research
06/14/2023

Revealing the structure of language model capabilities

Building a theoretical understanding of the capabilities of large langua...
research
09/15/2022

Measuring Geographic Performance Disparities of Offensive Language Classifiers

Text classifiers are applied at scale in the form of one-size-fits-all s...
research
10/12/2022

Can Pretrained Language Models (Yet) Reason Deductively?

Acquiring factual knowledge with Pretrained Language Models (PLMs) has a...
research
12/13/2022

Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety

Large language models (LLMs) have exploded in popularity in the past few...
research
09/12/2023

Re-Reading Improves Reasoning in Language Models

Reasoning presents a significant and challenging issue for Large Languag...
