Mind meets machine: Unravelling GPT-4's cognitive psychology

03/20/2023
by   Sifatkaur, et al.
8

Commonsense reasoning is a basic ingredient of intelligence in humans, empowering the ability to deduce conclusions based on the observations of surroundings. Large language models (LLMs) are emerging as potent tools increasingly capable of performing human-level tasks. The recent development in the form of GPT-4 and its demonstrated success in tasks complex to humans such as medical exam, bar exam and others has led to an increased confidence in the LLMs to become perfect instruments of intelligence. Though, the GPT-4 paper has shown performance on some common sense reasoning tasks, a comprehensive assessment of GPT-4 on common sense reasoning tasks, particularly on the existing well-established datasets is missing. In this study, we focus on the evaluation of GPT-4's performance on a set of common sense reasoning questions from the widely used CommonsenseQA dataset along with tools from cognitive psychology. In doing so, we understand how GPT-4 processes and integrates common sense knowledge with contextual information, providing insight into the underlying cognitive processes that enable its ability to generate common sense responses. We show that GPT-4 exhibits a high level of accuracy in answering common sense questions, outperforming its predecessor, GPT-3 and GPT-3.5. We show that the accuracy of GPT-4 on CommonSenseQA is 83 in the original study that human accuracy over the same data was 89 Although, GPT-4 falls short of the human performance, it is a substantial improvement from the original 56.5 CommonSenseQA study. Our results strengthen the already available assessments and confidence on GPT-4's common sense reasoning abilities which have significant potential to revolutionize the field of AI, by enabling machines to bridge the gap between human and machine reasoning.

READ FULL TEXT

page 4

page 5

research
04/18/2021

CoreQuisite: Circumstantial Preconditions of Common Sense Knowledge

The task of identifying and reasoning with circumstantial preconditions ...
research
03/22/2023

Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMs

The potential of large language models (LLMs) to reason like humans has ...
research
03/11/2020

Uncovering the Data-Related Limits of Human Reasoning Research: An Analysis based on Recommender Systems

Understanding the fundamentals of human reasoning is central to the deve...
research
02/16/2023

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

Intuitive psychology is a pillar of common-sense reasoning. The replicat...
research
09/17/2021

Repurposing of Resources: from Everyday Problem Solving through to Crisis Management

The human ability to repurpose objects and processes is universal, but i...
research
04/25/2022

A very preliminary analysis of DALL-E 2

The DALL-E 2 system generates original synthetic images corresponding to...
research
06/15/2020

Machine Common Sense

Machine common sense remains a broad, potentially unbounded problem in a...

Please sign up or login with your details

Forgot password? Click here to reset