ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind

05/24/2023
by Xiaomeng Ma, et al.

Theory of Mind (ToM), the capacity to comprehend the mental states of distinct individuals, is essential for numerous practical applications. With the development of large language models, there is a heated debate about whether they are able to perform ToM tasks. Previous studies have used different tasks and prompts to test ToM in large language models, and the results are inconsistent: some studies assert that these models are capable of exhibiting ToM, while others suggest the opposite. In this study, we present ToMChallenges, a dataset for comprehensively evaluating Theory of Mind based on the Sally-Anne and Smarties tests. We created 30 variations of each test (e.g., changing the person's name, location, and items). For each variation, we test the model's understanding of different aspects: reality, belief, 1st order belief, and 2nd order belief. We adapt our data for various tasks by creating unique prompts tailored for each task category: Fill-in-the-Blank, Multiple Choice, True/False, Chain-of-Thought True/False, Question Answering, and Text Completion. If a model has a robust ToM, it should be able to achieve good performance for different prompts across different tests. We evaluated two GPT-3.5 models, text-davinci-003 and gpt-3.5-turbo-0301, with our dataset. Our results indicate that consistent performance in ToM tasks remains a challenge.
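
As a rough illustration of how such a dataset could be organized, the sketch below crosses story variations with the aspects and task categories named in the abstract. Only the aspect and task names come from the abstract; the story template, the example names and objects, the helper functions `make_variation` and `enumerate_cases`, and the assumption that every aspect is paired with every task are hypothetical and are not taken from the paper.

```python
from itertools import product

# Aspect and task names are taken from the abstract; everything else is hypothetical.
ASPECTS = ["reality", "belief", "1st order belief", "2nd order belief"]
TASKS = [
    "Fill-in-the-Blank",
    "Multiple Choice",
    "True/False",
    "Chain-of-Thought True/False",
    "Question Answering",
    "Text Completion",
]

# Hypothetical Sally-Anne style story template (not the paper's wording).
STORY = (
    "{a} and {b} are in the {room}. {a} puts the {item} in the {c1} and leaves. "
    "While {a} is away, {b} moves the {item} to the {c2}."
)


def make_variation(a, b, room, item, c1, c2):
    """Fill the template with one set of names, location, and items."""
    return STORY.format(a=a, b=b, room=room, item=item, c1=c1, c2=c2)


def enumerate_cases(variations):
    """Pair every story variation with every aspect and task type (an assumption)."""
    return [
        {"story": story, "aspect": aspect, "task": task}
        for story, aspect, task in product(variations, ASPECTS, TASKS)
    ]


variations = [
    make_variation("Sally", "Anne", "kitchen", "marble", "basket", "box"),
    make_variation("Neila", "Juanita", "bedroom", "key", "drawer", "bag"),
]
cases = enumerate_cases(variations)
print(len(cases))  # 2 variations x 4 aspects x 6 tasks = 48
```

Under the same full-crossing assumption, 30 variations of one test would yield 30 x 4 x 6 = 720 prompts; the abstract does not state whether the actual dataset is built this way.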

research
02/04/2023

Theory of Mind May Have Spontaneously Emerged in Large Language Models

Theory of mind (ToM), or the ability to impute unobservable mental state...
research
08/28/2018

Evaluating Theory of Mind in Question Answering

We propose a new dataset for evaluating question answering models with r...
research
09/04/2022

Do Large Language Models know what humans know?

Humans can attribute mental states to others, a capacity known as Theory...
research
02/16/2023

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

Intuitive psychology is a pillar of common-sense reasoning. The replicat...
research
09/04/2023

Unveiling Theory of Mind in Large Language Models: A Parallel to Single Neurons in the Human Brain

With their recent development, large language models (LLMs) have been fo...
research
09/15/2023

Investigating the Applicability of Self-Assessment Tests for Personality Measurement of Large Language Models

As large language models (LLM) evolve in their capabilities, various rec...
research
04/17/2022

Learning Theory of Mind via Dynamic Traits Attribution

Machine learning of Theory of Mind (ToM) is essential to build social ag...