Theory of Mind May Have Spontaneously Emerged in Large Language Models

02/04/2023
by   Michal Kosinski, et al.
0

Theory of mind (ToM), or the ability to impute unobservable mental states to others, is central to human social interactions, communication, empathy, self-consciousness, and morality. We tested several language models using 40 classic false-belief tasks widely used to test ToM in humans. The models published before 2020 showed virtually no ability to solve ToM tasks. Yet, the first version of GPT-3 ("davinci-001"), published in May 2020, solved about 40 of false-belief tasks-performance comparable with 3.5-year-old children. Its second version ("davinci-002"; January 2022) solved 70 performance comparable with six-year-olds. Its most recent version, GPT-3.5 ("davinci-003"; November 2022), solved 90 of seven-year-olds. GPT-4 published in March 2023 solved nearly all the tasks (95 uniquely human) may have spontaneously emerged as a byproduct of language models' improving language skills.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2022

Do Large Language Models know what humans know?

Humans can attribute mental states to others, a capacity known as Theory...
research
05/24/2023

ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind

Theory of Mind (ToM), the capacity to comprehend the mental states of di...
research
09/19/2023

Evaluating large language models' ability to understand metaphor and sarcasm using a screening test for Asperger syndrome

Metaphors and sarcasm are precious fruits of our highly-evolved social c...
research
05/23/2023

Does ChatGPT have Theory of Mind?

“Theory of Mind" (ToM) is the ability to understand human thinking and d...
research
09/04/2023

Unveiling Theory of Mind in Large Language Models: A Parallel to Single Neurons in the Human Brain

With their recent development, large language models (LLMs) have been fo...
research
06/21/2023

Understanding Social Reasoning in Language Models with Language Models

As Large Language Models (LLMs) become increasingly integrated into our ...
research
06/07/2023

ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models

Humor is a central aspect of human communication that has not been solve...

Please sign up or login with your details

Forgot password? Click here to reset