Do language models have coherent mental models of everyday things?

12/20/2022
by   Yuling Gu, et al.
15

When people think of everyday things like an "egg," they typically have a mental image associated with it. This commonsense knowledge helps us understand how these everyday things work and how to interact with them. For example, when someone tries to make a fried egg, they know that it has a shell and that it can be cracked open to reveal the egg white and yolk inside. However, if a system does not have a coherent picture of such everyday things, thinking that the egg yolk surrounds the shell, then it might have to resort to ridiculous approaches such as trying to scrape the egg yolk off the shell into the pan. Do language models have a coherent picture of such everyday things? To investigate this, we propose a benchmark dataset consisting of 100 everyday things, their parts, and the relationships between these parts. We observe that state-of-the-art pre-trained language models (LMs) like GPT-3 and Macaw have fragments of knowledge about these entities, but they fail to produce consistent parts mental models. We propose a simple extension to these LMs where we apply a constraint satisfaction layer on top of raw predictions from LMs to produce more consistent and accurate parts mental models of everyday things.

READ FULL TEXT

page 10

page 14

page 16

research
10/27/2022

Gendered Mental Health Stigma in Masked Language Models

Mental health stigma prevents many individuals from receiving the approp...
research
12/16/2021

DREAM: Uncovering Mental Models behind Language Models

To what extent do language models (LMs) build "mental models" of a scene...
research
03/15/2022

Things not Written in Text: Exploring Spatial Commonsense from Visual Signals

Spatial commonsense, the knowledge about spatial position and relationsh...
research
10/29/2021

MentalBERT: Publicly Available Pretrained Language Models for Mental Healthcare

Mental health is a critical issue in modern society, and mental disorder...
research
02/16/2023

The logic behind desirable sets of things, and its filter representation

We identify the logic behind the recent theory of coherent sets of desir...
research
05/12/2023

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Language models (LMs) are powerful tools for natural language processing...
research
11/17/2011

A Model of Spatial Thinking for Computational Intelligence

Trying to be effective (no matter who exactly and in what field) a perso...

Please sign up or login with your details

Forgot password? Click here to reset