TRAC: A Textual Benchmark for Reasoning about Actions and Change

11/25/2022
by   Weinan He, et al.
0

Reasoning about actions and change (RAC) is essential to understand and interact with the ever-changing environment. Previous AI research has shown the importance of fundamental and indispensable knowledge of actions, i.e., preconditions and effects. However, traditional methods rely on logical formalization which hinders practical applications. With recent transformer-based language models (LMs), reasoning over text is desirable and seemingly feasible, leading to the question of whether LMs can effectively and efficiently learn to solve RAC problems. We propose four essential RAC tasks as a comprehensive textual benchmark and generate problems in a way that minimizes the influence of other linguistic requirements (e.g., grounding) to focus on RAC. The resulting benchmark, TRAC, encompassing problems of various complexities, facilitates a more granular evaluation of LMs, precisely targeting the structural generalization ability much needed for RAC. Experiments with three high-performing transformers indicates that additional efforts are needed to tackle challenges raised by TRAC.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2023

Teaching Probabilistic Logical Reasoning to Transformers

Recent research on transformer-based language models investigates their ...
research
12/17/2020

Can Transformers Reason About Effects of Actions?

A recent work has shown that transformers are able to "reason" with fact...
research
07/15/2022

Reasoning about Actions over Visual and Linguistic Modalities: A Survey

'Actions' play a vital role in how humans interact with the world and en...
research
07/28/2023

An Overview Of Temporal Commonsense Reasoning and Acquisition

Temporal commonsense reasoning refers to the ability to understand the t...
research
05/24/2023

SETI: Systematicity Evaluation of Textual Inference

We propose SETI (Systematicity Evaluation of Textual Inference), a novel...
research
05/12/2023

Knowledge Authoring for Rules and Actions

Knowledge representation and reasoning (KRR) systems describe and reason...
research
04/18/2023

Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions

The success of transformer models trained with a language modeling objec...

Please sign up or login with your details

Forgot password? Click here to reset