Log In Sign Up

BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief

by   Nora Kassner, et al.

Although pretrained language models (PTLMs) contain significant amounts of world knowledge, they can still produce inconsistent answers to questions when probed, even after specialized training. As a result, it can be hard to identify what the model actually "believes" about the world, making it susceptible to inconsistent behavior and simple errors. Our goal is to reduce these problems. Our approach is to embed a PTLM in a broader system that also includes an evolving, symbolic memory of beliefs – a BeliefBank – that records but then may modify the raw PTLM answers. We describe two mechanisms to improve belief consistency in the overall system. First, a reasoning component – a weighted MaxSAT solver – revises beliefs that significantly clash with others. Second, a feedback component issues future queries to the PTLM using known beliefs as context. We show that, in a controlled experimental setting, these two mechanisms result in more consistent beliefs in the overall system, improving both the accuracy and consistency of its answers over time. This is significant as it is a first step towards PTLM-based architectures with a systematic notion of belief, enabling them to construct a more coherent picture of the world, and improve over time without model retraining.


page 1

page 2

page 3

page 4


Enriching a Model's Notion of Belief using a Persistent Memory

Although pretrained language models (PTLMs) have been shown to contain s...

Relevance Sensitive Non-Monotonic Inference on Belief Sequences

We present a method for relevance sensitive non-monotonic inference from...

Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

Do language models have beliefs about the world? Dennett (1995) famously...

Enhancing Self-Consistency and Performance of Pre-Trained Language Models through Natural Language Inference

While large pre-trained language models are powerful, their predictions ...

Towards Teachable Reasoning Systems

Our goal is a teachable reasoning system for question-answering (QA), wh...

Typical models: minimizing false beliefs

A knowledge system S describing a part of real world does in general not...

Learning what they think vs. learning what they do: The micro-foundations of vicarious learning

Vicarious learning is a vital component of organizational learning. We t...