Revision Transformers: Getting RiT of No-Nos

10/19/2022
by   Felix Friedrich, et al.
9

Current transformer language models (LM) are large-scale models with billions of parameters. They have been shown to provide high performances on a variety of tasks but are also prone to shortcut learning and bias. Addressing such incorrect model behavior via parameter adjustments is very costly. This is particularly problematic for updating dynamic concepts, such as moral values, which vary culturally or interpersonally. In this work, we question the current common practice of storing all information in the model parameters and propose the Revision Transformer (RiT) employing information retrieval to facilitate easy model updating. The specific combination of a large-scale pre-trained LM that inherently but also diffusely encodes world knowledge with a clear-structured revision engine makes it possible to update the model's knowledge with little effort and the help of user interaction. We exemplify RiT on a moral dataset and simulate user feedback demonstrating strong performance in model revision even with small data. This way, users can easily design a model regarding their preferences, paving the way for more transparent and personalized AI models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2023

Trainable Transformer in Transformer

Recent works attribute the capability of in-context learning (ICL) in la...
research
12/01/2020

Modifying Memories in Transformer Models

Large Transformer models have achieved impressive performance in many na...
research
07/19/2023

Thrust: Adaptively Propels Large Language Models with External Knowledge

Although large-scale pre-trained language models (PTLMs) are shown to en...
research
10/13/2022

Mass-Editing Memory in a Transformer

Recent work has shown exciting promise in updating large language models...
research
05/05/2021

Rethinking Search: Making Experts out of Dilettantes

When experiencing an information need, users want to engage with an expe...
research
07/14/2023

MGit: A Model Versioning and Management System

Models derived from other models are extremely common in machine learnin...

Please sign up or login with your details

Forgot password? Click here to reset