Metadata Might Make Language Models Better

11/18/2022
by Kaspar Beelen et al.

This paper discusses the benefits of including metadata when training language models on historical collections. Using 19th-century newspapers as a case study, we extend the time-masking approach proposed by Rosin et al. (2022) and compare different strategies for inserting temporal, political and geographical information into a Masked Language Model. After fine-tuning several DistilBERT models on the metadata-enhanced input, we systematically evaluate them on three tasks: pseudo-perplexity, metadata mask-filling and supervised classification. We find that showing relevant metadata to a language model has a beneficial impact and may even produce more robust and fairer models.
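To make the idea of metadata-enhanced input concrete, the following is a minimal sketch of one possible insertion strategy: prepending metadata values to the text and optionally replacing one of them with a mask token, so that a masked language model has to predict it. The function name `build_input`, the field names, the `[SEP]` delimiter and the sample values are all assumptions made for illustration; the abstract does not specify the exact input format used in the paper.

```python
from typing import Mapping, Optional

MASK = "[MASK]"  # mask token used by BERT-style tokenizers
SEP = "[SEP]"    # assumed delimiter between metadata fields and text


def build_input(
    text: str,
    metadata: Mapping[str, str],
    mask_field: Optional[str] = None,
) -> str:
    """Prepend metadata values to `text`, in the order given.

    If `mask_field` names one of the metadata fields, its value is
    replaced by the mask token (hypothetical metadata mask-filling setup).
    """
    parts = []
    for name, value in metadata.items():
        parts.append(MASK if name == mask_field else str(value))
    parts.append(text)
    return f" {SEP} ".join(parts)


# Illustrative, made-up record (not taken from the paper's data).
record = {"year": "1865", "politics": "liberal", "place": "Manchester"}
plain = build_input("The meeting was well attended.", record)
masked = build_input("The meeting was well attended.", record, mask_field="year")
```

Under these assumptions, `plain` exposes all three metadata values to the model, while `masked` hides the year so that the model must infer it from the remaining metadata and the text.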
