Sort by Structure: Language Model Ranking as Dependency Probing

06/10/2022
by   Max Müller-Eberstein, et al.
0

Making an informed choice of pre-trained language model (LM) is critical for performance, yet environmentally costly, and as such widely underexplored. The field of Computer Vision has begun to tackle encoder ranking, with promising forays into Natural Language Processing, however they lack coverage of linguistic tasks such as structured prediction. We propose probing to rank LMs, specifically for parsing dependencies in a given language, by measuring the degree to which labeled trees are recoverable from an LM's contextualized embeddings. Across 46 typologically and architecturally diverse LM-language pairs, our probing approach predicts the best LM choice 79 orders of magnitude less compute than training a full parser. Within this study, we identify and analyze one recently proposed decoupled LM - RemBERT - and find it strikingly contains less inherent dependency information, but often yields the best parser after full fine-tuning. Without this outlier our approach identifies the best LM in 89

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2022

Probing for Labeled Dependency Trees

Probing has become an important tool for analyzing representations in Na...
research
10/23/2020

Graph-Based Universal Dependency Parsing in the Age of the Transformer: What Works, and What Doesn't

Current state-of-the-art graph-based dependency parsers differ on variou...
research
04/17/2021

Monotonicity Marking from Universal Dependency Trees

Dependency parsing is a tool widely used in the field of Natural languag...
research
09/04/2019

PaLM: A Hybrid Parser and Language Model

We present PaLM, a hybrid parser and neural language model. Building on ...
research
10/20/2022

Evidence > Intuition: Transferability Estimation for Encoder Selection

With the increase in availability of large pre-trained language models (...
research
06/08/2023

Hexatagging: Projective Dependency Parsing as Tagging

We introduce a novel dependency parser, the hexatagger, that constructs ...
research
05/24/2023

Structural Ambiguity and its Disambiguation in Language Model Based Parsers: the Case of Dutch Clause Relativization

This paper addresses structural ambiguity in Dutch relative clauses. By ...

Please sign up or login with your details

Forgot password? Click here to reset