Cross-Domain Generalization and Knowledge Transfer in Transformers Trained on Legal Data

12/15/2021
by   Jaromír Šavelka, et al.
0

We analyze the ability of pre-trained language models to transfer knowledge among datasets annotated with different type systems and to generalize beyond the domain and dataset they were trained on. We create a meta task, over multiple datasets focused on the prediction of rhetorical roles. Prediction of the rhetorical role a sentence plays in a case decision is an important and often studied task in AI Law. Typically, it requires the annotation of a large number of sentences to train a model, which can be time-consuming and expensive. Further, the application of the models is restrained to the same dataset it was trained on. We fine-tune language models and evaluate their performance across datasets, to investigate the models' ability to generalize across domains. Our results suggest that the approach could be helpful in overcoming the cold-start problem in active or interactvie learning, and shows the ability of the models to generalize across datasets and domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/21/2023

Investigating Pre-trained Language Models on Cross-Domain Datasets, a Step Closer to General AI

Pre-trained language models have recently emerged as a powerful tool for...
research
03/26/2022

Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages

Human languages are full of metaphorical expressions. Metaphors help peo...
research
12/15/2021

Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains

In this paper, we examine the use of multi-lingual sentence embeddings t...
research
07/31/2023

DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures

Public figures receive a disproportionate amount of abuse on social medi...
research
03/02/2022

Large-Scale Hate Speech Detection with Cross-Domain Transfer

The performance of hate speech detection models relies on the datasets o...
research
07/12/2022

A new hope for network model generalization

Generalizing machine learning (ML) models for network traffic dynamics t...
research
08/21/2023

DepreSym: A Depression Symptom Annotated Corpus and the Role of LLMs as Assessors of Psychological Markers

Computational methods for depression detection aim to mine traces of dep...

Please sign up or login with your details

Forgot password? Click here to reset