Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains

12/15/2021
by Jaromír Šavelka, et al.

In this paper, we examine the use of multilingual sentence embeddings to transfer predictive models for functional segmentation of adjudicatory decisions across jurisdictions, legal systems (common and civil law), languages, and domains (i.e., contexts). Mechanisms for using linguistic resources outside their original context have significant potential benefits in AI & Law, because differences between legal systems, languages, or traditions often block wider adoption of research outcomes. We analyze the use of Language-Agnostic Sentence Representations in sequence labeling models based on Gated Recurrent Units (GRUs) that are transferable across languages. To investigate transfer between different contexts, we developed an annotation scheme for functional segmentation of adjudicatory decisions. We found that models generalize beyond the contexts on which they were trained (e.g., a model trained on administrative decisions from the US can be applied to criminal law decisions from Italy). Further, we found that training the models on multiple contexts increases robustness and improves overall performance when evaluating on previously unseen contexts. Finally, we found that pooling the training data from all the contexts enhances the models' in-context performance.
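
The multi-context setup described above (train on several contexts, evaluate on one the model has not seen) can be sketched as a leave-one-context-out split. This is only an illustrative sketch: the context names, document identifiers, and the helper functions `leave_one_context_out` and `pool` are hypothetical and are not taken from the paper, whose actual datasets and training code are not reproduced here.

```python
from typing import Dict, List, Tuple

# Maps a context name to its documents. All names below are hypothetical
# placeholders, not the datasets used in the paper.
Corpus = Dict[str, List[str]]


def leave_one_context_out(corpus: Corpus) -> List[Tuple[List[str], str]]:
    """Return (training contexts, held-out context) pairs, one per context."""
    contexts = sorted(corpus)
    return [
        ([c for c in contexts if c != held_out], held_out)
        for held_out in contexts
    ]


def pool(corpus: Corpus, contexts: List[str]) -> List[str]:
    """Concatenate the documents of the chosen contexts into one training set."""
    return [doc for c in contexts for doc in corpus[c]]


corpus: Corpus = {
    "us_admin": ["us_doc_1", "us_doc_2"],
    "it_criminal": ["it_doc_1"],
    "cz_civil": ["cz_doc_1", "cz_doc_2"],
}

splits = leave_one_context_out(corpus)
for train_contexts, held_out in splits:
    train_docs = pool(corpus, train_contexts)
    print(f"train on {train_contexts} ({len(train_docs)} docs), evaluate on {held_out}")
```

Pooling every context, including the evaluation one, corresponds to calling `pool(corpus, sorted(corpus))`; that is the in-context setting the last sentence of the abstract refers to.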

