A Statutory Article Retrieval Dataset in French

08/26/2021
by   Antoine Louis, et al.
0

Statutory article retrieval is the task of automatically retrieving law articles relevant to a legal question. While recent advances in natural language processing have sparked considerable interest in many legal tasks, statutory article retrieval remains primarily untouched due to the scarcity of large-scale and high-quality annotated datasets. To address this bottleneck, we introduce the Belgian Statutory Article Retrieval Dataset (BSARD), which consists of 1,100+ French native legal questions labeled by experienced jurists with relevant articles from a corpus of 22,600+ Belgian law articles. Using BSARD, we benchmark several unsupervised information retrieval methods based on term weighting and pooled embeddings. Our best performing baseline achieves 50.8 that there is still substantial room for improvement. By the specificity of the data domain and addressed task, BSARD presents a unique challenge problem for future research on legal information retrieval.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/30/2023

Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks

Statutory article retrieval (SAR), the task of retrieving statute law ar...
research
02/15/2022

Case law retrieval: problems, methods, challenges and evaluations in the last 20 years

Case law retrieval is the retrieval of judicial decisions relevant to a ...
research
09/08/2023

CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

In recent years, great advances in pre-trained language models (PLMs) ha...
research
12/02/2021

Unsupervised Law Article Mining based on Deep Pre-Trained Language Representation Models with Application to the Italian Civil Code

Modeling law search and retrieval as prediction problems has recently em...
research
06/15/2023

Explaining Legal Concepts with Augmented Large Language Models (GPT-4)

Interpreting the meaning of legal open-textured terms is a key task of l...
research
04/18/2021

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset

While self-supervised learning has made rapid advances in natural langua...
research
12/14/2017

Passing the Brazilian OAB Exam: data preparation and some experiments

In Brazil, all legal professionals must demonstrate their knowledge of t...

Please sign up or login with your details

Forgot password? Click here to reset