Automated Refugee Case Analysis: An NLP Pipeline for Supporting Legal Practitioners

05/24/2023
by Claire Barale, et al.

In this paper, we introduce an end-to-end pipeline for retrieving, processing, and extracting targeted information from legal cases. We investigate an under-studied legal domain with a case study on refugee law in Canada. Searching case law for similar past cases is a key part of legal work for both lawyers and judges, the potential end-users of our prototype. While traditional named-entity recognition labels such as dates provide meaningful information in legal work, we propose to extend existing models and retrieve a total of 19 useful categories of items from refugee cases. After creating a novel data set of cases, we perform information extraction based on state-of-the-art neural named-entity recognition (NER). We test different architectures, including two transformer models, using contextual and non-contextual embeddings, and compare general-purpose versus domain-specific pre-training. The results demonstrate that models pre-trained on legal data perform best despite their smaller size, suggesting that domain matching had a larger effect than network architecture. We achieve an F1 score above 90% on five of the targeted categories and over 80% ...
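
As an illustration of how per-category F1 scores like those reported above are commonly computed for NER, here is a minimal sketch using exact-match, entity-level scoring. It is not the authors' evaluation code: the function name `per_label_f1`, the span representation, and the labels `DATE` and `GPE` are assumptions chosen for the example, and the paper's 19 categories are not reproduced here.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Hypothetical span format: (start_token, end_token, label).
Span = Tuple[int, int, str]


def per_label_f1(gold: Iterable[Span], pred: Iterable[Span]) -> Dict[str, float]:
    """Exact-match entity-level F1, computed separately for each label."""
    gold_set, pred_set = set(gold), set(pred)
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0})

    # A predicted span is a true positive only if boundaries and label both match.
    for span in pred_set:
        counts[span[2]]["tp" if span in gold_set else "fp"] += 1

    # Gold spans that were never predicted are false negatives.
    for span in gold_set - pred_set:
        counts[span[2]]["fn"] += 1

    scores = {}
    for label, c in counts.items():
        # F1 = 2TP / (2TP + FP + FN), equivalent to the harmonic mean of P and R.
        denom = 2 * c["tp"] + c["fp"] + c["fn"]
        scores[label] = 2 * c["tp"] / denom if denom else 0.0
    return scores


# Toy data (made up): the second DATE prediction has a wrong end boundary.
gold = [(0, 2, "DATE"), (5, 7, "GPE"), (10, 12, "DATE")]
pred = [(0, 2, "DATE"), (5, 7, "GPE"), (10, 11, "DATE")]
scores = per_label_f1(gold, pred)
print(scores)
```

Under exact-match scoring, a prediction with a slightly wrong boundary counts as both a false positive and a false negative, which is why rare or long-span categories tend to score lower than short, regular ones such as dates.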


research
12/19/2022

E-NER – An Annotated Named Entity Recognition Corpus of Legal Text

Identifying named entities such as a person, location or organization, i...
research
12/19/2022

Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023?

Named Entity Recognition (NER) is an important and well-studied task in ...
research
10/03/2019

Extracting UMLS Concepts from Medical Text Using General and Domain-Specific Deep Learning Models

Entity recognition is a critical first step to a number of clinical NLP ...
research
11/14/2017

A visual search engine for Bangladeshi laws

Browsing and finding relevant information for Bangladeshi laws is a chal...
research
02/27/2022

Enhancing Legal Argument Mining with Domain Pre-training and Neural Networks

The contextual word embedding model, BERT, has proved its ability on dow...
research
03/10/2022

Semantic Norm Recognition and its application to Portuguese Law

Being able to clearly interpret legal texts and fully understanding our ...
