Transformer-based Approaches for Legal Text Processing

02/13/2022
by   Ha-Thanh Nguyen, et al.
0

In this paper, we introduce our approaches using Transformer-based models for different problems of the COLIEE 2021 automatic legal text processing competition. Automated processing of legal documents is a challenging task because of the characteristics of legal documents as well as the limitation of the amount of data. With our detailed experiments, we found that Transformer-based pretrained language models can perform well with automated legal text processing problems with appropriate approaches. We describe in detail the processing steps for each task such as problem formulation, data processing and augmentation, pretraining, finetuning. In addition, we introduce to the community two pretrained models that take advantage of parallel translations in legal domain, NFSP and NMSP. In which, NFSP achieves the state-of-the-art result in Task 5 of the competition. Although the paper focuses on technical reporting, the novelty of its approaches can also be an useful reference in automated legal document processing using Transformer-based models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2022

Pre-training Transformers on Indian Legal Text

Natural Language Processing in the legal domain been benefited hugely by...
research
05/06/2023

Rhetorical Role Labeling of Legal Documents using Transformers and Graph Neural Networks

A legal document is usually long and dense requiring human effort to par...
research
04/15/2021

Sublanguage: A Serious Issue Affects Pretrained Models in Legal Domain

Legal English is a sublanguage that is important for everyone but not fo...
research
12/13/2021

Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer

Given the fact of a case, Legal Judgment Prediction (LJP) involves a ser...
research
12/14/2021

Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models

Legal texts routinely use concepts that are difficult to understand. Law...
research
08/11/2023

Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models

Auditing financial documents is a very tedious and time-consuming proces...
research
02/15/2022

BLUE at Memotion 2.0 2022: You have my Image, my Text and my Transformer

Memes are prevalent on the internet and continue to grow and evolve alon...

Please sign up or login with your details

Forgot password? Click here to reset