Pre-training Transformers on Indian Legal Text

09/13/2022
by Shounak Paul, et al.

Natural Language Processing in the legal domain has benefited hugely from the emergence of Transformer-based Pre-trained Language Models (PLMs) pre-trained on legal text. There exist PLMs trained over European and US legal text, most notably LegalBERT. However, with the rapidly increasing volume of NLP applications on Indian legal documents, and the distinguishing characteristics of Indian legal text, it has become necessary to pre-train LMs over Indian legal text as well. In this work, we introduce Transformer-based PLMs pre-trained over a large corpus of Indian legal documents. We also apply these PLMs to several benchmark legal NLP tasks on Indian legal documents, namely Legal Statute Identification from facts, Semantic Segmentation of court judgements, and Court Judgement Prediction. Our experiments demonstrate the utility of the India-specific PLMs developed in this work.

research
09/14/2021

Legal Transformer Models May Not Always Help

Deep learning-based Natural Language Processing methods, especially tran...
research
05/09/2021

Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents

Legal artificial intelligence (LegalAI) aims to benefit legal systems wi...
research
02/13/2022

Transformer-based Approaches for Legal Text Processing

In this paper, we introduce our approaches using Transformer-based model...
research
04/14/2022

Brazilian Court Documents Clustered by Similarity Together Using Natural Language Processing Approaches with Transformers

Recent advances in Artificial intelligence (AI) have leveraged promising...
research
03/14/2022

FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing

We present a benchmark suite of four datasets for evaluating the fairnes...
research
11/02/2022

Processing Long Legal Documents with Pre-trained Transformers: Modding LegalBERT and Longformer

Pre-trained Transformers currently dominate most NLP tasks. They impose,...
research
12/14/2021

Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models

Legal texts routinely use concepts that are difficult to understand. Law...