A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents

09/19/2023
by   Nishchal Prasad, et al.
0

Automatic legal judgment prediction and its explanation suffer from the problem of long case documents exceeding tens of thousands of words, in general, and having a non-uniform structure. Predicting judgments from such documents and extracting their explanation becomes a challenging task, more so on documents with no structural annotation. We define this problem as "scarce annotated legal documents" and explore their lack of structural information and their long lengths with a deep learning-based classification framework which we call MESc; "Multi-stage Encoder-based Supervised with-clustering"; for judgment prediction. Specifically, we divide a document into parts to extract their embeddings from the last four layers of a custom fine-tuned Large Language Model, and try to approximate their structure through unsupervised clustering. Which we use in another set of transformer encoder layers to learn the inter-chunk representations. We explore the adaptability of LLMs with multi-billion parameters (GPT-Neo, and GPT-J) to legal texts and their intra-domain(legal) transfer learning capacity. Alongside this, we compare their performance with MESc and the impact of combining embeddings from their last layers. For such hierarchical models, we also propose an explanation extraction algorithm named ORSE; Occlusion sensitivity-based Relevant Sentence Extractor;

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2021

ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

An automated system that could assist a judge in predicting the outcome ...
research
12/03/2021

Semantic Segmentation of Legal Documents via Rhetorical Roles

Legal documents are unstructured, use legal jargon, and have considerabl...
research
02/11/2023

A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3

LawGPT 1.0 is a virtual legal assistant built on the state-of-the-art la...
research
03/21/2023

Understand Legal Documents with Contextualized Large Language Models

The growth of pending legal cases in populous countries, such as India, ...
research
09/14/2018

Automatic Catchphrase Extraction from Legal Case Documents via Scoring using Deep Neural Networks

In this paper, we present a method of automatic catchphrase extracting f...
research
08/11/2023

Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models

Auditing financial documents is a very tedious and time-consuming proces...
research
10/15/2018

Named-Entity Linking Using Deep Learning For Legal Documents: A Transfer Learning Approach

In the legal domain it is important to differentiate between words in ge...

Please sign up or login with your details

Forgot password? Click here to reset