Semantic Segmentation of Legal Documents via Rhetorical Roles

12/03/2021
by   Vijit Malik, et al.
17

Legal documents are unstructured, use legal jargon, and have considerable length, making it difficult to process automatically via conventional text processing techniques. A legal document processing system would benefit substantially if the documents could be semantically segmented into coherent units of information. This paper proposes a Rhetorical Roles (RR) system for segmenting a legal document into semantically coherent units: facts, arguments, statute, issue, precedent, ruling, and ratio. With the help of legal experts, we propose a set of 13 fine-grained rhetorical role labels and create a new corpus of legal documents annotated with the proposed RR. We develop a system for segmenting a document into rhetorical role units. In particular, we develop a multitask learning-based deep learning model with document rhetorical role label shift as an auxiliary task for segmenting a legal document. We experiment extensively with various deep learning models for predicting rhetorical roles in a document, and the proposed model shows superior performance over the existing models. Further, we apply RR for predicting the judgment of legal cases and show that the use of RR enhances the prediction compared to the transformer-based models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/31/2022

Corpus for Automatic Structuring of Legal Documents

In populous countries, pending legal cases have been growing exponential...
research
11/13/2019

Identification of Rhetorical Roles of Sentences in Indian Legal Judgments

Automatically understanding the rhetorical roles of sentences in a legal...
research
05/06/2023

Rhetorical Role Labeling of Legal Documents using Transformers and Graph Neural Networks

A legal document is usually long and dense requiring human effort to par...
research
09/19/2023

A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents

Automatic legal judgment prediction and its explanation suffer from the ...
research
08/01/2017

An Investigation into the Pedagogical Features of Documents

Characterizing the content of a technical document in terms of its learn...
research
05/06/2022

Fine-grained Intent Classification in the Legal Domain

A law practitioner has to go through a lot of long legal case proceeding...
research
09/14/2020

Knowledge-Based Legal Document Assembly

This paper proposes a knowledge-based legal document assembly method tha...

Please sign up or login with your details

Forgot password? Click here to reset