Indian Legal Text Summarization: A Text Normalisation-based Approach

06/13/2022
by   Satyajit Ghosh, et al.
0

In the Indian court system, pending cases have long been a problem. There are more than 4 crore cases outstanding. Manually summarising hundreds of documents is a time-consuming and tedious task for legal stakeholders. Many state-of-the-art models for text summarization have emerged as machine learning has progressed. Domain-independent models don't do well with legal texts, and fine-tuning those models for the Indian Legal System is problematic due to a lack of publicly available datasets. To improve the performance of domain-independent models, the authors have proposed a methodology for normalising legal texts in the Indian context. The authors experimented with two state-of-the-art domain-independent models for legal text summarization, namely BART and PEGASUS. BART and PEGASUS are put through their paces in terms of extractive and abstractive summarization to understand the effectiveness of the text normalisation approach. Summarised texts are evaluated by domain experts on multiple parameters and using ROUGE metrics. It shows the proposed text normalisation approach is effective in legal texts with domain-independent models.

READ FULL TEXT
research
11/13/2021

Robust Deep Reinforcement Learning for Extractive Legal Summarization

Automatic summarization of legal texts is an important and still a chall...
research
10/04/2017

Automatic Taxonomy Generation - A Use-Case in the Legal Domain

A key challenge in the legal domain is the adaptation and representation...
research
11/10/2019

Searching for Legal Clauses by Analogy. Few-shot Semantic Retrieval Shared Task

We introduce a novel shared task for semantic retrieval from legal texts...
research
10/15/2019

The NAI Suite – Drafting and Reasoning over Legal Texts

A prototype for automated reasoning over legal texts, called NAI, is pre...
research
08/24/2019

Automatic Text Summarization of Legal Cases: A Hybrid Approach

Manual Summarization of large bodies of text involves a lot of human eff...
research
10/16/2018

Multi-Task Deep Learning for Legal Document Translation, Summarization and Multi-Label Classification

The digitalization of the legal domain has been ongoing for a couple of ...
research
08/07/2021

Fine-tuning GPT-3 for Russian Text Summarization

Automatic summarization techniques aim to shorten and generalize informa...

Please sign up or login with your details

Forgot password? Click here to reset