Legal Case Document Similarity: You Need Both Network and Text

09/26/2022
by   Paheli Bhattacharya, et al.
0

Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task – citation network-based and text-based. Prior citation network-based approaches consider citations only to prior-cases (also called precedents) (PCNet). This approach misses important signals inherent in Statutes (written laws of a jurisdiction). In this work, we propose Hier-SPCNet that augments PCNet with a heterogeneous network of Statutes. We incorporate domain knowledge for legal document similarity into Hier-SPCNet, thereby obtaining state-of-the-art results for network-based legal document similarity. Both textual and network similarity provide important signals for legal case similarity; but till now, only trivial attempts have been made to unify the two signals. In this work, we apply several methods for combining textual and network information for estimating legal case similarity. We perform extensive experiments over legal case documents from the Indian judiciary, where the gold standard similarity between document-pairs is judged by law experts from two reputed Law institutes in India. Our experiments establish that our proposed network-based methods significantly improve the correlation with domain experts' opinion when compared to the existing methods for network-based legal document similarity. Our best-performing combination method (that combines network-based and text-based similarity) improves the correlation with domain experts' opinion by 11.8 best network-based method. We also establish that our best-performing method can be used to recommend / retrieve citable and similar cases for a source (query) case, which are well appreciated by legal experts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2020

Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Case Document Similarity

Computing similarity between two legal case documents is an important an...
research
03/03/2022

LegalVis: Exploring and Inferring Precedent Citations in Legal Documents

To reduce the number of pending cases and conflicting rulings in the Bra...
research
06/20/2021

Context-Aware Legal Citation Recommendation using Deep Learning

Lawyers and judges spend a large amount of time researching the proper l...
research
05/29/2023

Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches

The Brazilian judiciary has a large workload, resulting in a long time t...
research
07/19/2021

Unsupervised Identification of Relevant Prior Cases

Document retrieval has taken its role in almost all domains of knowledge...
research
05/25/2023

Prototype-Based Interpretability for Legal Citation Prediction

Deep learning has made significant progress in the past decade, and demo...
research
07/29/2023

Analysing the Resourcefulness of the Paragraph for Precedence Retrieval

Developing methods for extracting relevant legal information to aid lega...

Please sign up or login with your details

Forgot password? Click here to reset