Legal Document Retrieval using Document Vector Embeddings and Deep Learning

05/27/2018
by   Keet Sugathadasa, et al.
0

Domain specific information retrieval process has been a prominent and ongoing research in the field of natural language processing. Many researchers have incorporated different techniques to overcome the technical and domain specificity and provide a mature model for various domains of interest. The main bottleneck in these studies is the heavy coupling of domain experts, that makes the entire process to be time consuming and cumbersome. In this study, we have developed three novel models which are compared against a golden standard generated via the on line repositories provided, specifically for the legal domain. The three different models incorporated vector space representations of the legal domain, where document vector generation was done in two different mechanisms and as an ensemble of the above two. This study contains the research being carried out in the process of representing legal case documents into different vector spaces, whilst incorporating semantic word measures and natural language processing techniques. The ensemble model built in this study, shows a significantly higher accuracy level, which indeed proves the need for incorporation of domain specific semantic similarity measures into the information retrieval process. This study also shows, the impact of varying distribution of the word similarity measures, against varying document vector dimensions, which can lead to improvements in the process of legal information retrieval.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/29/2022

Multi-stage Information Retrieval for Vietnamese Legal Texts

This study deals with the problem of information retrieval (IR) for Viet...
research
11/04/2020

JNLP Team: Deep Learning for Legal Processing in COLIEE 2020

We propose deep learning based methods for automatic systems of legal re...
research
12/08/2020

A Topological Method for Comparing Document Semantics

Comparing document semantics is one of the toughest tasks in both Natura...
research
12/21/2020

Cross-domain Retrieval in the Legal and Patent Domains: a Reproducibility Study

Domain specific search has always been a challenging information retriev...
research
11/22/2019

Use of Artificial Intelligence to Analyse Risk in Legal Documents for a Better Decision Support

Assessing risk for voluminous legal documents such as request for propos...
research
10/13/2020

Legal Document Classification: An Application to Law Area Prediction of Petitions to Public Prosecution Service

In recent years, there has been an increased interest in the application...

Please sign up or login with your details

Forgot password? Click here to reset