Data-efficient End-to-end Information Extraction for Statistical Legal Analysis

11/03/2022
by   Wonseok Hwang, et al.
0

Legal practitioners often face a vast amount of documents. Lawyers, for instance, search for appropriate precedents favorable to their clients, while the number of legal precedents is ever-growing. Although legal search engines can assist finding individual target documents and narrowing down the number of candidates, retrieved information is often presented as unstructured text and users have to examine each document thoroughly which could lead to information overloading. This also makes their statistical analysis challenging. Here, we present an end-to-end information extraction (IE) system for legal documents. By formulating IE as a generation task, our system can be easily applied to various tasks without domain-specific engineering effort. The experimental results of four IE tasks on Korean precedents shows that our IE system can achieve competent scores (-2.3 on average) compared to the rule-based baseline with as few as 50 training examples per task and higher score (+5.4 on average) with 200 examples. Finally, our statistical analysis on two case categories–drunk driving and fraud–with 35k precedents reveals the resulting structured information from our IE system faithfully reflects the macroscopic features of Korean legal system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/08/2023

NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus

The statistical analysis of large scale legal corpus can provide valuabl...
research
12/04/2018

Information Extraction Framework to Build Legislation Network

This paper concerns an Information Extraction process for building a dyn...
research
12/05/2022

Legal Prompt Engineering for Multilingual Legal Judgement Prediction

Legal Prompt Engineering (LPE) or Legal Prompting is a process to guide ...
research
12/14/2019

Long-length Legal Document Classification

One of the principal tasks of machine learning with major applications i...
research
07/25/2023

An Intent Taxonomy of Legal Case Retrieval

Legal case retrieval is a special Information Retrieval (IR) task focusi...
research
09/14/2018

Automatic Catchphrase Extraction from Legal Case Documents via Scoring using Deep Neural Networks

In this paper, we present a method of automatic catchphrase extracting f...
research
06/03/2023

TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain

State-of-the-art offline Optical Character Recognition (OCR) frameworks ...

Please sign up or login with your details

Forgot password? Click here to reset