LEVEN: A Large-Scale Chinese Legal Event Detection Dataset

03/16/2022
by   Feng Yao, et al.

Recognizing facts is the most fundamental step in making judgments, so detecting events in legal documents is important for legal case analysis tasks. However, existing Legal Event Detection (LED) datasets cover only a limited set of event types and contain little annotated data, which restricts the development of LED methods and their downstream applications. To alleviate these issues, we present LEVEN, a large-scale Chinese LEgal eVENt detection dataset with 8,116 legal documents and 150,977 human-annotated event mentions in 108 event types. In addition to charge-related events, LEVEN covers general events, which are critical for legal case understanding but neglected in existing LED datasets. To our knowledge, LEVEN is the largest LED dataset, with dozens of times the data scale of others, and it should significantly advance the training and evaluation of LED methods. The results of extensive experiments indicate that LED is challenging and needs further effort. Moreover, we use legal events as side information, in a straightforward way, to improve downstream applications. This method achieves average improvements of 2.2 points in precision for low-resource judgment prediction and 1.5 points in mean average precision for unsupervised case retrieval, which suggests that LED is a fundamental task. The source code and dataset can be obtained from https://github.com/thunlp/LEVEN.


