A Dataset of German Legal Documents for Named Entity Recognition

03/29/2020
by   Elena Leitner, et al.

We describe a dataset developed for Named Entity Recognition in German federal court decisions. It consists of approximately 67,000 sentences with over 2 million tokens. The resource contains 54,000 manually annotated entities, mapped to 19 fine-grained semantic classes: person, judge, lawyer, country, city, street, landscape, organization, company, institution, court, brand, law, ordinance, European legal norm, regulation, contract, court decision, and legal literature. The legal documents were also automatically annotated with more than 35,000 TimeML-based time expressions. The dataset, which is available under a CC-BY 4.0 license in the CoNLL-2002 format, was developed for training an NER service for German legal documents in the EU project Lynx.
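
To illustrate what working with data in the CoNLL-2002 format can look like, the following is a minimal sketch rather than the authors' tooling. It assumes one token per line with the tag in the last whitespace-separated column, blank lines between sentences, and BIO-style tags. The function names `read_conll` and `extract_entities`, the sample sentences, and the tag labels `PER`, `LAW`, and `CITY` are hypothetical and chosen only for illustration; the labels used in the released files may differ.

```python
from typing import Iterable, List, Tuple

# A sentence is a list of (token, tag) pairs.
Sentence = List[Tuple[str, str]]


def read_conll(lines: Iterable[str]) -> List[Sentence]:
    """Group CoNLL-style lines into sentences (blank line = sentence boundary)."""
    sentences: List[Sentence] = []
    current: Sentence = []
    for raw in lines:
        line = raw.strip()
        if not line:
            # A blank line closes the current sentence, if there is one.
            if current:
                sentences.append(current)
                current = []
            continue
        parts = line.split()
        # Assumption: first column is the token, last column is the tag.
        current.append((parts[0], parts[-1]))
    if current:
        sentences.append(current)
    return sentences


def extract_entities(sentence: Sentence) -> List[Tuple[str, str]]:
    """Collect (entity text, label) spans from BIO tags in one sentence."""
    entities: List[Tuple[str, str]] = []
    tokens: List[str] = []
    label = None
    for token, tag in sentence:
        if tag.startswith("B-"):
            # A B- tag always starts a new span; flush any open one first.
            if tokens:
                entities.append((" ".join(tokens), label))
            tokens, label = [token], tag[2:]
        elif tag.startswith("I-") and tokens and tag[2:] == label:
            # An I- tag with the same label continues the open span.
            tokens.append(token)
        else:
            # "O" or an inconsistent I- tag closes any open span.
            if tokens:
                entities.append((" ".join(tokens), label))
            tokens, label = [], None
    if tokens:
        entities.append((" ".join(tokens), label))
    return entities


# Hypothetical sample; the tag labels are illustrative only.
SAMPLE = """\
Der O
Richter O
Müller B-PER
entschied O
nach O
§ B-LAW
242 I-LAW
BGB I-LAW
. O

Das O
Urteil O
erging O
in O
Berlin B-CITY
. O
"""

sentences = read_conll(SAMPLE.splitlines())
entities = [extract_entities(s) for s in sentences]
print(entities)
```

To read an actual file instead of the sample, `read_conll` can be passed an open file handle, since it only needs an iterable of lines.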

