Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents

12/21/2021
by   Hannes Westermann, et al.
0

Human-performed annotation of sentences in legal documents is an important prerequisite to many machine learning based systems supporting legal tasks. Typically, the annotation is done sequentially, sentence by sentence, which is often time consuming and, hence, expensive. In this paper, we introduce a proof-of-concept system for annotating sentences "laterally." The approach is based on the observation that sentences that are similar in meaning often have the same label in terms of a particular type system. We use this observation in allowing annotators to quickly view and annotate sentences that are semantically similar to a given sentence, across an entire corpus of documents. Here, we present the interface of the system and empirically evaluate the approach. The experiments show that lateral annotation has the potential to make the annotation process quicker and more consistent.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2014

Extraction of Salient Sentences from Labelled Documents

We present a hierarchical convolutional document model with an architect...
research
05/08/2023

Unlocking Practical Applications in Legal Domain: Evaluation of GPT for Zero-Shot Semantic Annotation of Legal Texts

We evaluated the capability of a state-of-the-art generative pre-trained...
research
11/13/2019

Identification of Rhetorical Roles of Sentences in Indian Legal Judgments

Automatically understanding the rhetorical roles of sentences in a legal...
research
10/10/2021

What Makes Sentences Semantically Related: A Textual Relatedness Dataset and Empirical Study

The degree of semantic relatedness (or, closeness in meaning) of two uni...
research
09/10/2018

Identifying Relationships Among Sentences in Court Case Transcripts Using Discourse Relations

Case Law has a significant impact on the proceedings of legal cases. The...
research
04/04/2023

An interpretability framework for Similar case matching

Similar Case Matching (SCM) is designed to determine whether two cases a...
research
07/22/2020

Exploratory Search with Sentence Embeddings

Exploratory search aims to guide users through a corpus rather than pinp...

Please sign up or login with your details

Forgot password? Click here to reset