SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval

04/22/2023
by   Haitao Li, et al.
0

Legal case retrieval, which aims to find relevant cases for a query case, plays a core role in the intelligent legal system. Despite the success that pre-training has achieved in ad-hoc retrieval tasks, effective pre-training strategies for legal case retrieval remain to be explored. Compared with general documents, legal case documents are typically long text sequences with intrinsic logical structures. However, most existing language models have difficulty understanding the long-distance dependencies between different structures. Moreover, in contrast to the general retrieval, the relevance in the legal domain is sensitive to key legal elements. Even subtle differences in key legal elements can significantly affect the judgement of relevance. However, existing pre-trained language models designed for general purposes have not been equipped to handle legal elements. To address these issues, in this paper, we propose SAILER, a new Structure-Aware pre-traIned language model for LEgal case Retrieval. It is highlighted in the following three aspects: (1) SAILER fully utilizes the structural information contained in legal case documents and pays more attention to key legal elements, similar to how legal experts browse legal case documents. (2) SAILER employs an asymmetric encoder-decoder architecture to integrate several different pre-training objectives. In this way, rich semantic information across tasks is encoded into dense vectors. (3) SAILER has powerful discriminative ability, even without any legal annotation data. It can distinguish legal cases with different charges accurately. Extensive experiments over publicly available legal benchmarks demonstrate that our approach can significantly outperform previous state-of-the-art methods in legal case retrieval.

READ FULL TEXT
research
05/09/2023

CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding

Legal case retrieval is a critical process for modern legal information ...
research
10/11/2022

Legal Element-oriented Modeling with Multi-view Contrastive Learning for Legal Case Retrieval

Legal case retrieval, which aims to retrieve relevant cases given a quer...
research
11/10/2019

Searching for Legal Clauses by Analogy. Few-shot Semantic Retrieval Shared Task

We introduce a novel shared task for semantic retrieval from legal texts...
research
09/06/2023

Prompt-based Effective Input Reformulation for Legal Case Retrieval

Legal case retrieval plays an important role for legal practitioners to ...
research
11/15/2022

Exploiting Contrastive Learning and Numerical Evidence for Improving Confusing Legal Judgment Prediction

Given the fact description text of a legal case, legal judgment predicti...
research
12/15/2022

MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers

Dense retrieval aims to map queries and passages into low-dimensional ve...
research
01/29/2023

Diverse legal case search

In last decades, legal case search has received more and more attention....

Please sign up or login with your details

Forgot password? Click here to reset