An Empirical Study on Transfer Learning for Privilege Review

12/16/2021
by   Haozhen Zhao, et al.
0

Protecting privileged communications and data from inadvertent disclosure is a paramount task in the US legal practice. Traditionally counsels rely on keyword searching and manual review to identify privileged documents in cases. As data volumes increase, this approach becomes less and less defensible in costs. Machine learning methods have been used in identifying privilege documents. Given the generalizable nature of privilege in legal cases, we hypothesize that transfer learning can capitalize knowledge learned from existing labeled data to identify privilege documents without requiring labeling new training data. In this paper, we study both traditional machine learning models and deep learning models based on BERT for privilege document classification tasks in legal document review, and we examine the effectiveness of transfer learning in privilege model on three real world datasets with privilege labels. Our results show that BERT model outperforms the industry standard logistic regression algorithm and transfer learning models can achieve decent performance on datasets in same or close domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2020

Hierarchical models vs. transfer learning for document-level sentiment classification

Documents are composed of smaller pieces - paragraphs, sentences, and to...
research
02/09/2021

CNN Application in Detection of Privileged Documents in Legal Document Review

Protecting privileged communications and data from disclosure is paramou...
research
12/19/2019

Image Analytics for Legal Document Review: A Transfer Learning Approach

Though technology assisted review in electronic discovery has been focus...
research
02/02/2022

Detecting Privacy Requirements from User Stories with NLP Transfer Learning Models

To provide privacy-aware software systems, it is crucial to consider pri...
research
03/06/2020

Transfer Learning for Information Extraction with Limited Data

This paper presents a practical approach to fine-grained information ext...
research
06/18/2021

On Minimizing Cost in Legal Document Review Workflows

Technology-assisted review (TAR) refers to human-in-the-loop machine lea...

Please sign up or login with your details

Forgot password? Click here to reset