Building Legal Datasets

11/03/2021
by   Jerrold Soh, et al.
0

Data-centric AI calls for better, not just bigger, datasets. As data protection laws with extra-territorial reach proliferate worldwide, ensuring datasets are legal is an increasingly crucial yet overlooked component of “better”. To help dataset builders become more willing and able to navigate this complex legal space, this paper reviews key legal obligations surrounding ML datasets, examines the practical impact of data laws on ML pipelines, and offers a framework for building legal datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/10/2021

Crowdsourced Databases and Sui Generis Rights

In this study we propose a new concept of databases (crowdsourced databa...
research
09/29/2020

Legal Judgment Prediction (LJP) Amid the Advent of Autonomous AI Legal Reasoning

Legal Judgment Prediction (LJP) is a longstanding and open topic in the ...
research
11/01/2022

Should I disclose my dataset? Caveats between reproducibility and individual data rights

Natural language processing techniques have helped domain experts solve ...
research
10/04/2017

Automatic Taxonomy Generation - A Use-Case in the Legal Domain

A key challenge in the legal domain is the adaptation and representation...
research
08/30/2023

Is the U.S. Legal System Ready for AI's Challenges to Human Values?

Our interdisciplinary study investigates how effectively U.S. laws confr...
research
07/06/2021

An NLG pipeline for a legal expert system: a work in progress

We present the NLG component for L4, a prototype domain-specific languag...
research
02/21/2018

Artificial Intelligence and Legal Liability

A recent issue of a popular computing journal asked which laws would app...

Please sign up or login with your details

Forgot password? Click here to reset