IPProtect: protecting the intellectual property of visual datasets during data valuation

12/22/2022
by Gursimran Singh, et al.

Data trading is essential to accelerating the development of data-driven machine learning pipelines. The central problem in data trading is data valuation: estimating the utility of a seller's dataset with respect to a given buyer's machine learning task. Typically, data valuation requires one or more participants to share their raw dataset with others, which creates a risk of intellectual property (IP) violations. In this paper, we tackle the novel task of preemptively protecting the IP of datasets that need to be shared during data valuation. First, we identify and formalize two kinds of novel IP risks in visual datasets: data-item (image) IP and statistical (dataset) IP. Then, we propose a novel algorithm that converts the raw dataset into a sanitized version that resists IP violations while still allowing accurate data valuation. The key idea is to limit the transfer of information from the raw dataset to the sanitized dataset, thereby protecting against potential IP violations. Next, we analyze our method for the likely existence of a solution and for immunity against reconstruction attacks. Finally, we conduct extensive experiments on three computer vision datasets, demonstrating the advantages of our method over other baselines.
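
To make the notion of data valuation concrete, here is a minimal sketch of one common way to score a seller's dataset: train a model on the seller's data and measure its accuracy on the buyer's validation data. This is not the method proposed in the paper. The nearest-centroid classifier, the function names (`fit_centroids`, `predict`, `utility`), and the toy feature vectors are all assumptions chosen for illustration. Note that this naive approach needs both raw datasets in one place, which is exactly the sharing that the abstract identifies as an IP risk.

```python
from collections import defaultdict


def fit_centroids(features, labels):
    """Compute the mean feature vector of each class (hypothetical toy model)."""
    sums = {}
    counts = defaultdict(int)
    for x, y in zip(features, labels):
        if y not in sums:
            sums[y] = [0.0] * len(x)
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}


def predict(centroids, x):
    """Return the label of the centroid closest to x (squared Euclidean distance)."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda y: sq_dist(centroids[y]))


def utility(seller_x, seller_y, buyer_x, buyer_y):
    """Assumed utility: buyer-side accuracy of a model trained on seller data."""
    centroids = fit_centroids(seller_x, seller_y)
    correct = sum(predict(centroids, x) == y for x, y in zip(buyer_x, buyer_y))
    return correct / len(buyer_y)


# Toy 2-D "features" standing in for images; all values are made up.
seller_a = ([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.1]], [0, 0, 1, 1])
seller_b = ([[0.0, 0.0], [0.1, 0.1], [0.2, 0.0]], [0, 0, 0])  # covers one class only
buyer = ([[0.1, 0.0], [1.0, 0.9], [0.8, 1.0], [0.0, 0.2]], [0, 1, 1, 0])

score_a = utility(*seller_a, *buyer)
score_b = utility(*seller_b, *buyer)
print(f"seller A utility: {score_a:.2f}")
print(f"seller B utility: {score_b:.2f}")
```

In this toy setup, seller A's dataset covers both of the buyer's classes and scores higher than seller B's single-class dataset, which is the kind of ranking a buyer would want from a valuation step.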


