Private Inner Product Retrieval for Distributed Machine Learning

02/17/2019
by   Mohammad Hossein Mousavi, et al.
0

In this paper, we argue that in many basic algorithms for machine learning, including support vector machine (SVM) for classification, principal component analysis (PCA) for dimensionality reduction, and regression for dependency estimation, we need the inner products of the data samples, rather than the data samples themselves. Motivated by the above observation, we introduce the problem of private inner product retrieval for distributed machine learning, where we have a system including a database of some files, duplicated across some non-colluding servers. A user intends to retrieve a subset of specific size of the inner products of the data files with minimum communication load, without revealing any information about the identity of the requested subset. For achievability, we use the algorithms for multi-message private information retrieval. For converse, we establish that as the length of the files becomes large, the set of all inner products converges to independent random variables with uniform distribution, and derive the rate of convergence. To prove that, we construct special dependencies among sequences of the sets of all inner products with different length, which forms a time-homogeneous irreducible Markov chain, without affecting the marginal distribution. We show that this Markov chain has a uniform distribution as its unique stationary distribution, with rate of convergence dominated by the second largest eigenvalue of the transition probability matrix. This allows us to develop a converse, which converges to a tight bound in some cases, as the size of the files becomes large. While this converse is based on the one in multi-message private information retrieval, due to the nature of retrieving inner products instead of data itself some changes are made to reach the desired result.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2018

Multi-Message Private Information Retrieval with Private Side Information

We consider the problem of private information retrieval (PIR) where a s...
research
03/29/2019

Capacity of Quantum Private Information Retrieval with Collusion of All But One of Servers

Quantum private information retrieval (QPIR) is the problem to retrieve ...
research
11/07/2018

Private Information Retrieval Schemes with Regenerating Codes

A private information retrieval (PIR) scheme allows a user to retrieve a...
research
02/04/2023

k-server Byzantine-Resistant PIR Scheme with Optimal Download Rate and Optimal File Size

We consider the problem of designing a Private Information Retrieval (PI...
research
03/30/2020

Semantic Private Information Retrieval

We investigate the problem of semantic private information retrieval (se...
research
10/16/2022

New Secure Sparse Inner Product with Applications to Machine Learning

Sparse inner product (SIP) has the attractive property of overhead being...
research
02/02/2022

On Linear Separability under Linear Compression with Applications to Hard Support Vector Machine

This paper investigates the theoretical problem of maintaining linear se...

Please sign up or login with your details

Forgot password? Click here to reset