BLAZE: Blazing Fast Privacy-Preserving Machine Learning

04/06/2020
by Ajith Suresh, et al.

Machine learning tools have demonstrated their potential in many significant sectors, such as healthcare and finance, to aid in deriving useful inferences. The sensitive and confidential nature of the data in such sectors raises natural concerns about data privacy. This motivated the area of Privacy-preserving Machine Learning (PPML), where privacy of the data is guaranteed. Typically, ML techniques require large computing power, which leads clients with limited infrastructure to rely on Secure Outsourced Computation (SOC). In the SOC setting, the computation is outsourced to a set of specialized and powerful cloud servers, and the service is offered on a pay-per-use basis. In this work, we explore PPML techniques in the SOC setting for widely used ML algorithms: Linear Regression, Logistic Regression, and Neural Networks. We propose BLAZE, a blazing fast PPML framework in the three-server setting tolerating one malicious corruption over a ring (Z_{2^ℓ}). BLAZE achieves the stronger security guarantee of fairness (all honest servers get the output whenever the corrupt server obtains the same). Leveraging an input-independent preprocessing phase, BLAZE has a fast input-dependent online phase relying on efficient PPML primitives such as: (i) a dot product protocol for which the communication in the online phase is independent of the vector size, the first of its kind in the three-server setting; (ii) a method for truncation that avoids evaluating an expensive circuit for Ripple Carry Adders (RCA) and achieves a constant round complexity. This improves over the truncation method of ABY3 (Mohassel et al., CCS 2018), which uses RCA and has a round complexity of the order of the depth of the RCA (which is the same as the underlying ring size in bits). An extensive benchmarking of BLAZE for the aforementioned ML algorithms over a 64-bit ring in both WAN and LAN settings shows massive improvements over ABY3.
Concretely, we observe improvements of up to 333× for Linear Regression, 53× for Logistic Regression, and 276× for Neural Networks over WAN. Similarly, we show improvements of up to 2610× for Linear Regression, 54× for Logistic Regression, and 278× for Neural Networks over LAN.
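
To give some intuition for why a dot product over Z_{2^ℓ} can have communication independent of the vector size, here is a minimal Python sketch. It is not the BLAZE protocol: it uses plain semi-honest replicated secret sharing among three parties, with no malicious security, no fairness, and no preprocessing phase. The function names (`share`, `share_vector`, `local_dot`, `secure_dot`) are hypothetical and chosen only for this illustration. Each party sums all of its local cross terms into a single ring element, so it sends one element however long the vectors are.

```python
import secrets

ELL = 64            # ring bit length, matching the 64-bit ring in the benchmarks
MOD = 1 << ELL      # arithmetic is over Z_{2^ELL}


def share(x):
    """Split x into three additive shares; party i holds the pair (s[i], s[i+1])."""
    s0 = secrets.randbelow(MOD)
    s1 = secrets.randbelow(MOD)
    s2 = (x - s0 - s1) % MOD
    s = [s0, s1, s2]
    return [(s[i], s[(i + 1) % 3]) for i in range(3)]


def share_vector(v):
    """Share every element of v; returns one list of share pairs per party."""
    per_elem = [share(x) for x in v]
    return [[elem[i] for elem in per_elem] for i in range(3)]


def local_dot(xs, ys, mask):
    """One party's local work: fold all cross terms into ONE ring element."""
    acc = 0
    for (a0, a1), (b0, b1) in zip(xs, ys):
        acc += a0 * b0 + a0 * b1 + a1 * b0
    return (acc + mask) % MOD


def secure_dot(x, y):
    """Illustrative (assumed, semi-honest) dot product of secret-shared vectors."""
    X, Y = share_vector(x), share_vector(y)
    # Masks that sum to zero hide each party's individual contribution.
    r = [secrets.randbelow(MOD) for _ in range(3)]
    masks = [(r[i] - r[(i + 1) % 3]) % MOD for i in range(3)]
    # Each party produces exactly one ring element to communicate.
    messages = [local_dot(X[i], Y[i], masks[i]) for i in range(3)]
    # For the demo we simply add the messages to reconstruct the result.
    return sum(messages) % MOD, len(messages)


if __name__ == "__main__":
    result, sent = secure_dot([1, 2, 3], [4, 5, 6])
    print(result, sent)
```

In this sketch the number of ring elements exchanged stays at three (one per party) whether the vectors have 3 entries or 3,000. Achieving the same property with security against a malicious server is the contribution the abstract describes, and it requires machinery not shown here.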

12/05/2019

Trident: Efficient 4PC Framework for Privacy Preserving Machine Learning

Machine learning has started to be deployed in fields such as healthcare...
05/29/2020

SWIFT: Super-fast and Robust Privacy-Preserving Machine Learning

Performing ML computation on private data while maintaining data privacy...
12/05/2019

ASTRA: High Throughput 3PC over Rings with Application to Secure Prediction

The concrete efficiency of secure computation has been the focus of many...
09/16/2019

VeriML: Enabling Integrity Assurances and Fair Payments for Machine Learning as a Service

Machine Learning as a Service (MLaaS) allows clients with limited resour...
10/04/2019

PINFER: Privacy-Preserving Inference for Machine Learning

The foreseen growing role of outsourced machine learning services is rai...
01/07/2021

Machine Learning on Cloud with Blockchain: A Secure, Verifiable and Fair Approach to Outsource the Linear Regression

Linear Regression (LR) is a classical machine learning algorithm which h...
03/22/2022

SPRITE: A Scalable Privacy-Preserving and Verifiable Collaborative Learning for Industrial IoT

Recently collaborative learning is widely applied to model sensitive dat...