Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions

06/14/2023
by   Abdulrahman Diaa, et al.
0

Machine Learning as a Service (MLaaS) is an increasingly popular design where a company with abundant computing resources trains a deep neural network and offers query access for tasks like image classification. The challenge with this design is that MLaaS requires the client to reveal their potentially sensitive queries to the company hosting the model. Multi-party computation (MPC) protects the client's data by allowing encrypted inferences. However, current approaches suffer prohibitively large inference times. The inference time bottleneck in MPC is the evaluation of non-linear layers such as ReLU activation functions. Motivated by the success of previous work co-designing machine learning and MPC aspects, we develop an activation function co-design. We replace all ReLUs with a polynomial approximation and evaluate them with single-round MPC protocols, which give state-of-the-art inference times in wide-area networks. Furthermore, to address the accuracy issues previously encountered with polynomial activations, we propose a novel training algorithm that gives accuracy competitive with plaintext models. Our evaluation shows between 4 and 90× speedups in inference time on large models with up to 23 million parameters while maintaining competitive inference accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2022

AESPA: Accuracy Preserving Low-degree Polynomial Activation for Fast Private Inference

Hybrid private inference (PI) protocol, which synergistically utilizes b...
research
09/09/2023

Compact: Approximating Complex Activation Functions for Secure Computation

Secure multi-party computation (MPC) techniques can be used to provide d...
research
07/26/2021

Sisyphus: A Cautionary Tale of Using Low-Degree Polynomial Activations in Privacy-Preserving Deep Learning

Privacy concerns in client-server machine learning have given rise to pr...
research
06/01/2019

SHE: A Fast and Accurate Privacy-Preserving Deep Neural Network Via Leveled TFHE and Logarithmic Data Representation

Homomorphic Encryption (HE) is one of the most promising security soluti...
research
02/13/2023

DASH: Accelerating Distributed Private Machine Learning Inference with Arithmetic Garbled Circuits

The adoption of machine learning solutions is rapidly increasing across ...
research
01/28/2021

S++: A Fast and Deployable Secure-Computation Framework for Privacy-Preserving Neural Network Training

We introduce S++, a simple, robust, and deployable framework for trainin...
research
08/12/2019

nGraph-HE2: A High-Throughput Framework for Neural Network Inference on Encrypted Data

In previous work, Boemer et al. introduced nGraph-HE, an extension to th...

Please sign up or login with your details

Forgot password? Click here to reset