Distributed Sketching Methods for Privacy Preserving Regression

02/16/2020
by   Burak Bartan, et al.
0

In this work, we study distributed sketching methods for large scale regression problems. We leverage multiple randomized sketches for reducing the problem dimensions as well as preserving privacy and improving straggler resilience in asynchronous distributed systems. We derive novel approximation guarantees for classical sketching methods and analyze the accuracy of parameter averaging for distributed sketches. We consider random matrices including Gaussian, randomized Hadamard, uniform sampling and leverage score sampling in the distributed setting. Moreover, we propose a hybrid approach combining sampling and fast random projections for better computational efficiency. We illustrate the performance of distributed sketches in a serverless computing platform with large scale experiments.

READ FULL TEXT

Authors

page 1

page 2

page 3

page 4

03/18/2022

Distributed Sketching for Randomized Optimization: Exact Characterization, Concentration and Lower Bounds

We consider distributed optimization methods for problems where forming ...
10/14/2018

A New Theory for Sketching in Linear Regression

Large datasets create opportunities as well as analytic challenges. A re...
01/21/2022

Orthonormal Sketches for Secure Coded Regression

In this work, we propose a method for speeding up linear regression dist...
02/16/2020

Distributed Averaging Methods for Randomized Second Order Optimization

We consider distributed optimization problems where forming the Hessian ...
08/07/2018

A distributed regression analysis application based on SAS software Part II: Cox proportional hazards regression

Previous work has demonstrated the feasibility and value of conducting d...
04/07/2019

An Asynchronous, Decentralized Solution Framework for the Large Scale Unit Commitment Problem

With increased reliance on cyber infrastructure, large scale power netwo...
02/10/2015

Implementing Randomized Matrix Algorithms in Parallel and Distributed Environments

In this era of large-scale data, distributed systems built on top of clu...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.