DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression

04/16/2016
by   Parameswaran Raman, et al.
0

Scaling multinomial logistic regression to datasets with very large number of data points and classes has not been trivial. This is primarily because one needs to compute the log-partition function on every data point. This makes distributing the computation hard. In this paper, we present a distributed stochastic gradient descent based optimization method (DS-MLR) for scaling up multinomial logistic regression problems to very large data. Our algorithm exploits double-separability, an attractive property we observe in the objective functions of several models in machine learning, that allows us to achieve both data as well as model parallelism simultaneously. In addition to being parallelizable, our algorithm can also easily be made asynchronous. We demonstrate the effectiveness of our method empirically on several real-world datasets, for instance a reddit dataset with data and parameter sizes of 200 GB and 300 GB respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2014

Distributed Coordinate Descent for L1-regularized Logistic Regression

Solving logistic regression with L1-regularization in distributed settin...
research
02/16/2018

Distributed Stochastic Optimization via Adaptive Stochastic Gradient Descent

Stochastic convex optimization algorithms are the most popular way to tr...
research
04/05/2023

Distributed Logistic Regression for Massive Data with Rare Events

Large-scale rare events data are commonly encountered in practice. To ta...
research
04/28/2023

Online Platt Scaling with Calibeating

We present an online post-hoc calibration method, called Online Platt Sc...
research
07/24/2017

Stochastic Gradient Descent for Relational Logistic Regression via Partial Network Crawls

Research in statistical relational learning has produced a number of met...
research
06/21/2019

Trade-offs in Large-Scale Distributed Tuplewise Estimation and Learning

The development of cluster computing frameworks has allowed practitioner...
research
06/05/2019

Data Sketching for Faster Training of Machine Learning Models

Many machine learning problems reduce to the problem of minimizing an ex...

Please sign up or login with your details

Forgot password? Click here to reset