Almost Linear Constant-Factor Sketching for ℓ_1 and Logistic Regression

03/31/2023
by   Alexander Munteanu, et al.
0

We improve upon previous oblivious sketching and turnstile streaming results for ℓ_1 and logistic regression, giving a much smaller sketching dimension achieving O(1)-approximation and yielding an efficient optimization problem in the sketch space. Namely, we achieve for any constant c>0 a sketching dimension of Õ(d^1+c) for ℓ_1 regression and Õ(μ d^1+c) for logistic regression, where μ is a standard measure that captures the complexity of compressing the data. For ℓ_1-regression our sketching dimension is near-linear and improves previous work which either required Ω(log d)-approximation with this sketching dimension, or required a larger poly(d) number of rows. Similarly, for logistic regression previous work had worse poly(μ d) factors in its sketching dimension. We also give a tradeoff that yields a 1+ε approximation in input sparsity time by increasing the total size to (dlog(n)/ε)^O(1/ε) for ℓ_1 and to (μ dlog(n)/ε)^O(1/ε) for logistic regression. Finally, we show that our sketch can be extended to approximate a regularized version of logistic regression where the data-dependent regularizer corresponds to the variance of the individual logistic losses.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2021

Oblivious sketching for logistic regression

What guarantees are possible for solving logistic regression in one pass...
research
02/26/2019

Logarithmic Regret for parameter-free Online Logistic Regression

We consider online optimization procedures in the context of logistic re...
research
03/24/2023

Feature Space Sketching for Logistic Regression

We present novel bounds for coreset construction, feature selection, and...
research
08/07/2018

A distributed regression analysis application based on SAS software. Part I: Linear and logistic regression

Previous work has demonstrated the feasibility and value of conducting d...
research
04/13/2021

Modeling the dynamics of language change: logistic regression, Piotrowski's law, and a handful of examples in Polish

The study discusses modeling diachronic processes by logistic regression...
research
05/27/2019

On approximating dropout noise injection

This paper examines the assumptions of the derived equivalence between d...
research
05/24/2023

Optimal subsampling for large scale Elastic-net regression

Datasets with sheer volume have been generated from fields including com...

Please sign up or login with your details

Forgot password? Click here to reset