Conditional Sparse ℓ_p-norm Regression With Optimal Probability

06/26/2018
by   John Hainline, et al.
0

We consider the following conditional linear regression problem: the task is to identify both (i) a k-DNF condition c and (ii) a linear rule f such that the probability of c is (approximately) at least some given bound μ, and f minimizes the ℓ_p loss of predicting the target z in the distribution of examples conditioned on c. Thus, the task is to identify a portion of the distribution on which a linear rule can provide a good fit. Algorithms for this task are useful in cases where simple, learnable rules only accurately model portions of the distribution. The prior state-of-the-art for such algorithms could only guarantee finding a condition of probability Ω(μ/n^k) when a condition of probability μ exists, and achieved an O(n^k)-approximation to the target loss, where n is the number of Boolean attributes. Here, we give efficient algorithms for solving this task with a condition c that nearly matches the probability of the ideal condition, while also improving the approximation to the target loss. We also give an algorithm for finding a k-DNF reference class for prediction at a given query point, that obtains a sparse regression fit that has loss within O(n^k) of optimal among all sparse regression parameters and sufficiently large k-DNF reference classes containing the query point.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2021

Conditional Linear Regression for Heterogeneous Covariances

Often machine learning and statistical models will attempt to describe t...
research
08/18/2016

Conditional Sparse Linear Regression

Machine learning and statistics typically focus on building models that ...
research
06/06/2018

Conditional Linear Regression

Work in machine learning and statistics commonly focuses on building mod...
research
06/29/2022

Hardness and Algorithms for Robust and Sparse Optimization

We explore algorithms and limitations for sparse optimization problems s...
research
08/01/2019

Sparse Regression via Range Counting

The sparse regression problem, also known as best subset selection probl...
research
06/07/2019

Approximately Strategyproof Tournament Rules: On Large Manipulating Sets and Cover-Consistence

We consider the manipulability of tournament rules, in which n teams pla...
research
02/24/2021

HiPaR: Hierarchical Pattern-aided Regression

We introduce HiPaR, a novel pattern-aided regression method for tabular ...

Please sign up or login with your details

Forgot password? Click here to reset