Learning Feature Nonlinearities with Non-Convex Regularized Binned Regression

05/20/2017
by   Samet Oymak, et al.
0

For various applications, the relations between the dependent and independent variables are highly nonlinear. Consequently, for large scale complex problems, neural networks and regression trees are commonly preferred over linear models such as Lasso. This work proposes learning the feature nonlinearities by binning feature values and finding the best fit in each quantile using non-convex regularized linear regression. The algorithm first captures the dependence between neighboring quantiles by enforcing smoothness via piecewise-constant/linear approximation and then selects a sparse subset of good features. We prove that the proposed algorithm is statistically and computationally efficient. In particular, it achieves linear rate of convergence while requiring near-minimal number of samples. Evaluations on synthetic and real datasets demonstrate that algorithm is competitive with current state-of-the-art and accurately learns feature nonlinearities. Finally, we explore an interesting connection between the binning stage of our algorithm and sparse Johnson-Lindenstrauss matrices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2019

Efficient Regularized Piecewise-Linear Regression Trees

We present a detailed analysis of the class of regression decision tree ...
research
06/04/2021

Adiabatic Quantum Feature Selection for Sparse Linear Regression

Linear regression is a popular machine learning approach to learn and pr...
research
05/10/2023

Computationally Efficient and Statistically Optimal Robust High-Dimensional Linear Regression

High-dimensional linear regression under heavy-tailed noise or outlier c...
research
02/23/2017

Horseshoe Regularization for Feature Subset Selection

Feature subset selection arises in many high-dimensional applications of...
research
04/26/2017

Stochastic Orthant-Wise Limited-Memory Quasi-Newton Methods

The ℓ_1-regularized sparse model has been popular in machine learning so...
research
03/10/2021

Piecewise linear regression and classification

This paper proposes a method for solving multivariate regression and cla...
research
09/19/2022

Sparse Interaction Additive Networks via Feature Interaction Detection and Sparse Selection

There is currently a large gap in performance between the statistically ...

Please sign up or login with your details

Forgot password? Click here to reset