High-Dimensional L_2Boosting: Rate of Convergence

02/29/2016
by   Ye Luo, et al.
0

Boosting is one of the most significant developments in machine learning. This paper studies the rate of convergence of L_2Boosting, which is tailored for regression, in a high-dimensional setting. Moreover, we introduce so-called “ post-Boosting”. This is a post-selection estimator which applies ordinary least squares to the variables selected in the first stage by L_2Boosting. Another variant is “ Orthogonal Boosting” where after each step an orthogonal projection is conducted. We show that both post-L_2Boosting and the orthogonal boosting achieve the same rate of convergence as LASSO in a sparse, high-dimensional setting. We show that the rate of convergence of the classical L_2Boosting depends on the design matrix described by a sparse eigenvalue constant. To show the latter results, we derive new approximation results for the pure greedy algorithm, based on analyzing the revisiting behavior of L_2Boosting. We also introduce feasible rules for early stopping, which can be easily implemented and used in applied work. Our results also allow a direct comparison between LASSO and boosting which has been missing from the literature. Finally, we present simulation studies and applications to illustrate the relevance of our theoretical results and to provide insights into the practical aspects of boosting. In these simulation studies, post-L_2Boosting clearly outperforms LASSO.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2022

Early stopping for L^2-boosting in high-dimensional linear models

Increasingly high-dimensional data sets require that estimation methods ...
research
02/10/2017

L_2Boosting for Economic Applications

In the recent years more and more high-dimensional data sets, where the ...
research
05/06/2015

Re-scale boosting for regression and classification

Boosting is a learning scheme that combines weak prediction rules to pro...
research
06/16/2021

Pre-processing with Orthogonal Decompositions for High-dimensional Explanatory Variables

Strong correlations between explanatory variables are problematic for hi...
research
12/13/2018

On the Differences between L2-Boosting and the Lasso

We prove that L2-Boosting lacks a theoretical property which is central ...
research
12/28/2017

Orthogonal Machine Learning for Demand Estimation: High Dimensional Causal Inference in Dynamic Panels

There has been growing interest in how economists can import machine lea...
research
01/30/2008

Recursive Bias Estimation and L_2 Boosting

This paper presents a general iterative bias correction procedure for re...

Please sign up or login with your details

Forgot password? Click here to reset