Histogram Transform Ensembles for Large-scale Regression

12/08/2019
by   Hanyuan Hang, et al.
0

We propose a novel algorithm for large-scale regression problems named histogram transform ensembles (HTE), composed of random rotations, stretchings, and translations. First of all, we investigate the theoretical properties of HTE when the regression function lies in the Hölder space C^k,α, k ∈N_0, α∈ (0,1]. In the case that k=0, 1, we adopt the constant regressors and develop the naïve histogram transforms (NHT). Within the space C^0,α, although almost optimal convergence rates can be derived for both single and ensemble NHT, we fail to show the benefits of ensembles over single estimators theoretically. In contrast, in the subspace C^1,α, we prove that if d ≥ 2(1+α)/α, the lower bound of the convergence rates for single NHT turns out to be worse than the upper bound of the convergence rates for ensemble NHT. In the other case when k ≥ 2, the NHT may no longer be appropriate in predicting smoother regression functions. Instead, we apply kernel histogram transforms (KHT) equipped with smoother regressors such as support vector machines (SVMs), and it turns out that both single and ensemble KHT enjoy almost optimal convergence rates. Then we validate the above theoretical results by numerical experiments. On the one hand, simulations are conducted to elucidate that ensemble NHT outperform single NHT. On the other hand, the effects of bin sizes on accuracy of both NHT and KHT also accord with theoretical analysis. Last but not least, in the real-data experiments, comparisons between the ensemble KHT, equipped with adaptive histogram transforms, and other state-of-the-art large-scale regression estimators verify the effectiveness and accuracy of our algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2019

Histogram Transform Ensembles for Density Estimation

We investigate an algorithm named histogram transform ensembles (HTE) de...
research
12/05/2021

Local Adaptivity of Gradient Boosting in Histogram Transform Ensemble Learning

In this paper, we propose a gradient boosting algorithm called adaptive ...
research
06/03/2021

Gradient Boosted Binary Histogram Ensemble for Large-scale Regression

In this paper, we propose a gradient boosting algorithm for large-scale ...
research
06/10/2021

GBHT: Gradient Boosting Histogram Transform for Density Estimation

In this paper, we propose a density estimation algorithm called Gradient...
research
01/18/2011

Convergence rates of efficient global optimization algorithms

Efficient global optimization is the problem of minimizing an unknown fu...
research
06/03/2020

Tectonic environments of South American porphyry copper magmatism through time revealed by spatiotemporal data mining

Porphyry ore deposits are known to be associated with arc magmatism on t...
research
04/01/2017

Stochastic L-BFGS: Improved Convergence Rates and Practical Acceleration Strategies

We revisit the stochastic limited-memory BFGS (L-BFGS) algorithm. By pro...

Please sign up or login with your details

Forgot password? Click here to reset