Learning Theory of Distributed Regression with Bias Corrected Regularization Kernel Network

08/07/2017
by   Zhengchu Guo, et al.
0

Distributed learning is an effective way to analyze big data. In distributed regression, a typical approach is to divide the big data into multiple blocks, apply a base regression algorithm on each of them, and then simply average the output functions learnt from these blocks. Since the average process will decrease the variance, not the bias, bias correction is expected to improve the learning performance if the base regression algorithm is a biased one. Regularization kernel network is an effective and widely used method for nonlinear regression analysis. In this paper we will investigate a bias corrected version of regularization kernel network. We derive the error bounds when it is applied to a single data set and when it is applied as a base algorithm in distributed regression. We show that, under certain appropriate conditions, the optimal learning rates can be reached in both situations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2020

Optimal Rates of Distributed Regression with Imperfect Kernels

Distributed machine learning systems have been receiving increasing atte...
research
03/15/2016

Bias Correction for Regularized Regression and its Application in Learning with Streaming Data

We propose an approach to reduce the bias of ridge regression and regula...
research
05/05/2015

On the Feasibility of Distributed Kernel Regression for Big Data

In modern scientific research, massive datasets with huge numbers of obs...
research
08/11/2016

Distributed learning with regularized least squares

We study distributed learning with the least squares regularization sche...
research
11/27/2019

A race-DC in Big Data

The strategy of divide-and-combine (DC) has been widely used in the area...
research
02/10/2018

Document Classification Using Distributed Machine Learning

In this paper, we investigate the performance and success rates of Naïve...
research
08/07/2018

A distributed regression analysis application based on SAS software Part II: Cox proportional hazards regression

Previous work has demonstrated the feasibility and value of conducting d...

Please sign up or login with your details

Forgot password? Click here to reset