Online Updating Huber Robust Regression for Big Data Streams

09/05/2022
by Chunbai Tao, et al.

Big data has attracted great attention across many fields in recent years. When computer memory is limited, how to perform regression on big data streams while handling outliers sensibly is worth discussing. Taking this as a starting point, this article proposes an Online Updating Huber Robust Regression algorithm. By integrating Huber regression into the online updating framework, the algorithm continuously updates the estimate built from historical data using key features extracted from new data subsets, and it is robust to heavy-tailed distributions, heterogeneous errors, and outliers. The resulting Online Updating estimator is asymptotically equivalent to the Oracle estimator computed from the entire data set, and it has lower computational complexity. We also conduct simulations and a real data analysis. The experimental results show that our algorithm performs very well compared with five other algorithms in estimation accuracy and computational efficiency, and that it is feasible for real applications.
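
The abstract does not give the update formulas, so the following is only a minimal sketch of the general idea and not the authors' algorithm. It assumes a generic online-updating scheme: each incoming batch is fitted with Huber regression by iteratively reweighted least squares, only a p-by-p curvature matrix and a p-vector are kept from that batch, and the running estimate is their information-weighted combination. The names `huber_irls` and `OnlineHuber` are hypothetical, and the threshold `tau = 1.345` assumes errors with roughly unit scale.

```python
import numpy as np


def huber_irls(X, y, tau=1.345, n_iter=50, tol=1e-8):
    """Fit Huber regression on one batch by iteratively reweighted least squares.

    Returns the batch estimate and a curvature matrix (sum of x x^T over
    observations whose residual lies in the quadratic zone of the Huber loss).
    """
    beta = np.linalg.lstsq(X, y, rcond=None)[0]  # least-squares start
    for _ in range(n_iter):
        r = y - X @ beta
        abs_r = np.maximum(np.abs(r), 1e-12)
        # Huber weights: 1 inside the threshold, tau / |r| outside.
        w = np.where(abs_r <= tau, 1.0, tau / abs_r)
        XtW = X.T * w
        new_beta = np.linalg.solve(XtW @ X, XtW @ y)
        converged = np.max(np.abs(new_beta - beta)) < tol
        beta = new_beta
        if converged:
            break
    inside = (np.abs(y - X @ beta) <= tau).astype(float)
    J = (X.T * inside) @ X
    return beta, J


class OnlineHuber:
    """Keeps only summary statistics, so raw batches can be discarded."""

    def __init__(self, p, tau=1.345):
        self.tau = tau
        self.J = np.zeros((p, p))   # accumulated curvature
        self.Jb = np.zeros(p)       # accumulated curvature-weighted estimates

    def update(self, X, y):
        beta_k, J_k = huber_irls(X, y, self.tau)
        self.J += J_k
        self.Jb += J_k @ beta_k
        return self.coef

    @property
    def coef(self):
        # Information-weighted combination of all batch estimates so far.
        return np.linalg.solve(self.J, self.Jb)
```

In this sketch, memory use depends on the number of covariates p rather than on the total number of observations, which is the property the online updating framework is meant to provide. A practical version would also need to estimate the error scale before choosing `tau`.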


research
06/23/2021

Online Updating Statistics for Heterogenous Updating Regressions via Homogenization Techniques

Under the environment of big data streams, it is a common situation wher...
research
08/24/2023

An Efficient Data Analysis Method for Big Data using Multiple-Model Linear Regression

This paper introduces a new data analysis method for big data using a ne...
research
10/11/2022

Renewable Learning for Multiplicative Regression with Streaming Datasets

When large amounts of data continuously arrive in streams, online updati...
research
01/21/2021

A General Framework of Online Updating Variable Selection for Generalized Linear Models with Streaming Datasets

In the research field of big data, one of important issues is how to rec...
research
05/08/2019

Robust regression based on shrinkage estimators

A robust estimator is proposed for the parameters that characterize the ...
research
08/20/2020

Unified Rules of Renewable Weighted Sums for Various Online Updating Estimations

This paper establishes unified frameworks of renewable weighted sums (RW...
research
05/23/2020

A New Algorithm using Component-wise Adaptive Trimming For Robust Mixture Regression

Mixture regression provides a statistical model for teasing out latent h...
