An Efficient Data Analysis Method for Big Data using Multiple-Model Linear Regression

08/24/2023
by   Bohan Lyu, et al.
0

This paper introduces a new data analysis method for big data using a newly defined regression model named multiple model linear regression(MMLR), which separates input datasets into subsets and construct local linear regression models of them. The proposed data analysis method is shown to be more efficient and flexible than other regression based methods. This paper also proposes an approximate algorithm to construct MMLR models based on (ϵ,δ)-estimator, and gives mathematical proofs of the correctness and efficiency of MMLR algorithm, of which the time complexity is linear with respect to the size of input datasets. This paper also empirically implements the method on both synthetic and real-world datasets, the algorithm shows to have comparable performance to existing regression methods in many cases, while it takes almost the shortest time to provide a high prediction accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2021

Orthogonal Subsampling for Big Data Linear Regression

The dramatic growth of big datasets presents a new challenge to data sto...
research
11/02/2020

Coresets for Regressions with Panel Data

This paper introduces the problem of coresets for regression problems to...
research
09/05/2022

Online Updating Huber Robust Regression for Big Data Streams

Big data has grasped great attention in different fields over recent yea...
research
06/07/2020

Sources of high leverage in linear regression model

Some reasons for high leverage are analytically investigated by decompos...
research
07/06/2023

OLR-WA Online Regression with Weighted Average

Machine Learning requires a large amount of training data in order to bu...
research
05/01/2021

Commercials Sales Prediction Using Multiple Linear Regression

Commercials have always been one of the most important medium for a comp...
research
02/09/2018

Large Scale Constrained Linear Regression Revisited: Faster Algorithms via Preconditioning

In this paper, we revisit the large-scale constrained linear regression ...

Please sign up or login with your details

Forgot password? Click here to reset