Divide-and-conquer methods for big data analysis

02/22/2021
by Xueying Chen, et al.

In the context of big data analysis, the divide-and-conquer methodology refers to a multi-step process: first, splitting a data set into several smaller ones; then, analyzing each subset separately; and finally, combining the results of the separate analyses. This approach is effective for handling large data sets that cannot be analyzed in their entirety on a single computer because of limits on memory or computation time. The combined results provide statistical inference similar to that obtained from analyzing the entire data set. This article reviews some recent developments in divide-and-conquer methods in a variety of settings, including combining based on parametric, semiparametric, and nonparametric models and online sequential updating methods, among others. Theoretical developments on the efficiency of divide-and-conquer methods are also discussed.
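
The split, analyze, and combine steps described above can be illustrated with a minimal Python sketch. It is not taken from the article: ordinary least squares is assumed as the per-block analysis, the function names (`fit_block`, `combine`, `divide_and_conquer_ols`) are hypothetical, and the data are simulated. Each block estimate is weighted by that block's Gram matrix before combining.

```python
import numpy as np


def fit_block(X, y):
    """Fit ordinary least squares on one block.

    Returns the block estimate and the block Gram matrix X^T X,
    which serves as the weight in the combination step.
    """
    gram = X.T @ X
    beta = np.linalg.solve(gram, X.T @ y)
    return beta, gram


def combine(block_fits):
    """Combine block estimates, weighting each by its Gram matrix."""
    total_gram = sum(gram for _, gram in block_fits)
    weighted = sum(gram @ beta for beta, gram in block_fits)
    return np.linalg.solve(total_gram, weighted)


def divide_and_conquer_ols(X, y, n_blocks):
    """Split the rows into blocks, fit each block, then combine."""
    blocks = zip(np.array_split(X, n_blocks), np.array_split(y, n_blocks))
    return combine([fit_block(Xb, yb) for Xb, yb in blocks])


# Simulated data (illustrative only, not from the article).
rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=10_000)

beta_dc = divide_and_conquer_ols(X, y, n_blocks=10)
beta_full = np.linalg.lstsq(X, y, rcond=None)[0]  # full-data fit for comparison
```

For linear regression, this weighted combination recovers the full-data least-squares estimate up to floating-point error, because the weighted block estimates sum to the full-data cross-product X^T y. For other models, a combined estimator is generally only an approximation to the full-data one.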


