Cooperative Training for Attribute-Distributed Data: Trade-off Between Data Transmission and Performance

07/29/2009
by   Haipeng Zheng, et al.
0

This paper introduces a modeling framework for distributed regression with agents/experts observing attribute-distributed data (heterogeneous data). Under this model, a new algorithm, the iterative covariance optimization algorithm (ICOA), is designed to reshape the covariance matrix of the training residuals of individual agents so that the linear combination of the individual estimators minimizes the ensemble training error. Moreover, a scheme (Minimax Protection) is designed to provide a trade-off between the number of data instances transmitted among the agents and the performance of the ensemble estimator without undermining the convergence of the algorithm. This scheme also provides an upper bound (with high probability) on the test error of the ensemble estimator. The efficacy of ICOA combined with Minimax Protection and the comparison between the upper bound and actual performance are both demonstrated by simulations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2021

An upper bound on the Universality of the Quantum Approximate Optimization Algorithm

Using lie algebra, this brief text provides an upper bound on the univer...
research
06/15/2023

Non-Asymptotic Performance of Social Machine Learning Under Limited Data

This paper studies the probability of error associated with the social m...
research
04/05/2022

Nearly minimax robust estimator of the mean vector by iterative spectral dimension reduction

We study the problem of robust estimation of the mean vector of a sub-Ga...
research
12/03/2020

Distributed Thompson Sampling

We study a cooperative multi-agent multi-armed bandits with M agents and...
research
10/14/2021

Near optimal sample complexity for matrix and tensor normal models via geodesic convexity

The matrix normal model, the family of Gaussian matrix-variate distribut...
research
02/04/2021

Improved Communication Efficiency for Distributed Mean Estimation with Side Information

In this paper, we consider the distributed mean estimation problem where...
research
11/02/2020

Coresets for Regressions with Panel Data

This paper introduces the problem of coresets for regression problems to...

Please sign up or login with your details

Forgot password? Click here to reset