Batch mode active learning for efficient parameter estimation

04/05/2023
by Wei Zheng, et al.

In many data analysis tasks, we observe only the explanatory variables, and evaluating the response values is quite expensive. When it is impractical or too costly to obtain the responses of all units, a natural remedy is to judiciously select a good sample of units for which the responses are to be evaluated. In this paper, we adopt the classical criteria from the design of experiments to quantify the information a given sample carries for parameter estimation. We then provide a theoretical justification for approximating the optimal sample problem by a continuous problem, for which fast algorithms with guaranteed global convergence can be developed. Our results have the following novelties: (i) the statistical efficiency of any candidate sample can be evaluated without knowing the exact optimal sample; (ii) the approach applies to a very wide class of statistical models; (iii) it can be integrated with a broad class of information criteria; (iv) it is much faster than existing algorithms; (v) a geometric interpretation is adopted to theoretically justify relaxing the original combinatorial problem to a continuous optimization problem.
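The relaxation idea can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a linear model and the D-criterion, uses the textbook multiplicative update for continuous design weights, rounds by keeping the largest weights, and bounds efficiency with the standard inequality p / max_i d_i, where d_i is the variance function at candidate i. All function names and the simulated data are hypothetical.

```python
import numpy as np

def information_matrix(X, w):
    # Weighted information matrix M(w) = sum_i w_i x_i x_i^T for a linear model.
    return X.T @ (w[:, None] * X)

def variance_function(X, w):
    # d_i = x_i^T M(w)^{-1} x_i for every candidate row x_i.
    M_inv = np.linalg.inv(information_matrix(X, w))
    return np.einsum("ij,jk,ik->i", X, M_inv, X)

def d_optimal_weights(X, n_iter=500):
    # Continuous relaxation: weights on all candidates instead of a 0/1 selection.
    # Multiplicative update w_i <- w_i * d_i / p keeps the weights summing to one.
    N, p = X.shape
    w = np.full(N, 1.0 / N)
    for _ in range(n_iter):
        w = w * variance_function(X, w) / p
    return w

def d_efficiency_lower_bound(X, w):
    # Standard bound: D-efficiency of w relative to the continuous optimum
    # is at least p / max_i d_i, so no optimal sample needs to be known.
    p = X.shape[1]
    return p / variance_function(X, w).max()

# Simulated candidate pool (assumption): intercept plus two covariates.
rng = np.random.default_rng(0)
X = np.column_stack([np.ones(200), rng.normal(size=(200, 2))])

w = d_optimal_weights(X)

# Heuristic rounding (assumption): label the n units with the largest weights.
n = 20
chosen = np.argsort(w)[-n:]
w_sample = np.zeros(len(X))
w_sample[chosen] = 1.0 / n

print("lower bound on D-efficiency of the sample:",
      d_efficiency_lower_bound(X, w_sample))
```

Because the continuous optimum is at least as informative as any exact sample of size n, the printed value also bounds the sample's efficiency relative to the best exact sample from below, which is the sense in which a candidate sample can be assessed without solving the combinatorial problem.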


research
09/17/2018

Parameter Estimation of absolute continuous four parameter Geometric Marshall-Olkin bivariate Pareto Distribution

In this paper we formulate a four parameter absolute continuous Geometri...
research
10/08/2022

Unweighted estimation based on optimal sample under measurement constraints

To tackle massive data, subsampling is a practical approach to select th...
research
04/18/2023

Bayesian D-Optimal Design of Experiments with Quantitative and Qualitative Responses

Systems with both quantitative and qualitative responses are widely enco...
research
08/18/2016

Active Learning for Approximation of Expensive Functions with Normal Distributed Output Uncertainty

When approximating a black-box function, sampling with active learning f...
research
11/03/2015

Consistent Parameter Estimation for LASSO and Approximate Message Passing

We consider the problem of recovering a vector β_o ∈R^p from n random an...
research
06/27/2012

Distributed Parameter Estimation via Pseudo-likelihood

Estimating statistical models within sensor networks requires distribute...
