Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets

02/26/2019
by   Homayun Afrabandpey, et al.
8

Learning predictive models from small high-dimensional data sets is a key problem in high-dimensional statistics. Expert knowledge elicitation can help, and a strong line of work focuses on directly eliciting informative prior distributions for parameters. This either requires considerable statistical expertise or is laborious, as the emphasis has been on accuracy and not on efficiency of the process. Another line of work queries about importance of features one at a time, assuming them to be independent and hence missing covariance information. In contrast, we propose eliciting expert knowledge about pairwise feature similarities, to borrow statistical strength in the predictions, and using sequential decision making techniques to minimize the effort of the expert. Empirical results demonstrate improvement in predictive performance on both simulated and real data, in high-dimensional linear regression tasks, where we learn the covariance structure with a Gaussian process, based on sequential elicitation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2016

Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Prediction in a small-sized sample with a large number of covariates, th...
research
01/22/2019

A Fast Iterative Algorithm for High-dimensional Differential Network

Differential network is an important tool to capture the changes of cond...
research
08/26/2022

High-dimensional sparse vine copula regression with application to genomic prediction

High-dimensional data sets are often available in genome-enabled predict...
research
11/08/2017

Learning Credible Models

In many settings, it is important that a model be capable of providing r...
research
09/15/2019

Approximating posteriors with high-dimensional nuisance parameters via integrated rotated Gaussian approximation

Posterior computation for high-dimensional data with many parameters can...
research
12/09/2019

Expert-guided Regularization via Distance Metric Learning

High-dimensional prediction is a challenging problem setting for traditi...
research
11/03/2020

Graph Enhanced High Dimensional Kernel Regression

In this paper, the flexibility, versatility and predictive power of kern...

Please sign up or login with your details

Forgot password? Click here to reset