Simultaneous estimation of normal means with side information

08/16/2019
by   Sihai Dave Zhao, et al.
0

The integrative analysis of multiple datasets is an important strategy in data analysis. It is increasingly popular in genomics, which enjoys a wealth of publicly available datasets that can be compared, contrasted, and combined in order to extract novel scientific insights. This paper studies a stylized example of data integration for a classical statistical problem: leveraging side information to estimate a vector of normal means. This task is formulated as a compound decision problem, an oracle integrative decision rule is derived, and a data-driven estimate of this rule based on minimizing an unbiased estimate of its risk is proposed. The data-driven rule is shown to asymptotically achieve the minimum possible risk among all separable decision rules, and it can outperform existing methods in numerical properties. The proposed procedure leads naturally to an integrative high-dimensional classification procedure, which is illustrated by combining data from two independent gene expression profiling studies.

READ FULL TEXT

page 12

page 15

page 22

research
12/28/2022

The Right to be an Exception to a Data-Driven Rule

Data-driven tools are increasingly used to make consequential decisions....
research
04/30/2022

A nonparametric regression approach to asymptotically optimal estimation of normal means

Simultaneous estimation of multiple parameters has received a great deal...
research
11/26/2019

On Optimal Solutions to Compound Statistical Decision Problems

In a compound decision problem, consisting of n statistically independen...
research
05/10/2020

A nonparametric empirical Bayes approach to covariance matrix estimation

We propose an empirical Bayes method to estimate high-dimensional covari...
research
11/17/2021

Unbiased Risk Estimation in the Normal Means Problem via Coupled Bootstrap Techniques

We study a new method for estimating the risk of an arbitrary estimator ...
research
12/01/2022

Transfer Learning for High-dimensional Quantile Regression via Convolution Smoothing

This paper studies the high-dimensional quantile regression problem unde...
research
09/18/2017

Normal Integration: A Survey

The need for efficient normal integration methods is driven by several c...

Please sign up or login with your details

Forgot password? Click here to reset