Nonparametric Empirical Bayes Estimation and Testing for Sparse and Heteroscedastic Signals

06/16/2021
by   Junhui Cai, et al.
0

Large-scale modern data often involves estimation and testing for high-dimensional unknown parameters. It is desirable to identify the sparse signals, “the needles in the haystack”, with accuracy and false discovery control. However, the unprecedented complexity and heterogeneity in modern data structure require new machine learning tools to effectively exploit commonalities and to robustly adjust for both sparsity and heterogeneity. In addition, estimates for high-dimensional parameters often lack uncertainty quantification. In this paper, we propose a novel Spike-and-Nonparametric mixture prior (SNP) – a spike to promote the sparsity and a nonparametric structure to capture signals. In contrast to the state-of-the-art methods, the proposed methods solve the estimation and testing problem at once with several merits: 1) an accurate sparsity estimation; 2) point estimates with shrinkage/soft-thresholding property; 3) credible intervals for uncertainty quantification; 4) an optimal multiple testing procedure that controls false discovery rate. Our method exhibits promising empirical performance on both simulated data and a gene expression case study.

READ FULL TEXT

page 15

page 18

page 20

page 22

research
07/12/2023

Empirical Bayes large-scale multiple testing for high-dimensional sparse binary sequences

This paper investigates the multiple testing problem for high-dimensiona...
research
01/30/2023

Convergence of uncertainty estimates in Ensemble and Bayesian sparse model discovery

Sparse model identification enables nonlinear dynamical system discovery...
research
08/29/2018

On spike and slab empirical Bayes multiple testing

This paper explores a connection between empirical Bayes posterior distr...
research
10/09/2022

A Locally Adaptive Shrinkage Approach to False Selection Rate Control in High-Dimensional Classification

The uncertainty quantification and error control of classifiers are cruc...
research
05/18/2020

B-CONCORD – A scalable Bayesian high-dimensional precision matrix estimation procedure

Sparse estimation of the precision matrix under high-dimensional scaling...
research
02/01/2021

Empirical Bayes cumulative ℓ-value multiple testing procedure for sparse sequences

In the sparse sequence model, we consider a popular Bayesian multiple te...
research
11/18/2022

Robust oracle estimation and uncertainty quantification for possibly sparse quantiles

A general many quantiles + noise model is studied in the robust formulat...

Please sign up or login with your details

Forgot password? Click here to reset