Minimax Optimality of Sign Test for Paired Heterogeneous Data

01/11/2018
by   Martin J. Zhang, et al.
0

Comparing two groups under different conditions is ubiquitous in the biomedical sciences. In many cases, samples from the two groups can be naturally paired; for example a pair of samples may come from the same individual under the two conditions. However samples across different individuals may be highly heterogeneous. Traditional methods often ignore such heterogeneity by assuming the samples are identically distributed. In this work, we study the problem of comparing paired heterogeneous data by modeling the data as Gaussian distributed with different parameters across the samples. We show that in the minimax setting where we want to maximize the worst-case power, the sign test, which only uses the signs of the differences between the paired sample, is optimal in the one-sided case and near optimal in the two-sided case. The superiority of the sign test over other popular tests for paired heterogeneous data is demonstrated using both synthetic data and a real-world RNA-Seq dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2021

Weighted Mean Difference Statistics for Paired Data in Presence of Missing Values

Missing data is a common issue in many biomedical studies. Under a paire...
research
05/26/2022

Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification

While a broad range of techniques have been proposed to tackle distribut...
research
10/10/2022

PoGaIN: Poisson-Gaussian Image Noise Modeling from Paired Samples

Image noise can often be accurately fitted to a Poisson-Gaussian distrib...
research
06/20/2022

Beyond IID: data-driven decision-making in heterogeneous environments

In this work, we study data-driven decision-making and depart from the c...
research
02/21/2023

Density Ratio Estimation and Neyman Pearson Classification with Missing Data

Density Ratio Estimation (DRE) is an important machine learning techniqu...
research
11/22/2022

Optimal design of the Wilcoxon-Mann-Whitney-test

In scientific research, many hypotheses relate to the comparison of two ...
research
01/10/2023

Matching calipers and the precision of index estimation

This paper characterizes the precision of index estimation as it carries...

Please sign up or login with your details

Forgot password? Click here to reset