MixTwice: large-scale hypothesis testing for peptide arrays by variance mixing

11/15/2020 ∙ by Zihao Zheng, et al. ∙ 0

Peptide microarrays have emerged as a powerful technology in immunoproteomics as they provide a tool to measure the abundance of different antibodies in patient serum samples. The high dimensionality and small sample size of many experiments challenge conventional statistical approaches, including those aiming to control the false discovery rate (FDR). Motivated by limitations in reproducibility and power of current methods, we advance an empirical Bayesian tool that computes local false discovery rate statistics and local false sign rate statistics when provided with data on estimated effects and estimated standard errors from all the measured peptides. As the name suggests, the \verb+MixTwice+ tool involves the estimation of two mixing distributions, one on underlying effects and one on underlying variance parameters. Constrained optimization techniques provide for model fitting of mixing distributions under weak shape constraints (unimodality of the effect distribution). Numerical experiments show that \verb+MixTwice+ can accurately estimate generative parameters and powerfully identify non-null peptides. In a peptide array study of rheumatoid arthritis (RA), \verb+MixTwice+ recovers meaningful peptide markers in one case where the signal is weak, and has strong reproducibility properties in one case where the signal is strong. \verb+MixTwice+ is available as an R software package. \href{https://github.com/wiscstatman/MixTwice}{https://github.com/wiscstatman/MixTwice}



There are no comments yet.


page 12

page 14

page 17

page 20

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.