Binomial Mixture Model With U-shape Constraint

07/29/2021
by   Yuting Ye, et al.
0

In this article, we study the binomial mixture model under the regime that the binomial size m can be relatively large compared to the sample size n. This project is motivated by the GeneFishing method (Liu et al., 2019), whose output is a combination of the parameter of interest and the subsampling noise. To tackle the noise in the output, we utilize the observation that the density of the output has a U shape and model the output with the binomial mixture model under a U shape constraint. We first analyze the estimation of the underlying distribution F in the binomial mixture model under various conditions for F. Equipped with these theoretical understandings, we propose a simple method Ucut to identify the cutoffs of the U shape and recover the underlying distribution based on the Grenander estimator (Grenander, 1956). It has been shown that when m = Ω(n^2/3), the identified cutoffs converge at the rate O(n^-1/3). The L_1 distance between the recovered distribution and the true one decreases at the same rate. To demonstrate the performance, we apply our method to varieties of simulation studies, a GTEX dataset used in (Liu et al., 2019) and a single cell dataset from Tabula Muris.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2020

Mixture-based estimation of entropy

The entropy is a measure of uncertainty that plays a central role in inf...
research
11/08/2020

Consistency of the MLE under a two-parameter gamma mixture model with a structural shape parameter

The finite Gamma mixture model is often used to describe randomness in i...
research
03/27/2019

Maximum Likelihood Estimation of a Semiparametric Two-component Mixture Model using Log-concave Approximation

Motivated by studies in biological sciences to detect differentially exp...
research
03/08/2019

Computer code validation via mixture model estimation

When computer codes are used for modeling complex physical systems, thei...
research
02/25/2013

On learning parametric-output HMMs

We present a novel approach for learning an HMM whose outputs are distri...
research
08/02/2018

Statistical Speech Model Description with VMF Mixture Model

In this paper, we present the LSF parameters by a unit vector form, whic...
research
02/05/2022

Beyond Black Box Densities: Parameter Learning for the Deviated Components

As we collect additional samples from a data population for which a know...

Please sign up or login with your details

Forgot password? Click here to reset