A Novel Bayesian Multiple Testing Approach to Deregulated miRNA Discovery Harnessing Positional Clustering

11/10/2017
by Noirrit Kiran Chandra, et al.

MicroRNAs (miRNAs) are endogenous, small non-coding RNAs that function as regulators of gene expression. In recent years, researchers have shown tremendous and growing interest in the role of miRNAs in normal cellular processes as well as in disease processes. To investigate the role of miRNAs in oral cancer, we analyse miRNA expression levels to identify miRNAs with statistically significant differential expression in cancer tissues. In this article, we propose a novel Bayesian hierarchical model of miRNA expression data. Compelling evidence has demonstrated that the transcription process of miRNAs in the human genome is a latent process instrumental for the observed expression levels. We take the positional clustering of the miRNAs into account in the analysis and model the latent transcription phenomenon nonparametrically by an appropriate Gaussian process. For testing, we employ a novel Bayesian multiple testing method to identify the differentially expressed miRNAs that are actually statistically significant. Most existing multiple testing methods focus on the validity or admissibility of the procedure, whereas exploiting the dependence structure between the hypotheses may often yield inference much closer to the truth. Our methodology focuses mainly on utilizing the dependence structure between the hypotheses for better results, while also ensuring optimality in many respects. Indeed, our non-marginal method yielded results in accordance with the underlying scientific knowledge that the very popular Benjamini-Hochberg method missed.
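For orientation, the Benjamini-Hochberg baseline mentioned in the abstract can be sketched as follows. This is a minimal illustration of the standard step-up procedure, not the authors' non-marginal Bayesian method; the function name `benjamini_hochberg`, the `alpha` default, and the example p-values are assumptions made for this sketch. Note that the decision for each hypothesis depends only on its own p-value and its rank, which is the marginal behaviour the abstract contrasts with dependence-aware testing.

```python
import numpy as np


def benjamini_hochberg(p_values, alpha=0.05):
    """Return a boolean mask of rejected hypotheses (standard BH step-up).

    Hypothetical helper for illustration; `alpha` is the target
    false discovery rate.
    """
    p = np.asarray(p_values, dtype=float)
    m = p.size
    order = np.argsort(p)                 # indices that sort the p-values
    sorted_p = p[order]
    # BH thresholds: alpha * i / m for the i-th smallest p-value
    thresholds = alpha * np.arange(1, m + 1) / m
    below = sorted_p <= thresholds
    reject = np.zeros(m, dtype=bool)
    if below.any():
        # Step-up rule: find the largest rank whose p-value passes, then
        # reject every hypothesis with rank up to and including it.
        k = np.nonzero(below)[0].max()
        reject[order[: k + 1]] = True
    return reject


if __name__ == "__main__":
    # Made-up p-values, one per miRNA, purely for demonstration.
    example = [0.02, 0.03, 0.9]
    print(benjamini_hochberg(example, alpha=0.05))
```

Because the rule is step-up, a p-value that fails its own threshold can still be rejected when a larger p-value passes a later threshold.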

research
02/25/2018

Distributions associated with simultaneous multiple hypothesis testing

We develop the distribution of the number of hypotheses found to be stat...
research
03/06/2022

A SVM Model for Candidate Y-chromosome Gene Discovery in Prostate Cancer

Prostate cancer is widely known to be one of the most common cancers amo...
research
04/12/2013

Identifying cancer subtypes in glioblastoma by combining genomic, transcriptomic and epigenomic data

We present a nonparametric Bayesian method for disease subtype discovery...
research
06/12/2021

Fused inverse-normal method for integrated differential expression analysis of RNA-seq data

Use of next-generation sequencing technologies to transcriptomics (RNA-s...
research
10/13/2017

A deep generative model for single-cell RNA sequencing with application to detecting differentially expressed genes

We propose a probabilistic model for interpreting gene expression levels...
research
02/20/2020

A Bayesian Feature Allocation Model for Identification of Cell Subpopulations Using Cytometry Data

A Bayesian feature allocation model (FAM) is presented for identifying c...
research
09/13/2018

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

The availability of large microarray data has led to a growing interest ...
