MIXALIME: MIXture models for ALlelic IMbalance Estimation in high-throughput sequencing data

06/14/2023
by   Georgy Meshcheryakov, et al.

Modern high-throughput sequencing assays efficiently capture not only gene expression and different levels of gene regulation but also a multitude of genome variants. Focused analysis of alternative alleles of variable sites at homologous chromosomes of the human genome reveals allele-specific gene expression and allele-specific gene regulation by assessing allelic imbalance of read counts at individual sites. Here we formally describe an advanced statistical framework for detecting allelic imbalance in allelic read counts at single-nucleotide variants detected in diverse omics studies (ChIP-Seq, ATAC-Seq, DNase-Seq, CAGE-Seq, and others). MIXALIME accounts for copy-number variants and aneuploidy as well as reference read mapping bias, and provides several scoring models to balance sensitivity against specificity when scoring data with varying levels of overdispersion caused by experimental noise.
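
To make "allelic imbalance of read counts" concrete, the sketch below scores a single variant with a plain exact binomial test. This is only a naive baseline and not MIXALIME's model: it ignores overdispersion and copy-number effects, which the framework described above is designed to handle. The function names and the `p_ref` parameter (the expected fraction of reference-allele reads, which could be moved away from 0.5 to mimic reference mapping bias) are assumptions made for illustration.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def allelic_imbalance_pvalue(ref_count, alt_count, p_ref=0.5):
    """Two-sided exact binomial p-value for one variant.

    Hypothetical baseline, NOT the MIXALIME scoring model:
    the null hypothesis is that each read supports the reference
    allele with probability p_ref, with no overdispersion.
    """
    n = ref_count + alt_count
    if n == 0:
        return 1.0  # no reads covering the site, no evidence of imbalance
    observed = binom_pmf(ref_count, n, p_ref)
    # Sum the probabilities of all outcomes that are no more likely
    # than the observed one (small tolerance guards against float ties).
    total = 0.0
    for k in range(n + 1):
        pk = binom_pmf(k, n, p_ref)
        if pk <= observed * (1 + 1e-9):
            total += pk
    return min(1.0, total)


if __name__ == "__main__":
    # Balanced site versus a site where all 20 reads support one allele.
    print(allelic_imbalance_pvalue(10, 10))
    print(allelic_imbalance_pvalue(20, 0))
```

On real data, read counts are typically more variable than a binomial allows, so a test like this would tend to overstate significance; that gap is the motivation for the overdispersion-aware scoring models mentioned in the abstract.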


