A Bayesian Feature Allocation Model for Identification of Cell Subpopulations Using Cytometry Data

02/20/2020
by Arthur Lui, et al.

A Bayesian feature allocation model (FAM) is presented for identifying cell subpopulations based on multiple samples of cell surface or intracellular marker expression level data obtained by cytometry by time of flight (CyTOF). Cell subpopulations are characterized by differences in the expression patterns of markers, and individual cells are clustered into the subpopulations based on the patterns of their observed expression levels. A finite Indian buffet process is used to model subpopulations as latent features, and a model-based method built on these latent feature subpopulations is used to construct cell clusters within each sample. Non-ignorable missing data due to technical artifacts in mass cytometry instruments are accounted for by defining a static missing data mechanism. In contrast to conventional cell clustering methods, which are based on observed marker expression levels and are applied separately to different samples, the FAM-based method can be applied simultaneously to multiple samples and can identify important cell subpopulations likely to be missed by conventional clustering. The proposed FAM-based method is applied to jointly analyze three datasets, generated by CyTOF, to study natural killer (NK) cells. Because the subpopulations identified by the FAM may define novel NK cell subsets, this statistical analysis may provide useful information about the biology of NK cells and their potential role in cancer immunotherapy, which may lead, in turn, to the development of improved cellular therapies. Simulation studies of the proposed method's behavior under two cases of known subpopulations are also presented, followed by analysis of the CyTOF NK cell surface marker data.
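To make the latent-feature idea concrete, the following is a minimal sketch of drawing a binary marker-by-subpopulation matrix from a finite beta-Bernoulli approximation to the Indian buffet process. The abstract does not give the parameterization, so the Beta(alpha/K, 1) prior, the function name `sample_finite_ibp`, and all argument names and values are assumptions chosen for illustration, not the authors' implementation.

```python
import numpy as np


def sample_finite_ibp(num_markers, num_features, alpha, rng):
    """Draw a binary matrix Z (markers x features) from a finite
    beta-Bernoulli approximation to the Indian buffet process.

    Assumed (hypothetical) parameterization:
        v[k]    ~ Beta(alpha / K, 1)
        Z[j, k] ~ Bernoulli(v[k])
    """
    # v[k]: probability that any given marker is "on" in feature k.
    v = rng.beta(alpha / num_features, 1.0, size=num_features)
    # Z[j, k] = 1 is read as "marker j is expressed in subpopulation k".
    # Broadcasting compares each column k against its own probability v[k].
    Z = (rng.random((num_markers, num_features)) < v).astype(int)
    return Z, v


# Illustrative values only: 6 markers, at most 4 latent subpopulations.
rng = np.random.default_rng(0)
Z, v = sample_finite_ibp(num_markers=6, num_features=4, alpha=2.0, rng=rng)
print(Z)
```

Each column of `Z` can be read as one candidate subpopulation's marker expression pattern. In a model of this kind, cells within a sample would then be assigned to columns according to how well their observed expression levels match each pattern.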


