Binned multinomial logistic regression for integrative cell type annotation

11/23/2021
by   Keshav Motwani, et al.
0

Categorizing individual cells into one of many known cell type categories, also known as cell type annotation, is a critical step in the analysis of single-cell genomics data. The current process of annotation is time-intensive and subjective, which has led to different studies describing cell types with labels of varying degrees of resolution. While supervised learning approaches have provided automated solutions to annotation, there remains a significant challenge in fitting a unified model for multiple datasets with inconsistent labels. In this article, we propose a new multinomial logistic regression estimator which can be used to model cell type probabilities by integrating multiple datasets with labels of varying resolution. To compute our estimator, we solve a nonconvex optimization problem using a blockwise proximal gradient descent algorithm. We show through simulation studies that our approach estimates cell type probabilities more accurately than competitors in a wide variety of scenarios. We apply our method to ten single-cell RNA-seq datasets and demonstrate its utility in predicting fine resolution cell type labels on unlabeled data as well as refining cell type labels on data with existing coarse resolution annotations. An R package implementing the method is available at https://github.com/keshav-motwani/IBMR and the collection of datasets we analyze is available at https://github.com/keshav-motwani/AnnotatedPBMC.

READ FULL TEXT
research
08/29/2022

Multiresolution categorical regression for interpretable cell type annotation

In many categorical response regression applications, the response categ...
research
04/05/2023

Revolutionizing Single Cell Analysis: The Power of Large Language Models for Cell Type Annotation

In recent years, single cell RNA sequencing has become a widely used tec...
research
08/11/2022

Interpretable cytometry cell-type annotation with flow-based deep generative models

Cytometry enables precise single-cell phenotyping within heterogeneous p...
research
05/31/2019

Improving the resolution of CryoEM single particle analysis

We present a new 3D refinement method for CryoEM single particle analysi...
research
06/14/2023

Semi-supervised Cell Recognition under Point Supervision

Cell recognition is a fundamental task in digital histopathology image a...
research
05/25/2022

LingMess: Linguistically Informed Multi Expert Scorers for Coreference Resolution

While coreference resolution typically involves various linguistic chall...
research
07/18/2019

optimalFlow: Optimal-transport approach to flow cytometry gating and population matching

Data used in Flow Cytometry present pronounced variability due to biolog...

Please sign up or login with your details

Forgot password? Click here to reset