Active feature selection discovers minimal gene-sets for classifying cell-types and disease states in single-cell mRNA-seq data

06/15/2021
by   Xiaoqiao Chen, et al.
0

Sequencing costs currently prohibit the application of single cell mRNA-seq for many biological and clinical tasks of interest. Here, we introduce an active learning framework that constructs compressed gene sets that enable high accuracy classification of cell-types and physiological states while analyzing a minimal number of gene transcripts. Our active feature selection procedure constructs gene sets through an iterative cell-type classification task where misclassified cells are examined at each round to identify maximally informative genes through an `active' support vector machine (SVM) classifier. Our active SVM procedure automatically identifies gene sets that enables >90% cell-type classification accuracy in the Tabula Muris mouse tissue survey as well as a ∼ 40 gene set that enables classification of multiple myeloma patient samples with >95% accuracy. Broadly, the discovery of compact but highly informative gene sets might enable drastic reductions in sequencing requirements for applications of single-cell mRNA-seq.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2022

MarkerMap: nonlinear marker selection for single-cell studies

Single-cell RNA-seq data allow the quantification of cell type differenc...
research
08/04/2020

Detecting ulcerative colitis from colon samples using efficient feature selection and machine learning

Ulcerative colitis (UC) is one of the most common forms of inflammatory ...
research
06/13/2018

Cell Identity Codes: Understanding Cell Identity from Gene Expression Profiles using Deep Neural Networks

Understanding cell identity is an important task in many biomedical area...
research
08/31/2017

Applications of Biological Cell Models in Robotics

In this paper I present some of the most representative biological model...
research
06/23/2023

Analyzing scRNA-seq data by CCP-assisted UMAP and t-SNE

Single-cell RNA sequencing (scRNA-seq) is widely used to reveal heteroge...
research
10/01/2021

A systematic evaluation of methods for cell phenotype classification using single-cell RNA sequencing data

Background: Single-cell RNA sequencing (scRNA-seq) yields valuable insig...
research
03/13/2018

Bayesian Detection of Abnormal ADS in Mutant Caenorhabditis elegans Embryos

Cell division timing is critical for cell fate specification and morphog...

Please sign up or login with your details

Forgot password? Click here to reset