Interpretable Single-Cell Set Classification with Kernel Mean Embeddings

01/18/2022
by   Siyuan Shan, et al.
0

Modern single-cell flow and mass cytometry technologies measure the expression of several proteins of the individual cells within a blood or tissue sample. Each profiled biological sample is thus represented by a set of hundreds of thousands of multidimensional cell feature vectors, which incurs a high computational cost to predict each biological sample's associated phenotype with machine learning models. Such a large set cardinality also limits the interpretability of machine learning models due to the difficulty in tracking how each individual cell influences the ultimate prediction. Using Kernel Mean Embedding to encode the cellular landscape of each profiled biological sample, we can train a simple linear classifier and achieve state-of-the-art classification accuracy on 3 flow and mass cytometry datasets. Our model contains few parameters but still performs similarly to deep learning models with millions of parameters. In contrast with deep learning approaches, the linearity and sub-selection step of our model make it easy to interpret classification results. Clustering analysis further shows that our method admits rich biological interpretability for linking cellular heterogeneity to clinical phenotype.

READ FULL TEXT

page 11

page 13

research
06/30/2022

Distribution-based Sketching of Single-Cell Samples

Modern high-throughput single-cell immune profiling technologies, such a...
research
02/14/2020

Disease State Prediction From Single-Cell Data Using Graph Attention Networks

Single-cell RNA sequencing (scRNA-seq) has revolutionized biological dis...
research
09/15/2023

MIML: Multiplex Image Machine Learning for High Precision Cell Classification via Mechanical Traits within Microfluidic Systems

Label-free cell classification is advantageous for supplying pristine ce...
research
12/18/2019

Cluster Analysis of High-Dimensional scRNA Sequencing Data

With ongoing developments and innovations in single-cell RNA sequencing ...
research
08/07/2018

Capturing global spatial context for accurate cell classification in skin cancer histology

The spectacular response observed in clinical trials of immunotherapy in...
research
03/02/2022

Machine learning based lens-free imaging technique for field-portable cytometry

Lens-free Shadow Imaging Technique (LSIT) is a well-established techniqu...

Please sign up or login with your details

Forgot password? Click here to reset