Factorized linear discriminant analysis for phenotype-guided representation learning of neuronal gene expression data

10/05/2020
by   Mu Qiao, et al.
0

A central goal in neurobiology is to relate the expression of genes to the structural and functional properties of neuronal types, collectively called their phenotypes. Single-cell RNA sequencing can measure the expression of thousands of genes in thousands of neurons. How to interpret the data in the context of neuronal phenotypes? We propose a supervised learning approach that factorizes the gene expression data into components corresponding to individual phenotypic characteristics and their interactions. This new method, which we call factorized linear discriminant analysis (FLDA), seeks a linear transformation of gene expressions that varies highly with only one phenotypic factor and minimally with the others. We further leverage our approach with a sparsity-based regularization algorithm, which selects a few genes important to a specific phenotypic feature or feature combination. We applied this approach to a single-cell RNA-Seq dataset of Drosophila T4/T5 neurons, focusing on their dendritic and axonal phenotypes. The analysis confirms results from the previous report but also points to new genes related to the phenotypes and an intriguing hierarchy in the genetic organization of these cells.

READ FULL TEXT
research
02/04/2016

Discovering Neuronal Cell Types and Their Gene Expression Profiles Using a Spatial Point Process Mixture Model

Cataloging the neuronal cell types that comprise circuitry of individual...
research
02/21/2019

A Nonparametric Multi-view Model for Estimating Cell Type-Specific Gene Regulatory Networks

We present a Bayesian hierarchical multi-view mixture model termed Symph...
research
10/25/2022

A single-cell gene expression language model

Gene regulation is a dynamic process that connects genotype and phenotyp...
research
06/18/2020

Sparse Bottleneck Networks for Exploratory Analysis and Visualization of Neural Patch-seq Data

In recent years, increasingly large datasets with two different sets of ...
research
07/10/2018

DeepDiff: Deep-learning for predicting Differential gene expression from histone modifications

Computational methods that predict differential gene expression from his...
research
08/31/2019

Triclustering of Gene Expression Microarray Data Using Coarse-Grained Parallel Genetic Algorithm

Microarray data analysis is one of the major area of research in the fie...
research
08/29/2023

From RNA sequencing measurements to the final results: a practical guide to navigating the choices and uncertainties of gene set analysis

Gene set analysis, a popular approach for analyzing high-throughput gene...

Please sign up or login with your details

Forgot password? Click here to reset