Correlating Cellular Features with Gene Expression using CCA

02/24/2018
by   Vaishnavi Subramanian, et al.
0

To understand the biology of cancer, joint analysis of multiple data modalities, including imaging and genomics, is crucial. The involved nature of gene-microenvironment interactions necessitates the use of algorithms which treat both data types equally. We propose the use of canonical correlation analysis (CCA) and a sparse variant as a preliminary discovery tool for identifying connections across modalities, specifically between gene expression and features describing cell and nucleus shape, texture, and stain intensity in histopathological images. Applied to 615 breast cancer samples from The Cancer Genome Atlas, CCA revealed significant correlation of several image features with expression of PAM50 genes, known to be linked to outcome, while Sparse CCA revealed associations with enrichment of pathways implicated in cancer without leveraging prior biological understanding. These findings affirm the utility of CCA for joint phenotype-genotype analysis of cancer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2020

Biological Random Walks: integrating heterogeneous data in disease gene prioritization

This work proposes a unified framework to leverage biological informatio...
research
08/18/2017

Data-Driven Tree Transforms and Metrics

We consider the analysis of high dimensional data given in the form of a...
research
01/31/2011

Dependency detection with similarity constraints

Unsupervised two-view learning, or detection of dependencies between two...
research
03/17/2023

Breast Cancer Histopathology Image based Gene Expression Prediction using Spatial Transcriptomics data and Deep Learning

Tumour heterogeneity in breast cancer poses challenges in predicting out...
research
03/20/2014

Network-based Isoform Quantification with RNA-Seq Data for Cancer Transcriptome Analysis

High-throughput mRNA sequencing (RNA-Seq) is widely used for transcript ...
research
03/19/2023

Studying Limits of Explainability by Integrated Gradients for Gene Expression Models

Understanding the molecular processes that drive cellular life is a fund...
research
11/05/2021

Compressed spectral screening for large-scale differential correlation analysis with application in selecting Glioblastoma gene modules

Differential co-expression analysis has been widely applied by scientist...

Please sign up or login with your details

Forgot password? Click here to reset