Graph Neural Networks for Breast Cancer Data Integration

11/28/2022
by Teodora Reu, et al.

International initiatives such as METABRIC (Molecular Taxonomy of Breast Cancer International Consortium) have collected several multigenomic and clinical data sets to identify the underlying molecular processes that take place throughout the evolution of various cancers. Numerous machine learning and statistical models have been designed and trained to analyze these types of data independently; however, the integration of such differently shaped and sourced information streams has not been extensively studied. Better integrating these data sets and generating meaningful representations that can ultimately be leveraged for cancer detection tasks could lead to giving patients well-suited treatments. Hence, we propose a novel learning pipeline comprising three steps: the integration of cancer data modalities as graphs, the application of Graph Neural Networks in an unsupervised setting to generate lower-dimensional embeddings from the combined data, and finally the feeding of the new representations into a cancer sub-type classification model for evaluation. Because METABRIC does not store relationships between the patient modalities, the graph construction algorithms are described in depth, with a discussion of their influence on the quality of the generated embeddings. We also present the models used to generate the lower-dimensional latent representations: Graph Neural Networks, Variational Graph Autoencoders and Deep Graph Infomax. In parallel, the pipeline is tested on a synthetic dataset to demonstrate that the characteristics of the underlying data, such as homophily levels, greatly influence the performance of the pipeline, which ranges between 51% and 98% accuracy on artificial data, and between 13% and 80% on METABRIC. This project has the potential to improve cancer data understanding and encourages the transition of regular data sets to graph-shaped data.
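
To make the first pipeline step and the notion of homophily more concrete, the following is a minimal sketch, not the authors' method. The abstract does not say which graph construction algorithms are used, so the sketch assumes a simple k-nearest-neighbour rule over patient feature vectors. The function names `knn_graph` and `edge_homophily`, the synthetic features, and the value of `k` are all hypothetical and chosen only for illustration.

```python
import numpy as np


def knn_graph(features, k):
    """Return a symmetric boolean adjacency matrix linking each row to its k nearest rows.

    Assumption: Euclidean k-NN is used here only as an illustrative
    construction rule; it is not taken from the paper.
    """
    features = np.asarray(features, dtype=float)
    n = features.shape[0]
    # Pairwise squared Euclidean distances between all rows.
    diff = features[:, None, :] - features[None, :, :]
    dist = (diff ** 2).sum(axis=-1)
    np.fill_diagonal(dist, np.inf)  # exclude self-loops
    neighbours = np.argsort(dist, axis=1)[:, :k]
    adj = np.zeros((n, n), dtype=bool)
    rows = np.repeat(np.arange(n), k)
    adj[rows, neighbours.ravel()] = True
    # Symmetrise: keep an edge if either endpoint selected the other.
    return adj | adj.T


def edge_homophily(adj, labels):
    """Fraction of undirected edges whose two endpoints share a label."""
    labels = np.asarray(labels)
    src, dst = np.nonzero(np.triu(adj, k=1))
    if src.size == 0:
        return float("nan")
    return float((labels[src] == labels[dst]).mean())


# Synthetic stand-in for patient data: 3 hypothetical sub-types, 20 patients each.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 20)
centres = rng.normal(scale=5.0, size=(3, 8))
features = centres[labels] + rng.normal(scale=1.0, size=(60, 8))

adj = knn_graph(features, k=5)
homophily = edge_homophily(adj, labels)
print(f"edges: {int(np.triu(adj, k=1).sum())}, edge homophily: {homophily:.2f}")
```

In a sketch like this, bringing the synthetic class centres closer together or increasing the noise would be expected to lower the edge homophily of the resulting graph, which is the kind of data characteristic the abstract reports as strongly affecting pipeline accuracy.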

