Gene Transformer: Transformers for the Gene Expression-based Classification of Cancer Subtypes

08/26/2021
by   Anwar Khan, et al.

Adenocarcinoma and squamous cell carcinoma constitute approximately 40% and 30% of all lung cancer subtypes, respectively, and display broad heterogeneity in terms of clinical and molecular responses to therapy. Molecular subtyping has enabled precision medicine to overcome these challenges and provide significant biological insights to predict prognosis and improve clinical decision making. Over the past decade, conventional machine learning (ML) algorithms and deep learning (DL)-based convolutional neural networks (CNNs) have been adopted for the classification of cancer subtypes from gene expression datasets. However, these methods are potentially biased toward identification of cancer biomarkers. Recently proposed transformer-based architectures that leverage the self-attention mechanism encode high-throughput gene expressions and learn representations that are computationally complex and parametrically expensive. However, compared to the datasets for natural language processing applications, gene expression data consist of several hundreds of thousands of genes from a limited number of observations, making it difficult to efficiently train transformers for bioinformatics applications. Hence, we propose an end-to-end deep learning approach, Gene Transformer, which addresses the complexity of high-dimensional gene expression with a multi-head self-attention module. The module identifies relevant biomarkers across multiple cancer subtypes without requiring the feature selection step that current classification algorithms need as a prerequisite. The proposed architecture achieved an overall improved performance for all evaluation metrics and had fewer misclassification errors than the commonly used traditional classification algorithms. The classification results show that Gene Transformer can be an efficient approach for classifying cancer subtypes, indicating that any improvement in deep learning models in computational biology can also be reflected well in this domain.
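The abstract does not spell out the architecture, so the following is only a rough illustration of how a multi-head self-attention module might be applied to a gene expression vector for subtype classification. It is a minimal NumPy sketch, not the authors' implementation: the patch-based tokenization, the sizes (`n_genes`, `patch`, `d_model`, `n_heads`), the mean pooling, the random untrained weights, and the names `classify` and `params` are all assumptions made for illustration.

```python
import numpy as np


def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def multi_head_self_attention(tokens, w_q, w_k, w_v, w_o, n_heads):
    """Scaled dot-product self-attention over tokens of shape (n_tokens, d_model).

    Returns the attended tokens and the per-head attention weights.
    """
    n_tokens, d_model = tokens.shape
    d_head = d_model // n_heads

    def split_heads(x):
        # (n_tokens, d_model) -> (n_heads, n_tokens, d_head)
        return x.reshape(n_tokens, n_heads, d_head).transpose(1, 0, 2)

    q = split_heads(tokens @ w_q)
    k = split_heads(tokens @ w_k)
    v = split_heads(tokens @ w_v)

    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)  # (n_heads, n_tokens, n_tokens)
    attn = softmax(scores, axis=-1)
    context = attn @ v                                   # (n_heads, n_tokens, d_head)

    # Merge the heads back into (n_tokens, d_model) and apply the output projection.
    merged = context.transpose(1, 0, 2).reshape(n_tokens, d_model)
    return merged @ w_o, attn


def classify(expression, params, n_heads=4):
    """Hypothetical classifier: patch-embed genes, attend, mean-pool, project to classes."""
    patch = params["w_embed"].shape[0]
    # Assumed tokenization: consecutive groups of `patch` genes become one token.
    tokens = expression.reshape(-1, patch) @ params["w_embed"]
    attended, attn = multi_head_self_attention(
        tokens, params["w_q"], params["w_k"], params["w_v"], params["w_o"], n_heads
    )
    pooled = attended.mean(axis=0)
    return softmax(pooled @ params["w_cls"]), attn


# Illustrative sizes and random (untrained) weights; none come from the paper.
rng = np.random.default_rng(0)
n_genes, patch, d_model, n_classes = 1024, 32, 16, 2
params = {
    "w_embed": rng.normal(scale=0.1, size=(patch, d_model)),
    "w_q": rng.normal(scale=0.1, size=(d_model, d_model)),
    "w_k": rng.normal(scale=0.1, size=(d_model, d_model)),
    "w_v": rng.normal(scale=0.1, size=(d_model, d_model)),
    "w_o": rng.normal(scale=0.1, size=(d_model, d_model)),
    "w_cls": rng.normal(scale=0.1, size=(d_model, n_classes)),
}

expression = rng.normal(size=n_genes)  # stand-in for one normalized expression profile
probs, attn = classify(expression, params)
print(probs.shape, attn.shape)
```

In a sketch like this, the rows of `attn` indicate how strongly each gene group attends to every other group, which is the kind of signal one could inspect when looking for candidate biomarkers. A trained model would learn the weights from labeled samples rather than drawing them at random.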


research
05/04/2023

Fuzzy Gene Selection and Cancer Classification Based on Deep Learning Model

Machine learning (ML) approaches have been used to develop highly accura...
research
12/20/2018

A Method to Facilitate Cancer Detection and Type Classification from Gene Expression Data using a Deep Autoencoder and Neural Network

With the increased affordability and availability of whole-genome sequen...
research
12/09/2016

DeepCancer: Detecting Cancer through Gene Expressions via Deep Generative Learning

Transcriptional profiling on microarrays to obtain gene expressions has ...
research
05/31/2022

A robust and lightweight deep attention multiple instance learning algorithm for predicting genetic alterations

Deep-learning models based on whole-slide digital pathology images (WSIs...
research
07/24/2023

A Hybrid Machine Learning Model for Classifying Gene Mutations in Cancer using LSTM, BiLSTM, CNN, GRU, and GloVe

This study presents an ensemble model combining LSTM, BiLSTM, CNN, GRU, ...
research
08/01/2017

Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin

The past decade has seen a revolution in genomic technologies that enabl...
research
03/19/2023

Studying Limits of Explainability by Integrated Gradients for Gene Expression Models

Understanding the molecular processes that drive cellular life is a fund...
