Transform-Domain Classification of Human Cells based on DNA Methylation Datasets

12/31/2019
by   Xueyuan Zhao, et al.

This work presents a novel method for classifying human cells by applying a transform-domain method to DNA methylation data. DNA methylation profiles in human cells vary as disease stages progress, and the proposed method uses this variation to distinguish normal cells from diseased cells, including cancer cells. The cancer cell types investigated are hepatocellular (sample size n = 40), colorectal (n = 44), lung (n = 70) and endometrial (n = 87). A new pipeline is proposed that integrates the DNA methylation intensity measurements across all CpG islands using the Walsh-Hadamard Transform (WHT). The study reveals the three-step properties of the transform-domain DNA methylation data and the association of the step values with cell status. The proposed machine learning pipeline is further assessed on the classification of normal and cancer tissue cells. Several machine learning classifiers are compared on whole-sequence and WHT-sequence classification using public Whole-Genome Bisulfite Sequencing (WGBS) DNA methylation datasets. The WHT-based method reduces computation time by more than one order of magnitude compared with classification on the original whole sequence, while the selected classifiers maintain comparable classification accuracy. The proposed method has broad applications in the rapid classification of diseased and normal human cells from epigenome and genome datasets.
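To make the transform step concrete, the following is a minimal sketch of a fast Walsh-Hadamard transform applied to a vector of methylation intensities. It is not the authors' pipeline: the function names (`fwht`, `pad_to_power_of_two`, `wht_features`), the zero-padding to a power-of-two length, the natural (Hadamard) coefficient ordering, and the idea of keeping only the first `keep` coefficients as classifier input are all assumptions made for illustration.

```python
def fwht(values):
    """Unnormalized fast Walsh-Hadamard transform in natural (Hadamard) order.

    The input length must be a positive power of two.
    """
    a = [float(v) for v in values]
    n = len(a)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a positive power of two")
    h = 1
    while h < n:
        # Butterfly stage: combine pairs of elements that are h apart.
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                x, y = a[i], a[i + h]
                a[i], a[i + h] = x + y, x - y
        h *= 2
    return a


def pad_to_power_of_two(values):
    """Zero-pad a sequence to the next power-of-two length (an assumption)."""
    a = [float(v) for v in values]
    n = 1
    while n < len(a):
        n *= 2
    return a + [0.0] * (n - len(a))


def wht_features(methylation, keep):
    """Return the first `keep` WHT coefficients of a methylation vector.

    Truncating to `keep` coefficients is a hypothetical way to shorten the
    classifier input; the abstract does not specify how the WHT sequence
    is built.
    """
    coeffs = fwht(pad_to_power_of_two(methylation))
    return coeffs[:keep]


if __name__ == "__main__":
    # Hypothetical per-CpG-island methylation intensities in [0, 1].
    sample = [0.1, 0.9, 0.5, 0.3, 0.7]
    print(wht_features(sample, keep=4))
```

The resulting short feature vector could then be passed to any standard classifier in place of the full-length methylation sequence, which is one plausible source of the reported reduction in computation time.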


