Deep Learning-Based Strategy for Macromolecules Classification with Imbalanced Data from Cellular Electron Cryotomography

08/27/2019
by   Ziqian Luo, et al.
18

Deep learning model trained by imbalanced data may not work satisfactorily since it could be determined by major classes and thus may ignore the classes with small amount of data. In this paper, we apply deep learning based imbalanced data classification for the first time to cellular macromolecular complexes captured by Cryo-electron tomography (Cryo-ET). We adopt a range of strategies to cope with imbalanced data, including data sampling, bagging, boosting, Genetic Programming based method and. Particularly, inspired from Inception 3D network, we propose a multi-path CNN model combining focal loss and mixup on the Cryo-ET dataset to expand the dataset, where each path had its best performance corresponding to each type of data and let the network learn the combinations of the paths to improve the classification performance. In addition, extensive experiments have been conducted to show our proposed method is flexible enough to cope with different number of classes by adjusting the number of paths in our multi-path model. To our knowledge, this work is the first application of deep learning methods of dealing with imbalanced data to the internal tissue classification of cell macromolecular complexes, which opened up a new path for cell classification in the field of computational biology.

READ FULL TEXT

page 1

page 5

research
06/02/2021

Hybrid Ensemble optimized algorithm based on Genetic Programming for imbalanced data classification

One of the most significant current discussions in the field of data min...
research
04/28/2018

Imbalanced Deep Learning by Minority Class Incremental Rectification

Model learning from class imbalanced training data is a long-standing an...
research
08/16/2021

TL-SDD: A Transfer Learning-Based Method for Surface Defect Detection with Few Samples

Surface defect detection plays an increasingly important role in manufac...
research
06/25/2023

DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets

Nuclei segmentation and classification is a significant process in patho...
research
07/17/2018

Pseudo-Feature Generation for Imbalanced Data Analysis in Deep Learning

We generate pseudo-features by multivariate probability distributions ob...
research
07/12/2023

Early Autism Diagnosis based on Path Signature and Siamese Unsupervised Feature Compressor

Autism Spectrum Disorder (ASD) has been emerging as a growing public hea...
research
09/05/2023

BeeTLe: A Framework for Linear B-Cell Epitope Prediction and Classification

The process of identifying and characterizing B-cell epitopes, which are...

Please sign up or login with your details

Forgot password? Click here to reset