Subcellular Protein Localisation in the Human Protein Atlas using Ensembles of Diverse Deep Architectures

05/19/2022
by   Syed Sameed Husain, et al.

Automated visual localisation of subcellular proteins can accelerate our understanding of cell function in health and disease. Despite recent advances in machine learning (ML), humans still attain superior accuracy by using diverse clues. We show how this gap can be narrowed by addressing three key aspects: (i) automated improvement of cell annotation quality, (ii) new Convolutional Neural Network (CNN) architectures supporting unbalanced and noisy data, and (iii) informed selection and fusion of multiple diverse machine learning models. We introduce a new "AI-trains-AI" method for improving the quality of weak labels and propose novel CNN architectures exploiting wavelet filters and Weibull activations. We also explore key factors in the multi-CNN ensembling process by analysing correlations between image-level and cell-level predictions. Finally, in the context of the Human Protein Atlas, we demonstrate that our system achieves state-of-the-art performance in the multi-label single-cell classification of protein localisation patterns. It also significantly improves generalisation ability.

research
11/05/2022

Learning the shape of protein micro-environments with a holographic convolutional neural network

Proteins play a central role in biology from immune recognition to brain...
research
03/30/2017

Near Perfect Protein Multi-Label Classification with Deep Neural Networks

Artificial neural networks (ANNs) have gained a well-deserved popularity...
research
06/04/2021

Deep Contextual Learners for Protein Networks

Spatial context is central to understanding health and disease. Yet refe...
research
01/18/2023

Beating the Best: Improving on AlphaFold2 at Protein Structure Prediction

The goal of Protein Structure Prediction (PSP) problem is to predict a p...
research
05/24/2022

Learning multi-scale functional representations of proteins from single-cell microscopy data

Protein function is inherently linked to its localization within the cel...
research
09/30/2022

ModelAngelo: Automated Model Building in Cryo-EM Maps

Electron cryo-microscopy (cryo-EM) produces three-dimensional (3D) maps ...
