KekuleScope: improved prediction of cancer cell line sensitivity using convolutional neural networks trained on compound images

11/22/2018
by   Isidro Cortes-Ciriano, et al.
0

The application of convolutional neural networks (ConvNets) to harness high-content screening images or 2D compound representations is gaining increasing attention in drug discovery. However, existing applications often require large data sets for training, or sophisticated pretraining schemes for the networks. Here, we show on eight cytotoxicity IC50 data sets from ChEMBL 23 that the in vitro activity of compounds on cancer cell lines can be accurately predicted on a continuous scale from their Kekulé structure representations alone by extending existing architectures (AlexNet, DenseNet-201, ResNet152 and VGG-19), which were pretrained on unrelated image data sets. We show that the predictive power of the generated models, which just require standard 2D compound representations as input, is comparable to that of Random Forest (RF) models trained on circular (Morgan) fingerprints, a combination which is considered to be the state of the art. Notably, including additional fully-connected layers further increases the predictive power of the networks by up to 10 shows that by simply averaging the output of the RF models and ConvNets we constantly obtain significantly lower errors in prediction (4-12 RMSE on the test set) than those obtained with either model alone, indicating that the features extracted by the convolutional layers of the ConvNets provide complementary predictive signal to Morgan fingerprints. Overall, in this work we present a set of ConvNet architectures for the prediction of compound activity from their Kekulé structure representations with state-of-the-art performance, that require no generation of compound descriptors or use of sophisticated image processing techniques. The data sets and the code used are provided at https://github.com/isidroc/kekulescope.

READ FULL TEXT

page 26

page 27

page 29

page 30

research
04/12/2019

Reliable Prediction Errors for Deep Neural Networks Using Test-Time Dropout

While the use of deep learning in drug discovery is gaining increasing a...
research
12/28/2018

Drug cell line interaction prediction

Understanding the phenotypic drug response on cancer cell lines plays a ...
research
09/24/2018

Deep Confidence: A Computationally Efficient Framework for Calculating Reliable Errors for Deep Neural Networks

Deep learning architectures have proved versatile in a number of drug di...
research
06/21/2018

Interpretable Discovery in Large Image Data Sets

Automated detection of new, interesting, unusual, or anomalous images wi...
research
05/17/2021

Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud Detection

With the development of high technology, the scope of fraud is increasin...
research
10/02/2018

A Deep Autoencoder System for Differentiation of Cancer Types Based on DNA Methylation State

A Deep Autoencoder based content retrieval algorithm is proposed for pre...
research
01/31/2021

CODE-AE: A Coherent De-confounding Autoencoder for Predicting Patient-Specific Drug Response From Cell Line Transcriptomics

Accurate and robust prediction of patient's response to drug treatments ...

Please sign up or login with your details

Forgot password? Click here to reset