Learning Curves for Drug Response Prediction in Cancer Cell Lines

11/25/2020
by Alexander Partin, et al.

Motivated by the size of cell line drug sensitivity data, researchers have been developing machine learning (ML) models for predicting drug response to advance cancer treatment. As drug sensitivity studies continue generating data, a common question is whether the proposed predictors can further improve the generalization performance with more training data. We utilize empirical learning curves for evaluating and comparing the data scaling properties of two neural networks (NNs) and two gradient boosting decision tree (GBDT) models trained on four drug screening datasets. The learning curves are accurately fitted to a power law model, providing a framework for assessing the data scaling behavior of these predictors. The curves demonstrate that no single model dominates in terms of prediction performance across all datasets and training sizes, suggesting that the shape of these curves depends on the unique model-dataset pair. The multi-input NN (mNN), in which gene expressions and molecular drug descriptors are input into separate subnetworks, outperforms a single-input NN (sNN), where the cell and drug features are concatenated for the input layer. In contrast, a GBDT with hyperparameter tuning exhibits superior performance as compared with both NNs at the lower range of training sizes for two of the datasets, whereas the mNN performs better at the higher range of training sizes. Moreover, the trajectory of the curves suggests that increasing the sample size is expected to further improve prediction scores of both NNs. These observations demonstrate the benefit of using learning curves to evaluate predictors, providing a broader perspective on the overall data scaling characteristics. The fitted power law curves provide a forward-looking performance metric and can serve as a co-design tool to guide experimental biologists and computational scientists in the design of future experiments.
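As a rough illustration of the power law fitting mentioned above, the following minimal sketch fits a two-parameter curve, error ≈ a · m^b, to a learning curve by ordinary least squares in log-log space. This is a simplification and not necessarily the exact functional form or fitting procedure used in the paper; in particular, it omits any constant term for irreducible error. The function names and the synthetic data points are hypothetical and serve only to demonstrate the mechanics.

```python
import math


def fit_power_law(sizes, errors):
    """Fit errors ~ a * sizes**b by least squares on log-transformed data.

    Assumption: a plain two-parameter power law with no offset term.
    """
    xs = [math.log(m) for m in sizes]
    ys = [math.log(e) for e in errors]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Slope and intercept of the regression line log(e) = log(a) + b * log(m)
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = math.exp(y_mean - b * x_mean)
    return a, b


def predict_error(a, b, size):
    """Evaluate the fitted curve at a given training set size."""
    return a * size ** b


# Synthetic learning curve (illustrative only, not data from the paper):
# validation error generated from a known power law.
train_sizes = [1000, 2000, 4000, 8000, 16000]
val_errors = [2.0 * m ** -0.3 for m in train_sizes]

a, b = fit_power_law(train_sizes, val_errors)

# Extrapolate to a training size beyond the observed range.
extrapolated = predict_error(a, b, 64000)
print(f"a={a:.3f}, b={b:.3f}, predicted error at m=64000: {extrapolated:.4f}")
```

A negative exponent b indicates that the error keeps decreasing as the training set grows, which is the sense in which a fitted curve can serve as a forward-looking estimate. With real, noisy measurements, a nonlinear fit that includes an offset term may describe the curve better than this log-log regression.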

research
12/28/2018

Drug cell line interaction prediction

Understanding the phenotypic drug response on cancer cell lines plays a ...
research
04/30/2020

A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

By combining various cancer cell line (CCL) drug screening panels, the s...
research
11/16/2018

Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models

Drug resistance is still a major challenge in cancer therapy. Drug combi...
research
02/11/2018

Drug response prediction by ensemble learning and drug-induced gene expression signatures

Chemotherapeutic response of cancer cells to a given compound is one of ...
research
01/23/2018

Drug Selection via Joint Push and Learning to Rank

Selecting the right drugs for the right patients is a primary goal of pr...
research
03/19/2021

The Shape of Learning Curves: a Review

Learning curves provide insight into the dependence of a learner's gener...
