Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions

07/14/2023
by   Leo Klarner, et al.

Accelerating the discovery of novel and more effective therapeutics is an important pharmaceutical problem in which deep learning is playing an increasingly significant role. However, real-world drug discovery tasks are often characterized by a scarcity of labeled data and significant covariate shift, a setting that poses a challenge to standard deep learning methods. In this paper, we present Q-SAVI, a probabilistic model able to address these challenges by encoding explicit prior knowledge of the data-generating process into a prior distribution over functions, presenting researchers with a transparent and probabilistically principled way to encode data-driven modeling preferences. Building on a novel, gold-standard bioactivity dataset that facilitates a meaningful comparison of models in an extrapolative regime, we explore different approaches to induce data shift and construct a challenging evaluation setup. We then demonstrate that using Q-SAVI to integrate contextualized prior knowledge of drug-like chemical space into the modeling process affords substantial gains in predictive accuracy and calibration, outperforming a broad range of state-of-the-art self-supervised pre-training and domain adaptation techniques.

research
12/21/2020

Learn molecular representations from large-scale unlabeled molecules for drug discovery

How to produce expressive molecular representations is a fundamental cha...
research
03/12/2020

Meta-Learning Initializations for Low-Resource Drug Discovery

Building in silico models to predict chemical properties and activities ...
research
10/31/2022

Evaluating Point-Prediction Uncertainties in Neural Networks for Drug Discovery

Neural Network (NN) models provide potential to speed up the drug discov...
research
02/29/2020

Calibrated Prediction with Covariate Shift via Unsupervised Domain Adaptation

Reliable uncertainty estimates are an important tool for helping autonom...
research
07/03/2023

CardiGraphormer: Unveiling the Power of Self-Supervised Learning in Revolutionizing Drug Discovery

In the expansive realm of drug discovery, with approximately 15,000 know...
research
07/18/2022

Prior Knowledge Guided Unsupervised Domain Adaptation

The waive of labels in the target domain makes Unsupervised Domain Adapt...
research
06/03/2019

Deep Reasoning Networks: Thinking Fast and Slow

We introduce Deep Reasoning Networks (DRNets), an end-to-end framework t...
