Gaussian Process Molecule Property Prediction with FlowMO

by Henry B. Moss, et al.

We present FlowMO: an open-source Python library for molecular property prediction with Gaussian Processes. Built upon GPflow and RDKit, FlowMO enables the user to make predictions with well-calibrated uncertainty estimates, an output central to active learning and molecular design applications. Gaussian Processes are particularly attractive for modelling small molecular datasets, a characteristic of many real-world virtual screening campaigns where high-quality experimental data is scarce. Computational experiments across three small datasets demonstrate comparable predictive performance to deep learning methods but with superior uncertainty calibration.
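To make the idea of a predictive mean paired with an uncertainty estimate concrete, the following is a minimal sketch of exact Gaussian Process regression over molecular fingerprints. It is not FlowMO's API and does not use GPflow or RDKit. It assumes a Tanimoto kernel over binary fingerprints (the abstract does not name a kernel), represents each fingerprint as a set of on-bit indices, and uses made-up toy fingerprints and property values. All function names are hypothetical.

```python
import math

def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity between two sets of on-bit indices."""
    union = len(a | b)
    return len(a & b) / union if union else 1.0

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def solve_lower(L, b):
    """Solve L x = b by forward substitution."""
    x = []
    for i, row in enumerate(L):
        x.append((b[i] - sum(row[k] * x[k] for k in range(i))) / row[i])
    return x

def solve_lower_transpose(L, b):
    """Solve L^T x = b by back substitution."""
    n = len(L)
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(L[k][i] * x[k] for k in range(i + 1, n))
        x[i] = (b[i] - s) / L[i][i]
    return x

def gp_predict(train_fps, train_y, test_fps, noise=0.1):
    """Posterior mean and latent variance of a zero-mean GP with a Tanimoto kernel."""
    n = len(train_fps)
    # Kernel matrix with observation-noise variance added on the diagonal.
    K = [[tanimoto(train_fps[i], train_fps[j]) + (noise if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    L = cholesky(K)
    alpha = solve_lower_transpose(L, solve_lower(L, train_y))
    means, variances = [], []
    for fp in test_fps:
        k_star = [tanimoto(fp, t) for t in train_fps]
        v = solve_lower(L, k_star)
        means.append(sum(k * a for k, a in zip(k_star, alpha)))
        variances.append(tanimoto(fp, fp) - sum(x * x for x in v))
    return means, variances

# Toy data (assumed for illustration): fingerprints as sets of on-bit indices.
train_fps = [{0, 1, 2}, {1, 2, 3}, {7, 8, 9}]
train_y = [1.0, 0.8, -1.0]
# First test molecule resembles the training set; the second shares no bits with it.
test_fps = [{0, 1, 2, 3}, {20, 21}]

means, variances = gp_predict(train_fps, train_y, test_fps)
for m, v in zip(means, variances):
    print(f"mean={m:.3f}  variance={v:.3f}")
```

In this sketch, the test molecule that shares no bits with any training molecule reverts to the prior mean of zero with the full prior variance, while the similar molecule gets a lower variance. This is the kind of signal an active learning loop can use to decide which molecule to measure next.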


