Multi-Fidelity Active Learning with GFlowNets

06/20/2023
by   Alex Hernández-García, et al.

In recent decades, the capacity to generate large amounts of data in science and engineering applications has grown steadily. Meanwhile, progress in machine learning has made it a suitable tool to process and utilise the available data. Nonetheless, many relevant scientific and engineering problems present challenges where current machine learning methods cannot yet efficiently leverage the available data and resources. For example, in scientific discovery, we are often faced with the problem of exploring very large, high-dimensional spaces, where querying a high-fidelity, black-box objective function is very expensive. Progress in machine learning methods that can efficiently tackle such problems would help accelerate crucial areas such as drug and materials discovery. In this paper, we propose the use of GFlowNets for multi-fidelity active learning, where multiple approximations of the black-box function are available at lower fidelity and cost. GFlowNets are recently proposed methods for amortised probabilistic inference that have proven efficient for exploring large, high-dimensional spaces and can hence be practical in the multi-fidelity setting too. Here, we describe our algorithm for multi-fidelity active learning with GFlowNets and evaluate its performance on both well-studied synthetic tasks and practically relevant applications of molecular discovery. Our results show that multi-fidelity active learning with GFlowNets can efficiently leverage the availability of multiple oracles with different costs and fidelities to accelerate scientific discovery and engineering design.
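To make the multi-fidelity setting concrete, the sketch below shows oracles of different cost and fidelity being queried under a fixed budget. It is only an illustration of the problem setup, not the algorithm proposed in the paper: the candidate selection here is a simple screen-then-refine heuristic rather than a GFlowNet, and every name in it (`target`, `Oracle`, `ORACLES`, `run_campaign`, the costs, and the toy objective) is a hypothetical assumption.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


def target(x: float) -> float:
    """Hypothetical expensive black-box objective (the highest fidelity)."""
    return math.sin(3.0 * x) + 0.5 * x


@dataclass(frozen=True)
class Oracle:
    name: str
    cost: float
    evaluate: Callable[[float], float]


# Hypothetical oracles: the cheap one keeps only the linear trend of `target`.
ORACLES = (
    Oracle("low", 1.0, lambda x: 0.5 * x),
    Oracle("high", 10.0, target),
)


def run_campaign(
    candidates: List[float], budget: float, top_k: int = 3
) -> Tuple[List[Tuple[float, str, float]], float]:
    """Screen all candidates cheaply, then refine the best ones at high fidelity.

    Returns the collected (x, oracle name, value) records and the budget spent.
    """
    low, high = ORACLES
    spent = 0.0
    dataset: List[Tuple[float, str, float]] = []

    # Stage 1: query the cheap oracle while the budget allows.
    low_scores: Dict[float, float] = {}
    for x in candidates:
        if spent + low.cost > budget:
            break
        low_scores[x] = low.evaluate(x)
        dataset.append((x, low.name, low_scores[x]))
        spent += low.cost

    # Stage 2: spend what remains on the most promising candidates.
    ranked = sorted(low_scores, key=low_scores.get, reverse=True)
    for x in ranked[:top_k]:
        if spent + high.cost > budget:
            break
        dataset.append((x, high.name, high.evaluate(x)))
        spent += high.cost

    return dataset, spent
```

With eleven candidates and a budget of 35 cost units, this sketch spends 11 units on screening and can then afford only two high-fidelity queries. That trade-off between cost and fidelity is what a multi-fidelity active learning method has to manage.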


research
05/07/2023

Disentangled Multi-Fidelity Deep Bayesian Active Learning

To balance quality and cost, various domain areas of science and enginee...
research
07/09/2020

Resource Aware Multifidelity Active Learning for Efficient Optimization

Traditional methods for black box optimization require a considerable nu...
research
02/01/2023

GFlowNets for AI-Driven Scientific Discovery

Tackling the most pressing problems for humanity, such as the climate cr...
research
11/06/2019

Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Discovering novel materials can be greatly accelerated by iterative mach...
research
12/02/2020

Deep Multi-Fidelity Active Learning of High-dimensional Outputs

Many applications, such as in physical simulation and engineering design...
research
10/08/2021

Opportunities for Machine Learning to Accelerate Halide Perovskite Commercialization and Scale-Up

While halide perovskites attract significant academic attention, example...
research
10/22/2021

GeneDisco: A Benchmark for Experimental Design in Drug Discovery

In vitro cellular experimentation with genetic interventions, using for ...
