Population-Based Black-Box Optimization for Biological Sequence Design

06/05/2020
by   Christof Angermueller, et al.
1

The use of black-box optimization for the design of new biological sequences is an emerging research area with potentially revolutionary impact. The cost and latency of wet-lab experiments requires methods that find good sequences in few experimental rounds of large batches of sequences–a setting that off-the-shelf black-box optimization methods are ill-equipped to handle. We find that the performance of existing methods varies drastically across optimization tasks, posing a significant obstacle to real-world applications. To improve robustness, we propose Population-Based Black-Box Optimization (P3BO), which generates batches of sequences by sampling from an ensemble of methods. The number of sequences sampled from any method is proportional to the quality of sequences it previously proposed, allowing P3BO to combine the strengths of individual methods while hedging against their innate brittleness. Adapting the hyper-parameters of each of the methods online using evolutionary optimization further improves performance. Through extensive experiments on in-silico optimization tasks, we show that P3BO outperforms any single method in its population, proposing higher quality sequences as well as more diverse batches. As such, P3BO and Adaptive-P3BO are a crucial step towards deploying ML to real-world sequence design.

READ FULL TEXT
research
10/06/2021

Unifying Likelihood-free Inference with Black-box Sequence Design and Beyond

Black-box optimization formulations for biological sequence design have ...
research
02/08/2022

Fourier Representations for Black-Box Optimization over Categorical Variables

Optimization of real-world black-box functions defined over purely categ...
research
03/18/2021

Learning How to Optimize Black-Box Functions With Extreme Limits on the Number of Function Evaluations

We consider black-box optimization in which only an extremely limited nu...
research
09/13/2022

Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization

The ability to accelerate the design of biological sequences can have a ...
research
11/06/2020

Explaining Differences in Classes of Discrete Sequences

While there are many machine learning methods to classify and cluster se...
research
10/29/2020

Black-Box Optimization of Object Detector Scales

Object detectors have improved considerably in the last years by using a...
research
04/19/2023

LEA: Beyond Evolutionary Algorithms via Learned Optimization Strategy

Evolutionary algorithms (EAs) have emerged as a powerful framework for e...

Please sign up or login with your details

Forgot password? Click here to reset