Gryffin: An algorithm for Bayesian optimization for categorical variables informed by physical intuition with applications to chemistry

03/26/2020
by   Florian Häse, et al.
0

Designing functional molecules and advanced materials requires complex interdependent design choices: tuning continuous process parameters such as temperatures or flow rates, while simultaneously selecting categorical variables like catalysts or solvents. To date, the development of data-driven experiment planning strategies for autonomous experimentation has largely focused on continuous process parameters despite the urge to devise efficient strategies for the selection of categorical variables to substantially accelerate scientific discovery. We introduce Gryffin, as a general purpose optimization framework for the autonomous selection of categorical variables driven by expert knowledge. Gryffin augments Bayesian optimization with kernel density estimation using smooth approximations to categorical distributions. Leveraging domain knowledge from physicochemical descriptors to characterize categorical options, Gryffin can significantly accelerate the search for promising molecules and materials. Gryffin can further highlight relevant correlations between the provided descriptors to inspire physical insights and foster scientific intuition. In addition to comprehensive benchmarks, we demonstrate the capabilities and performance of Gryffin on three examples in materials science and chemistry: (i) the discovery of non-fullerene acceptors for organic solar cells, (ii) the design of hybrid organic-inorganic perovskites for light-harvesting, and (iii) the identification of ligands and process parameters for Suzuki-Miyaura reactions. Our observations suggest that Gryffin, in its simplest form without descriptors, constitutes a competitive categorical optimizer compared to state-of-the-art approaches. However, when leveraging domain knowledge provided via descriptors, Gryffin can optimize at considerable higher rates and refine this domain knowledge to spark scientific understanding.

READ FULL TEXT

page 3

page 6

page 9

page 10

page 20

page 23

page 25

research
03/29/2022

Bayesian optimization with known experimental and design constraints for chemistry applications

Optimization strategies driven by machine learning, such as Bayesian opt...
research
10/25/2019

Leveraging Legacy Data to Accelerate Materials Design via Preference Learning

Machine learning applications in materials science are often hampered by...
research
03/05/2021

Gemini: Dynamic Bias Correction for Autonomous Experimentation and Molecular Simulation

Bayesian optimization has emerged as a powerful strategy to accelerate s...
research
04/21/2017

High-Dimensional Materials and Process Optimization using Data-driven Experimental Design with Well-Calibrated Uncertainty Estimates

The optimization of composition and processing to obtain materials that ...
research
05/18/2023

Autonomous sputter synthesis of thin film nitrides with composition controlled by Bayesian optimization of optical plasma emission

Autonomous experimentation has emerged as an efficient approach to accel...
research
08/22/2023

HypBO: Expert-Guided Chemist-in-the-Loop Bayesian Search for New Materials

Robotics and automation offer massive accelerations for solving intracta...
research
12/01/2022

To think inside the box, or to think out of the box? Scientific discovery via the reciprocation of insights and concepts

If scientific discovery is one of the main driving forces of human progr...

Please sign up or login with your details

Forgot password? Click here to reset