Multimodal Transformer-based Model for Buchwald-Hartwig and Suzuki-Miyaura Reaction Yield Prediction

04/27/2022
by   Shimaa Baraka, et al.
0

Predicting the yield percentage of a chemical reaction is useful in many aspects such as reducing wet-lab experimentation by giving the priority to the reactions with a high predicted yield. In this work we investigated the use of multiple type inputs to predict chemical reaction yield. We used simplified molecular-input line-entry system (SMILES) as well as calculated chemical descriptors as model inputs. The model consists of a pre-trained bidirectional transformer-based encoder (BERT) and a multi-layer perceptron (MLP) with a regression head to predict the yield. We experimented on two high throughput experimentation (HTE) datasets for Buchwald-Hartwig and Suzuki-Miyaura reactions. The experiments show improvements in the prediction on both datasets compared to systems using only SMILES or chemical descriptors as input. We also tested the model's performance on out-of-sample dataset splits of Buchwald-Hartwig and achieved comparable results with the state-of-the-art. In addition to predicting the yield, we demonstrated the model's ability to suggest the optimum (highest yield) reaction conditions. The model was able to suggest conditions that achieves 94 proves the model to be useful in achieving the best results in the wet lab without expensive experimentation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2018

Molecular Transformer for Chemical Reaction Prediction and Uncertainty Estimation

Organic synthesis is one of the key stumbling blocks in medicinal chemis...
research
08/22/2022

MetaRF: Differentiable Random Forest for Reaction Yield Prediction with a Few Trails

Artificial intelligence has deeply revolutionized the field of medicinal...
research
04/25/2022

Predicting Real-time Scientific Experiments Using Transformer models and Reinforcement Learning

Life and physical sciences have always been quick to adopt the latest ad...
research
01/29/2022

Prediction of terephthalic acid (TPA) yield in aqueous hydrolysis of polyethylene terephthalate (PET)

Aqueous hydrolysis is used to chemically recycle polyethylene terephthal...
research
09/21/2021

Chemical-Reaction-Aware Molecule Representation Learning

Molecule representation learning (MRL) methods aim to embed molecules in...
research
06/08/2021

Non-Autoregressive Electron Redistribution Modeling for Reaction Prediction

Reliably predicting the products of chemical reactions presents a fundam...
research
05/06/2021

Dataset Bias in the Natural Sciences: A Case Study in Chemical Reaction Prediction and Synthesis Design

Datasets in the Natural Sciences are often curated with the goal of aidi...

Please sign up or login with your details

Forgot password? Click here to reset