MetaRF: Differentiable Random Forest for Reaction Yield Prediction with a Few Trails

08/22/2022
by   Kexin Chen, et al.
0

Artificial intelligence has deeply revolutionized the field of medicinal chemistry with many impressive applications, but the success of these applications requires a massive amount of training samples with high-quality annotations, which seriously limits the wide usage of data-driven methods. In this paper, we focus on the reaction yield prediction problem, which assists chemists in selecting high-yield reactions in a new chemical space only with a few experimental trials. To attack this challenge, we first put forth MetaRF, an attention-based differentiable random forest model specially designed for the few-shot yield prediction, where the attention weight of a random forest is automatically optimized by the meta-learning framework and can be quickly adapted to predict the performance of new reagents while given a few additional samples. To improve the few-shot learning performance, we further introduce a dimension-reduction based sampling method to determine valuable samples to be experimentally tested and then learned. Our methodology is evaluated on three different datasets and acquires satisfactory performance on few-shot prediction. In high-throughput experimentation (HTE) datasets, the average yield of our methodology's top 10 high-yield reactions is relatively close to the results of ideal yield selection.

READ FULL TEXT
research
04/27/2022

Multimodal Transformer-based Model for Buchwald-Hartwig and Suzuki-Miyaura Reaction Yield Prediction

Predicting the yield percentage of a chemical reaction is useful in many...
research
11/28/2022

Data-driven multinomial random forest

In this paper, we strengthen the previous weak consistency proof method ...
research
02/20/2015

Feature-Budgeted Random Forest

We seek decision rules for prediction-time cost reduction, where complet...
research
08/06/2021

A Deep Neural Network Approach for Crop Selection and Yield Prediction in Bangladesh

Agriculture is the essential ingredients to mankind which is a major sou...
research
05/24/2019

HDI-Forest: Highest Density Interval Regression Forest

By seeking the narrowest prediction intervals (PIs) that satisfy the spe...
research
03/02/2023

A Meta-Learning Approach to Predicting Performance and Data Requirements

We propose an approach to estimate the number of samples required for a ...
research
09/26/2019

Deep Learning and Random Forest-Based Augmentation of sRNA Expression Profiles

The lack of well-structured annotations in a growing amount of RNA expre...

Please sign up or login with your details

Forgot password? Click here to reset