Stochastic Threshold Model Trees: A Tree-Based Ensemble Method for Dealing with Extrapolation

09/19/2020
by Kohei Numata, et al.

In the field of chemistry, there have been many attempts to predict the properties of unknown compounds using statistical models built with machine learning. In regions of the feature space where many known compounds are present (the interpolation area), an accurate model can be constructed. In contrast, properties in regions where there are no known compounds (the extrapolation area) are generally difficult to predict. However, in the development of new materials, it is desirable to search this extrapolation area and discover compounds with unprecedented physical properties. In this paper, we propose Stochastic Threshold Model Trees (STMT), an extrapolation method that reflects the trend of the data while maintaining the accuracy of conventional interpolation methods. The behavior of STMT is confirmed through experiments using both artificial and real data. In the case of the real data, although there is no significant overall improvement in accuracy, there is one compound for which the prediction accuracy is notably improved, suggesting that STMT reflects the data trends in the extrapolation area. We believe that the proposed method will contribute to more efficient searches in situations such as new material development.
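The difference between interpolation and extrapolation behavior can be illustrated with a small toy sketch. This is not the authors' STMT implementation: the abstract does not specify the algorithm, so the depth-1 trees, the helper names (`fit_line`, `fit_stump`, `predict`), the toy data, and the uniform sampling of thresholds are all assumptions made for illustration. The sketch compares a tree with constant leaves, which predicts a flat value outside the training range, against trees with linear leaf models and randomly drawn split thresholds, which can follow the trend of the data.

```python
import random
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b on 1-D data."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:  # a single distinct x value: fall back to a constant
        return 0.0, my
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return a, my - a * mx

def fit_stump(xs, ys, threshold, linear_leaves):
    """Depth-1 tree split at `threshold`; each leaf is a constant or a line."""
    leaves = []
    for in_leaf in (lambda x: x <= threshold, lambda x: x > threshold):
        lx = [x for x in xs if in_leaf(x)]
        ly = [y for x, y in zip(xs, ys) if in_leaf(x)]
        leaves.append(fit_line(lx, ly) if linear_leaves else (0.0, mean(ly)))
    return threshold, leaves

def predict(stump, x):
    """Route x to the left or right leaf and evaluate that leaf's model."""
    threshold, (left, right) = stump
    a, b = left if x <= threshold else right
    return a * x + b

# Toy training data on [0, 10] with a linear trend (hypothetical data).
xs = [float(i) for i in range(11)]
ys = [2.0 * x + 1.0 for x in xs]

const_tree = fit_stump(xs, ys, 5.0, linear_leaves=False)
model_tree = fit_stump(xs, ys, 5.0, linear_leaves=True)

# Assumption: thresholds are drawn uniformly from the interior of the range.
rng = random.Random(0)
ensemble = [fit_stump(xs, ys, rng.uniform(1.0, 9.0), True) for _ in range(25)]
ensemble_pred = mean(predict(t, 20.0) for t in ensemble)

x_new = 20.0  # well outside the training range
print(predict(const_tree, x_new))  # flat: the mean of the right-hand leaf
print(predict(model_tree, x_new))  # follows the linear trend
print(ensemble_pred)               # average over randomly thresholded trees
```

On this toy data the constant-leaf tree returns the mean of its right-hand leaf for any query beyond the training range, whereas the linear-leaf trees continue the trend. Real data is noisy and multivariate, so this only motivates why leaf models and threshold selection matter for extrapolation.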
