Sparse multivariate regression with missing values and its application to the prediction of material properties

03/17/2021
by   Keisuke Teramoto, et al.
0

In the field of materials science and engineering, statistical analysis and machine learning techniques have recently been used to predict multiple material properties from an experimental design. These material properties correspond to response variables in the multivariate regression model. This study conducts a penalized maximum likelihood procedure to estimate model parameters, including the regression coefficients and covariance matrix of response variables. In particular, we employ l_1-regularization to achieve a sparse estimation of regression coefficients and the inverse covariance matrix of response variables. In some cases, there may be a relatively large number of missing values in response variables, owing to the difficulty in collecting data on material properties. A method to improve prediction accuracy under the situation with missing values incorporates a correlation structure among the response variables into the statistical model. The expectation and maximization algorithm is constructed, which enables application to a data set with missing values in the responses. We apply our proposed procedure to real data consisting of 22 material properties.

READ FULL TEXT
research
06/19/2013

Joint estimation of sparse multivariate regression and conditional graphical models

Multivariate regression model is a natural generalization of the classic...
research
09/19/2020

Stochastic Threshold Model Trees: A Tree-Based Ensemble Method for Dealing with Extrapolation

In the field of chemistry, there have been many attempts to predict the ...
research
05/21/2022

Multivariate generalized linear mixed models for underdispersed count data

Researchers are often interested in understanding the relationship betwe...
research
07/22/2021

Robust low-rank covariance matrix estimation with a general pattern of missing values

This paper tackles the problem of robust covariance matrix estimation wh...
research
06/21/2022

Conditional probability tensor decompositions for multivariate categorical response regression

In many modern regression applications, the response consists of multipl...
research
06/09/2021

On the Use of Minimum Penalties in Statistical Learning

Modern multivariate machine learning and statistical methodologies estim...
research
08/31/2018

An explicit mean-covariance parameterization for multivariate response linear regression

We develop a new method to fit the multivariate response linear regressi...

Please sign up or login with your details

Forgot password? Click here to reset