A Feature Importance Analysis for Soft-Sensing-Based Predictions in a Chemical Sulphonation Process

09/25/2020
by   Enrique Garcia-Ceja, et al.
0

In this paper we present the results of a feature importance analysis of a chemical sulphonation process. The task consists of predicting the neutralization number (NT), which is a metric that characterizes the product quality of active detergents. The prediction is based on a dataset of environmental measurements, sampled from an industrial chemical process. We used a soft-sensing approach, that is, predicting a variable of interest based on other process variables, instead of directly sensing the variable of interest. Reasons for doing so range from expensive sensory hardware to harsh environments, e.g., inside a chemical reactor. The aim of this study was to explore and detect which variables are the most relevant for predicting product quality, and to what degree of precision. We trained regression models based on linear regression, regression tree and random forest. A random forest model was used to rank the predictor variables by importance. Then, we trained the models in a forward-selection style by adding one feature at a time, starting with the most important one. Our results show that it is sufficient to use the top 3 important variables, out of the 8 variables, to achieve satisfactory prediction results. On the other hand, Random Forest obtained the best result when trained with all variables.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2020

Towards the Automation of a Chemical Sulphonation Process with Machine Learning

Nowadays, the continuous improvement and automation of industrial proces...
research
06/24/2016

Regression Trees and Random forest based feature selection for malaria risk exposure prediction

This paper deals with prediction of anopheles number, the main vector of...
research
10/12/2021

Predicting the Stereoselectivity of Chemical Transformations by Machine Learning

Stereoselective reactions (both chemical and enzymatic reactions) have b...
research
12/05/2019

Asymptotic Unbiasedness of the Permutation Importance Measure in Random Forest Models

Variable selection in sparse regression models is an important task as a...
research
02/08/2022

A Unified Prediction Framework for Signal Maps

Signal maps are essential for the planning and operation of cellular net...
research
03/02/2023

A Notion of Feature Importance by Decorrelation and Detection of Trends by Random Forest Regression

In many studies, we want to determine the influence of certain features ...
research
02/10/2021

Feature Analyses and Modelling of Lithium-ion Batteries Manufacturing based on Random Forest Classification

Lithium-ion battery manufacturing is a highly complicated process with s...

Please sign up or login with your details

Forgot password? Click here to reset