Data Mining using Unguided Symbolic Regression on a Blast Furnace Dataset

09/23/2013
by   Michael Kommenda, et al.
0

In this paper a data mining approach for variable selection and knowledge extraction from datasets is presented. The approach is based on unguided symbolic regression (every variable present in the dataset is treated as the target variable in multiple regression runs) and a novel variable relevance metric for genetic programming. The relevance of each input variable is calculated and a model approximating the target variable is created. The genetic programming configurations with different target variables are executed multiple times to reduce stochastic effects and the aggregated results are displayed as a variable interaction network. This interaction network highlights important system components and implicit relations between the variables. The whole approach is tested on a blast furnace dataset, because of the complexity of the blast furnace and the many interrelations between the variables. Finally the achieved results are discussed with respect to existing knowledge about the blast furnace process.

READ FULL TEXT
research
12/10/2012

Macro-Economic Time Series Modeling and Interaction Networks

Macro-economic models describe the dynamics of economic quantities. The ...
research
12/15/2014

GPTIPS 2: an open-source software platform for symbolic data mining

GPTIPS is a free, open source MATLAB based software platform for symboli...
research
09/13/2023

Racing Control Variable Genetic Programming for Symbolic Regression

Symbolic regression, as one of the most crucial tasks in AI for science,...
research
05/25/2023

Symbolic Regression via Control Variable Genetic Programming

Learning symbolic expressions directly from experiment data is a vital s...
research
04/25/2022

Transformation-Interaction-Rational Representation for Symbolic Regression

Symbolic Regression searches for a function form that approximates a dat...
research
04/22/2016

An improved chromosome formulation for genetic algorithms applied to variable selection with the inclusion of interaction terms

Genetic algorithms are a well-known method for tackling the problem of v...
research
05/05/2023

On the use of ordered factors as explanatory variables

Consider a regression or some regression-type model for a certain respon...

Please sign up or login with your details

Forgot password? Click here to reset