Missing Data Prediction and Classification: The Use of Auto-Associative Neural Networks and Optimization Algorithms

03/21/2014
by   Collins Leke, et al.
0

This paper presents methods which are aimed at finding approximations to missing data in a dataset by using optimization algorithms to optimize the network parameters after which prediction and classification tasks can be performed. The optimization methods that are considered are genetic algorithm (GA), simulated annealing (SA), particle swarm optimization (PSO), random forest (RF) and negative selection (NS) and these methods are individually used in combination with auto-associative neural networks (AANN) for missing data estimation and the results obtained are compared. The methods suggested use the optimization algorithms to minimize an error function derived from training the auto-associative neural network during which the interrelationships between the inputs and the outputs are obtained and stored in the weights connecting the different layers of the network. The error function is expressed as the square of the difference between the actual observations and predicted values from an auto-associative neural network. In the event of missing data, all the values of the actual observations are not known hence, the error function is decomposed to depend on the known and unknown variable values. Multi-layer perceptron (MLP) neural network is employed to train the neural networks using the scaled conjugate gradient (SCG) method. Prediction accuracy is determined by mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), and correlation coefficient (r) computations. Accuracy in classification is obtained by plotting ROC curves and calculating the areas under these. Analysis of results depicts that the approach using RF with AANN produces the most accurate predictions and classifications while on the other end of the scale is the approach which entails using NS with AANN.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/20/2020

A Bayesian regularized feed-forward neural network model for conductivity prediction of PS/MWCNT nanocomposite film coatings

In our present work, a multi-layered feed-forward neural network (FFNN) ...
research
07/29/2021

Artificial Intelligence Hybrid Deep Learning Model for Groundwater Level Prediction Using MLP-ADAM

Groundwater is the largest storage of freshwater resources, which serves...
research
08/27/2018

Combining Predictions of Auto Insurance Claims

This paper aims at achieving better performance of prediction by combini...
research
01/03/2017

New Methods of Enhancing Prediction Accuracy in Linear Models with Missing Data

In this paper, prediction for linear systems with missing information is...
research
11/05/2019

The correlation-assisted missing data estimator

We introduce a novel approach to estimation problems in settings with mi...
research
02/01/2021

Basis Function Based Data Driven Learning for the Inverse Problem of Electrocardiography

Objective: This paper proposes an neural network approach for predicting...
research
07/19/2021

A Modulation Layer to Increase Neural Network Robustness Against Data Quality Issues

Data quality is a common problem in machine learning, especially in high...

Please sign up or login with your details

Forgot password? Click here to reset