Robustness to Missing Features using Hierarchical Clustering with Split Neural Networks

11/19/2020
by   Rishab Khincha, et al.
0

The problem of missing data has been persistent for a long time and poses a major obstacle in machine learning and statistical data analysis. Past works in this field have tried using various data imputation techniques to fill in the missing data, or training neural networks (NNs) with the missing data. In this work, we propose a simple yet effective approach that clusters similar input features together using hierarchical clustering and then trains proportionately split neural networks with a joint loss. We evaluate this approach on a series of benchmark datasets and show promising improvements even with simple imputation techniques. We attribute this to learning through clusters of similar features in our model architecture. The source code is available at https://github.com/usarawgi911/Robustness-to-Missing-Features

READ FULL TEXT

page 1

page 2

research
12/04/2015

Proposition of a Theoretical Model for Missing Data Imputation using Deep Learning and Evolutionary Algorithms

In the last couple of decades, there has been major advancements in the ...
research
02/12/2016

A Minimalistic Approach to Sum-Product Network Learning for Real Applications

Sum-Product Networks (SPNs) are a class of expressive yet tractable hier...
research
04/10/2023

Missing Data Imputation with Graph Laplacian Pyramid Network

Data imputation is a prevalent and important task due to the ubiquitousn...
research
09/25/2020

Why have a Unified Predictive Uncertainty? Disentangling it using Deep Split Ensembles

Understanding and quantifying uncertainty in black box Neural Networks (...
research
07/31/2021

Missingness Augmentation: A General Approach for Improving Generative Imputation Models

Despite tremendous progress in missing data imputation task, designing n...
research
11/27/2020

Clustering with missing data: which equivalent for Rubin's rules?

Multiple imputation (MI) is a popular method for dealing with missing va...
research
03/24/2023

Variational Inference for Longitudinal Data Using Normalizing Flows

This paper introduces a new latent variable generative model able to han...

Please sign up or login with your details

Forgot password? Click here to reset