Handling Missing Data in Decision Trees: A Probabilistic Approach

06/29/2020
by Pasha Khosravi, et al.

Decision trees are a popular family of models due to their attractive properties, such as interpretability and the ability to handle heterogeneous data. At the same time, missing data is a prevalent occurrence that hinders the performance of machine learning models. As such, handling missing data in decision trees is a well-studied problem. In this paper, we tackle this problem by taking a probabilistic approach. At deployment time, we use tractable density estimators to compute the "expected prediction" of our models. At learning time, we fine-tune the parameters of already learned trees by minimizing their "expected prediction loss" w.r.t. our density estimators. We provide brief experiments showcasing the effectiveness of our methods compared to a few baselines.
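
To make the deployment-time idea concrete, here is a minimal sketch of an "expected prediction" for a small tree over binary features. It is not the authors' implementation. The density estimator is replaced by a fully factorized distribution (each feature is 1 with a fixed probability, independent of the others), which is an assumption chosen only to keep the example short; the abstract says only that tractable density estimators are used. The names `Leaf`, `Split`, `expected_prediction`, and `p_one` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass
class Leaf:
    value: float  # prediction stored at this leaf


@dataclass
class Split:
    feature: str   # binary feature tested at this node
    left: "Node"   # branch taken when the feature is 0
    right: "Node"  # branch taken when the feature is 1


Node = Union[Leaf, Split]


def expected_prediction(node: Node,
                        observed: Dict[str, int],
                        p_one: Dict[str, float]) -> float:
    """Average the tree's output over the missing features.

    observed: values of the features that are present (0 or 1).
    p_one:    ASSUMED factorized density, p_one[f] = P(f = 1).
    """
    if isinstance(node, Leaf):
        return node.value
    if node.feature in observed:
        # Feature is known: follow the usual decision path.
        child = node.right if observed[node.feature] == 1 else node.left
        return expected_prediction(child, observed, p_one)
    # Feature is missing: weight both branches by their probability.
    # The value is fixed inside each branch so that a feature tested
    # twice on one path is treated consistently.
    p = p_one[node.feature]
    left = expected_prediction(node.left, {**observed, node.feature: 0}, p_one)
    right = expected_prediction(node.right, {**observed, node.feature: 1}, p_one)
    return (1.0 - p) * left + p * right


# Toy tree: test "a", then test "b" on the a = 1 branch.
tree = Split("a", Leaf(0.0), Split("b", Leaf(1.0), Leaf(3.0)))
p_one = {"a": 0.5, "b": 0.25}

fully_missing = expected_prediction(tree, {}, p_one)
b_missing = expected_prediction(tree, {"a": 1}, p_one)
nothing_missing = expected_prediction(tree, {"a": 1, "b": 1}, p_one)
```

With every feature observed, the function reduces to an ordinary tree prediction. With features missing, it returns the probability-weighted average of the leaves that remain reachable. A richer density estimator would change only how the branch probabilities are computed, since they would then depend on the observed features.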


research
09/07/2023

Trinary Decision Trees for missing value handling

This paper introduces the Trinary decision tree, an algorithm designed t...
research
09/18/2017

Early prediction of the duration of protests using probabilistic Latent Dirichlet Allocation and Decision Trees

Protests and agitations are an integral part of every democratic civil s...
research
06/25/2020

Joints in Random Forests

Decision Trees (DTs) and Random Forests (RFs) are powerful discriminativ...
research
02/13/2017

metboost: Exploratory regression analysis with hierarchically clustered data

As data collections become larger, exploratory regression analysis becom...
research
06/03/2013

Prediction with Missing Data via Bayesian Additive Regression Trees

We present a method for incorporating missing data in non-parametric sta...
research
08/20/2016

Reweighting with Boosted Decision Trees

Machine learning tools are commonly used in modern high energy physics (...
research
10/23/2020

An Analysis of LIME for Text Data

Text data are increasingly handled in an automated fashion by machine le...
