Emergent and Unspecified Behaviors in Streaming Decision Trees

10/16/2020
by Chaitanya Manapragada, et al.

Hoeffding trees are the state-of-the-art methods in decision tree learning for evolving data streams. Because of their efficiency, these very fast decision trees are used in many real applications where data is created in real time. In this work, we tease out explanations for why these streaming decision tree algorithms for stationary and nonstationary streams (HoeffdingTree and HoeffdingAdaptiveTree, respectively) work as well as they do. In doing so, we identify thirteen unique unspecified design decisions, in both the theoretical constructs and their implementations, that have substantial and consequential effects on predictive accuracy. Without necessarily changing the essence of the algorithms, these design decisions drive algorithm performance. We begin a larger conversation about explainability not just of the model but also of the processes responsible for an algorithm's success.

research
10/13/2020

Succinct Explanations With Cascading Decision Trees

Classic decision tree learning is a binary classification algorithm that...
research
07/28/2016

VHT: Vertical Hoeffding Tree

IoT Big Data requires new machine learning methods able to scale to larg...
research
02/05/2020

Fast inference of Boosted Decision Trees in FPGAs for particle physics

We describe the implementation of Boosted Decision Trees in the hls4ml l...
research
10/14/2020

Adaptive Deep Forest for Online Learning from Drifting Data Streams

Learning from data streams is among the most vital fields of contemporar...
research
10/16/2021

Streaming Decision Trees and Forests

Machine learning has successfully leveraged modern data and provided com...
research
04/20/2023

Comparative Analysis of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes

In this paper, we consider classes of decision tables with many-valued d...
research
10/27/2020

Realization of Random Forest for Real-Time Evaluation through Tree Framing

The optimization of learning has always been of particular concern for b...
