hi-RF: Incremental Learning Random Forest for large-scale multi-class Data Classification

08/31/2016
by   Tingting Xie, et al.
0

In recent years, dynamically growing data and incrementally growing number of classes pose new challenges to large-scale data classification research. Most traditional methods struggle to balance the precision and computational burden when data and its number of classes increased. However, some methods are with weak precision, and the others are time-consuming. In this paper, we propose an incremental learning method, namely, heterogeneous incremental Nearest Class Mean Random Forest (hi-RF), to handle this issue. It is a heterogeneous method that either replaces trees or updates trees leaves in the random forest adaptively, to reduce the computational time in comparable performance, when data of new classes arrive. Specifically, to keep the accuracy, one proportion of trees are replaced by new NCM decision trees; to reduce the computational load, the rest trees are updated their leaves probabilities only. Most of all, out-of-bag estimation and out-of-bag boosting are proposed to balance the accuracy and the computational efficiency. Fair experiments were conducted and demonstrated its comparable precision with much less computational time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2018

A Dynamic Boosted Ensemble Learning Method Based on Random Forest

We propose a dynamic boosted ensemble learning method based on random fo...
research
11/28/2022

Data-driven multinomial random forest

In this paper, we strengthen the previous weak consistency proof method ...
research
12/26/2020

Explainable Multi-class Classification of Medical Data

Machine Learning applications have brought new insights into a secondary...
research
09/02/2019

Guided Random Forest and its application to data approximation

We present a new way of constructing an ensemble classifier, named the G...
research
01/08/2018

Deep Nearest Class Mean Model for Incremental Odor Classification

In recent years, more and more machine learning algorithms have been app...
research
05/16/2017

To tune or not to tune the number of trees in random forest?

The number of trees T in the random forest (RF) algorithm for supervised...
research
07/19/2018

A Projection Pursuit Forest Algorithm for Supervised Classification

This paper presents a new ensemble learning method for classification pr...

Please sign up or login with your details

Forgot password? Click here to reset