Popular Ensemble Methods: An Empirical Study

06/01/2011
by   R. Maclin, et al.

An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more accurate than any of the single classifiers in the ensemble. Bagging (Breiman, 1996c) and Boosting (Freund and Schapire, 1996; Schapire, 1990) are two relatively new but popular methods for producing ensembles. In this paper we evaluate these methods on 23 data sets using both neural networks and decision trees as the classification algorithms. Our results support several conclusions. First, while Bagging is almost always more accurate than a single classifier, it is sometimes much less accurate than Boosting. On the other hand, Boosting can create ensembles that are less accurate than a single classifier, especially when using neural networks. Analysis indicates that the performance of the Boosting methods depends on the characteristics of the data set being examined. In fact, further results show that Boosting ensembles may overfit noisy data sets, which decreases their performance. Finally, consistent with previous studies, our work suggests that most of the gain in an ensemble's performance comes from the first few classifiers combined; however, relatively large gains can be seen up to 25 classifiers when Boosting decision trees.
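To make the idea of an ensemble concrete, here is a minimal sketch of Bagging in Python. It is an illustration under stated assumptions, not the setup evaluated in the paper: the base learner is a one-feature decision stump rather than a neural network or full decision tree, and the names `train_stump`, `bagging`, and `predict` are hypothetical. Each classifier is trained on a bootstrap sample (drawn with replacement) of the training set, and the ensemble classifies a new instance by majority vote.

```python
import random
from collections import Counter


def train_stump(samples):
    """Fit a decision stump on (x, label) pairs, with x a float and label in {0, 1}.

    Assumption: a stump stands in for the paper's neural networks and decision trees.
    """
    best = None
    for threshold in sorted({x for x, _ in samples}):
        for polarity in (0, 1):
            # polarity is the label predicted when x >= threshold
            errors = sum(
                1
                for x, y in samples
                if (polarity if x >= threshold else 1 - polarity) != y
            )
            if best is None or errors < best[0]:
                best = (errors, threshold, polarity)
    _, threshold, polarity = best
    return lambda x: polarity if x >= threshold else 1 - polarity


def bagging(samples, n_classifiers, rng):
    """Train n_classifiers stumps, each on its own bootstrap sample."""
    ensemble = []
    for _ in range(n_classifiers):
        # Bootstrap: draw len(samples) points with replacement.
        bootstrap = [rng.choice(samples) for _ in samples]
        ensemble.append(train_stump(bootstrap))
    return ensemble


def predict(ensemble, x):
    """Combine the individual predictions by unweighted majority vote."""
    votes = Counter(classifier(x) for classifier in ensemble)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy data (assumed for illustration): label is 1 when x >= 0.5.
    data = [(i / 20, 1 if i >= 10 else 0) for i in range(20)]
    ensemble = bagging(data, n_classifiers=11, rng=rng)
    print(predict(ensemble, 0.0), predict(ensemble, 1.0))
```

Boosting differs in how the training sets are formed: rather than sampling uniformly, each new classifier concentrates on the instances that earlier classifiers misclassified. The sketch above does not implement that reweighting.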


