Boosting with Tempered Exponential Measures

06/08/2023
by   Richard Nock, et al.
0

One of the most popular ML algorithms, AdaBoost, can be derived from the dual of a relative entropy minimization problem subject to the fact that the positive weights on the examples sum to one. Essentially, harder examples receive higher probabilities. We generalize this setup to the recently introduced tempered exponential measures (TEMs) where normalization is enforced on a specific power of the measure and not the measure itself. TEMs are indexed by a parameter t and generalize exponential families (t=1). Our algorithm, t-AdaBoost, recovers AdaBoost as a special case (t=1). We show that t-AdaBoost retains AdaBoost's celebrated exponential convergence rate when t∈ [0,1) while allowing a slight improvement of the rate's hidden constant compared to t=1. t-AdaBoost partially computes on a generalization of classical arithmetic over the reals and brings notable properties like guaranteed bounded leveraging coefficients for t∈ [0,1). From the loss that t-AdaBoost minimizes (a generalization of the exponential loss), we show how to derive a new family of tempered losses for the induction of domain-partitioning classifiers like decision trees. Crucially, strict properness is ensured for all while their boosting rates span the full known spectrum. Experiments using t-AdaBoost+trees display that significant leverage can be achieved by tuning t.

READ FULL TEXT

page 10

page 42

research
07/04/2012

Obtaining Calibrated Probabilities from Boosting

Boosted decision trees typically yield good accuracy, precision, and ROC...
research
11/04/2022

Clustering above Exponential Families with Tempered Exponential Measures

The link with exponential families has allowed k-means clustering to be ...
research
07/26/2021

Tsallis and Rényi deformations linked via a new λ-duality

Tsallis and Rényi entropies, which are monotone transformations of such ...
research
09/05/2011

Learning Nonlinear Functions Using Regularized Greedy Forest

We consider the problem of learning a forest of nonlinear decision rules...
research
01/23/2013

Relative Loss Bounds for On-line Density Estimation with the Exponential Family of Distributions

We consider on-line density estimation with a parameterized density from...
research
01/26/2020

Boosted and Differentially Private Ensembles of Decision Trees

Boosted ensemble of decision tree (DT) classifiers are extremely popular...

Please sign up or login with your details

Forgot password? Click here to reset