Binary Excess Risk for Smooth Convex Surrogates

02/07/2014
by   Mehrdad Mahdavi, et al.
0

In statistical learning theory, convex surrogates of the 0-1 loss are highly preferred because of the computational and theoretical virtues that convexity brings in. This is of more importance if we consider smooth surrogates as witnessed by the fact that the smoothness is further beneficial both computationally- by attaining an optimal convergence rate for optimization, and in a statistical sense- by providing an improved optimistic rate for generalization bound. In this paper we investigate the smoothness property from the viewpoint of statistical consistency and show how it affects the binary excess risk. We show that in contrast to optimization and generalization errors that favor the choice of smooth surrogate loss, the smoothness of loss function may degrade the binary excess risk. Motivated by this negative result, we provide a unified analysis that integrates optimization error, generalization bound, and the error in translating convex excess risk into a binary excess risk when examining the impact of smoothness on the binary excess risk. We show that under favorable conditions appropriate choice of smooth convex loss will result in a binary excess risk that is better than O(1/√(n)).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2019

Stochastic Approximation of Smooth and Strongly Convex Functions: Beyond the O(1/T) Convergence Rate

Stochastic approximation (SA) is a classical approach for stochastic con...
research
02/27/2021

Learning with Smooth Hinge Losses

Due to the non-smoothness of the Hinge loss in SVM, it is difficult to o...
research
02/19/2018

Understanding the Loss Surface of Neural Networks for Binary Classification

It is widely conjectured that the reason that training algorithms for ne...
research
05/03/2014

On Lipschitz Continuity and Smoothness of Loss Functions in Learning to Rank

In binary classification and regression problems, it is well understood ...
research
09/02/2022

Optimizing the Performative Risk under Weak Convexity Assumptions

In performative prediction, a predictive model impacts the distribution ...
research
02/17/2021

Outside the Echo Chamber: Optimizing the Performative Risk

In performative prediction, predictions guide decision-making and hence ...
research
05/25/2023

How many samples are needed to leverage smoothness?

A core principle in statistical learning is that smoothness of target fu...

Please sign up or login with your details

Forgot password? Click here to reset