Revisiting Discriminative vs. Generative Classifiers: Theory and Implications

02/05/2023
by Chenyu Zheng, et al.

Large-scale deep models pre-trained on massive labeled or unlabeled data transfer well to downstream tasks. Linear evaluation freezes the parameters of the pre-trained model and trains a separate linear classifier on top of it, which makes transfer efficient and attractive. However, little work has investigated classifiers for linear evaluation other than the default logistic regression. Inspired by the statistical efficiency of naive Bayes, we revisit the classical topic of discriminative vs. generative classifiers. Theoretically, we analyze the surrogate loss instead of the zero-one loss and generalize the classical results from the binary case to the multiclass case. We show that, under mild assumptions, multiclass naive Bayes requires O(log n) samples to approach its asymptotic error, while the corresponding multiclass logistic regression requires O(n) samples, where n is the feature dimension. To establish this, we present a multiclass ℋ-consistency bound framework and an explicit bound for the logistic loss, both of which are of independent interest. Simulation results on a mixture of Gaussians validate our theoretical findings. Experiments on various pre-trained deep vision models show that naive Bayes consistently converges faster as the amount of data increases. In addition, naive Bayes shows promise in few-shot cases, and we observe the “two regimes” phenomenon in pre-trained supervised models. Our code is available at https://github.com/ML-GSAI/Revisiting-Dis-vs-Gen-Classifiers.
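
The comparison described in the abstract can be illustrated with a toy simulation. The sketch below is not the authors' code from the repository above; it is a minimal, standard-library-only illustration that assumes a two-class mixture of isotropic Gaussians, a Gaussian naive Bayes model with a shared per-feature variance, and logistic regression fitted by plain gradient descent. All function names, the feature dimension (20), the class separation (`shift=0.5`), and the training-set sizes are illustrative assumptions, and a run this small should not be expected to reproduce the paper's sample-complexity results.

```python
import math
import random


def sample_mixture(m, dim, rng, shift=0.5):
    """Draw m labeled points: y is 0 or 1, x ~ N(-shift, I) or N(+shift, I)."""
    xs, ys = [], []
    for _ in range(m):
        y = rng.randrange(2)
        mu = shift if y == 1 else -shift
        xs.append([rng.gauss(mu, 1.0) for _ in range(dim)])
        ys.append(y)
    return xs, ys


def fit_naive_bayes(xs, ys):
    """Gaussian naive Bayes: per-class means, shared per-feature variance."""
    n, dim = len(xs), len(xs[0])
    counts = [ys.count(0), ys.count(1)]
    means = [[0.0] * dim for _ in range(2)]
    for x, y in zip(xs, ys):
        for j in range(dim):
            means[y][j] += x[j] / counts[y]
    var = [1e-3] * dim  # small floor keeps the variance strictly positive
    for x, y in zip(xs, ys):
        for j in range(dim):
            var[j] += (x[j] - means[y][j]) ** 2 / n
    # Add-one smoothing so an unseen class never yields log(0).
    log_prior = [math.log((c + 1) / (n + 2)) for c in counts]
    return means, var, log_prior


def predict_naive_bayes(model, x):
    means, var, log_prior = model
    scores = [
        log_prior[c]
        - sum((xj - mj) ** 2 / (2 * vj) for xj, mj, vj in zip(x, means[c], var))
        for c in range(2)
    ]
    return int(scores[1] > scores[0])


def fit_logistic(xs, ys, steps=200, lr=0.1):
    """Binary logistic regression via full-batch gradient descent."""
    n, dim = len(xs), len(xs[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(xs, ys):
            z = b + sum(wj * xj for wj, xj in zip(w, x))
            z = max(-30.0, min(30.0, z))  # clip to avoid overflow in exp
            err = 1.0 / (1.0 + math.exp(-z)) - y
            grad_b += err / n
            for j in range(dim):
                grad_w[j] += err * x[j] / n
        b -= lr * grad_b
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
    return w, b


def predict_logistic(model, x):
    w, b = model
    return int(b + sum(wj * xj for wj, xj in zip(w, x)) > 0)


def accuracy(predict, model, xs, ys):
    return sum(predict(model, x) == y for x, y in zip(xs, ys)) / len(xs)


rng = random.Random(0)
dim = 20
test_x, test_y = sample_mixture(2000, dim, rng)
results = {}  # training-set size -> (naive Bayes accuracy, logistic accuracy)
for m in (10, 40, 160):
    train_x, train_y = sample_mixture(m, dim, rng)
    nb_model = fit_naive_bayes(train_x, train_y)
    lr_model = fit_logistic(train_x, train_y)
    results[m] = (
        accuracy(predict_naive_bayes, nb_model, test_x, test_y),
        accuracy(predict_logistic, lr_model, test_x, test_y),
    )
    print(m, results[m])
```

In a linear-evaluation setting, the rows of `train_x` would instead be features extracted from the frozen pre-trained model, and both classifiers would be fitted on those features without updating the backbone.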

research
12/25/2020

Using the Naive Bayes as a discriminative classifier

For classification tasks, probabilistic models can be categorized into t...
research
05/23/2019

Naive Feature Selection: Sparsity in Naive Bayes

Due to its linear complexity, naive Bayes classification remains an attr...
research
01/03/2022

Deriving discriminative classifiers from generative models

We deal with Bayesian generative and discriminative classifiers. Given a...
research
08/29/2023

Read-only Prompt Optimization for Vision-Language Few-shot Learning

In recent years, prompt tuning has proven effective in adapting pre-trai...
research
06/05/2023

Enhancing naive classifier for positive unlabeled data based on logistic regression approach

We argue that for analysis of Positive Unlabeled (PU) data under Selecte...
research
11/22/2021

A Semi-Supervised Adaptive Discriminative Discretization Method Improving Discrimination Power of Regularized Naive Bayes

Recently, many improved naive Bayes methods have been developed with enh...
research
02/16/2021

Constructing Multiclass Classifiers using Binary Classifiers Under Log-Loss

The construction of multiclass classifiers from binary classifiers is st...
