Generative and Discriminative Text Classification with Recurrent Neural Networks

03/06/2017
by Dani Yogatama, et al.
We empirically characterize the performance of discriminative and generative LSTM models for text classification. We find that although RNN-based generative models are more powerful than their bag-of-words ancestors (e.g., they account for conditional dependencies across words in a document), they have higher asymptotic error rates than discriminatively trained RNN models. However, we also find that generative models approach their asymptotic error rate more rapidly than their discriminative counterparts, the same pattern that Ng & Jordan (2001) proved holds for linear classification models that make more naive conditional independence assumptions. Building on this finding, we hypothesize that RNN-based generative classification models will be more robust to shifts in the data distribution. This hypothesis is confirmed in a series of experiments in zero-shot and continual learning settings, which show that generative models substantially outperform discriminative models.
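
As a rough illustration of what "generative classification" means here, the sketch below scores a document under one model per class and picks the label with Bayes' rule, p(y|x) proportional to p(x|y) p(y). It is not the paper's model: a smoothed unigram model stands in for the class-conditional LSTM, and the function names, toy counts, and the `alpha` smoothing parameter are all assumptions made for the example.

```python
import math


def unigram_log_likelihood(tokens, counts, vocab_size, alpha=1.0):
    """Stand-in for a class-conditional language model (assumption, not an LSTM).

    Returns log p(x | y) under an add-alpha smoothed unigram model whose
    word counts for class y are given in `counts`.
    """
    total = sum(counts.values())
    return sum(
        math.log((counts.get(tok, 0) + alpha) / (total + alpha * vocab_size))
        for tok in tokens
    )


def generative_predict(class_log_likelihoods, class_log_priors):
    """Apply Bayes' rule to per-class scores and return (label, posterior)."""
    joint = {
        y: class_log_likelihoods[y] + class_log_priors[y]
        for y in class_log_likelihoods
    }
    # Normalize with log-sum-exp for numerical stability.
    peak = max(joint.values())
    log_z = peak + math.log(sum(math.exp(v - peak) for v in joint.values()))
    posterior = {y: math.exp(v - log_z) for y, v in joint.items()}
    label = max(posterior, key=posterior.get)
    return label, posterior


# Hypothetical toy data: word counts observed per class.
class_counts = {
    "sports": {"goal": 3, "match": 2},
    "tech": {"code": 3, "bug": 2},
}
vocab_size = 4
log_priors = {"sports": math.log(0.5), "tech": math.log(0.5)}

document = ["goal", "match"]
log_likelihoods = {
    y: unigram_log_likelihood(document, counts, vocab_size)
    for y, counts in class_counts.items()
}
label, posterior = generative_predict(log_likelihoods, log_priors)
```

A discriminative model would instead map the document directly to p(y|x) without modeling p(x|y). In this sketch, adding a class only requires fitting one more per-class model, which is one intuition for why the generative setup may suit the zero-shot and continual learning settings mentioned in the abstract.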

research
06/04/2019

Conditional Generative Models are not Robust

Class-conditional generative models are an increasingly popular approach...
research
10/01/2019

Latent-Variable Generative Models for Data-Efficient Text Classification

Generative classifiers offer potential advantages over their discriminat...
research
01/31/2022

Deep discriminative to kernel generative modeling

The fight between discriminative versus generative goes deep, in both th...
research
08/31/2023

Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models

Zero-shot referring image segmentation is a challenging task because it ...
research
06/06/2022

Discriminative Models Can Still Outperform Generative Models in Aspect Based Sentiment Analysis

Aspect-based Sentiment Analysis (ABSA) helps to explain customers' opini...
research
05/10/2023

A Hybrid of Generative and Discriminative Models Based on the Gaussian-coupled Softmax Layer

Generative models have advantageous characteristics for classification t...
research
07/10/2018

Vision System for AGI: Problems and Directions

What frameworks and architectures are necessary to create a vision syste...
