Are Deep Sequence Classifiers Good at Non-Trivial Generalization?

10/24/2022
by   Francesco Cazzaro, et al.
0

Recent advances in deep learning models for sequence classification have greatly improved their classification accuracy, specially when large training sets are available. However, several works have suggested that under some settings the predictions made by these models are poorly calibrated. In this work we study binary sequence classification problems and we look at model calibration from a different perspective by asking the question: Are deep learning models capable of learning the underlying target class distribution? We focus on sparse sequence classification, that is problems in which the target class is rare and compare three deep learning sequence classification models. We develop an evaluation that measures how well a classifier is learning the target class distribution. In addition, our evaluation disentangles good performance achieved by mere compression of the training sequences versus performance achieved by proper model generalization. Our results suggest that in this binary setting the deep-learning models are indeed able to learn the underlying class distribution in a non-trivial manner, i.e. by proper generalization beyond data compression.

READ FULL TEXT
research
06/16/2023

Multi-Classification using One-versus-One Deep Learning Strategy with Joint Probability Estimates

The One-versus-One (OvO) strategy is an approach of multi-classification...
research
06/27/2017

Cross-Country Skiing Gears Classification using Deep Learning

Human Activity Recognition has witnessed a significant progress in the l...
research
10/24/2022

Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup

Mixup is a data augmentation technique that relies on training using ran...
research
06/15/2023

Hierarchical confusion matrix for classification performance evaluation

In this work we propose a novel concept of a hierarchical confusion matr...
research
07/24/2018

Uncertainty Modelling in Deep Networks: Forecasting Short and Noisy Series

Deep Learning is a consolidated, state-of-the-art Machine Learning tool ...
research
06/24/2015

Learning Representations from Deep Networks Using Mode Synthesizers

Deep learning Networks play a crucial role in the evolution of a vast nu...
research
09/07/2016

Fitted Learning: Models with Awareness of their Limits

Though deep learning has pushed the boundaries of classification forward...

Please sign up or login with your details

Forgot password? Click here to reset