Error rate control for classification rules in multiclass mixture models

09/29/2021
by   Tristan Mary-Huard, et al.
0

In the context of finite mixture models one considers the problem of classifying as many observations as possible in the classes of interest while controlling the classification error rate in these same classes. Similar to what is done in the framework of statistical test theory, different type I and type II-like classification error rates can be defined, along with their associated optimal rules, where optimality is defined as minimizing type II error rate while controlling type I error rate at some nominal level. It is first shown that finding an optimal classification rule boils down to searching an optimal region in the observation space where to apply the classical Maximum A Posteriori (MAP) rule. Depending on the misclassification rate to be controlled, the shape of the optimal region is provided, along with a heuristic to compute the optimal classification rule in practice. In particular, a multiclass FDR-like optimal rule is defined and compared to the thresholded MAP rules that is used in most applications. It is shown on both simulated and real datasets that the FDR-like optimal rule may be significantly less conservative than the thresholded MAP rule.

READ FULL TEXT

page 20

page 31

research
07/10/2013

Error Rate Bounds in Crowdsourcing Models

Crowdsourcing is an effective tool for human-powered computation on many...
research
12/30/2017

Adaptive Sign Error Control

In multiple testing scenarios, typically the sign of a parameter is infe...
research
11/15/2014

Error Rate Bounds and Iterative Weighted Majority Voting for Crowdsourcing

Crowdsourcing has become an effective and popular tool for human-powered...
research
07/10/2023

Beyond the Two-Trials Rule

The two-trials rule for drug approval requires "at least two adequate an...
research
06/06/2021

Hierarchical Bayesian Mixture Models for Time Series Using Context Trees as State Space Partitions

A general Bayesian framework is introduced for mixture modelling and inf...
research
06/16/2022

On Error and Compression Rates for Prototype Rules

We study the close interplay between error and compression in the non-pa...
research
07/02/2019

The Ethical Dilemma when (not) Setting up Cost-based Decision Rules in Semantic Segmentation

Neural networks for semantic segmentation can be seen as statistical mod...

Please sign up or login with your details

Forgot password? Click here to reset