Optimal classification and generalized prevalence estimates for diagnostic settings with more than two classes

10/05/2022
by Rayanne A. Luke, et al.

An accurate multiclass classification strategy is crucial to interpreting antibody tests. However, traditional methods based on confidence intervals or receiver operating characteristics lack clear extensions to settings with more than two classes. We address this problem by developing a multiclass classification scheme, based on probabilistic modeling and optimal decision theory, that minimizes a convex combination of false classification rates. Classification is challenging when the relative fraction of the population in each class, or generalized prevalence, is unknown. We therefore also develop a method for estimating the generalized prevalence of test data that is independent of classification. We validate our approach on serological data with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) naïve, previously infected, and vaccinated classes. Synthetic data are used to demonstrate that (i) prevalence estimates are unbiased and converge to true values and (ii) our procedure applies to arbitrary measurement dimensions. In contrast to the binary problem, the multiclass setting offers wide-reaching utility as the most general framework and provides new insight into prevalence estimation best practices.
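
The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; everything concrete in it is an assumption made for illustration: a one-dimensional measurement `r`, three classes with known Gaussian conditional densities (`MEANS`, `STDS`), and three fixed intervals (`EDGES`) used as counting domains. The function `classify` assigns each sample to the class with the largest prevalence-weighted density, which is the standard Bayes rule for minimizing a prevalence-weighted sum of misclassification rates. The function `estimate_prevalence` never classifies anything: it counts the fraction of samples falling in each interval and solves a linear system relating those fractions to the class prevalences.

```python
import math
import numpy as np

# Assumed (hypothetical) class-conditional models: naive, infected, vaccinated.
MEANS = np.array([0.0, 3.0, 6.0])
STDS = np.array([1.0, 1.0, 1.0])
# Assumed counting domains: (-inf, 1.5], (1.5, 4.5], (4.5, inf).
EDGES = np.array([-np.inf, 1.5, 4.5, np.inf])


def gaussian_pdf(r, mean, std):
    """Gaussian density, vectorized over NumPy arrays."""
    return np.exp(-0.5 * ((r - mean) / std) ** 2) / (std * np.sqrt(2.0 * np.pi))


def gaussian_cdf(x, mean, std):
    """Gaussian cumulative distribution for a scalar x (handles +/- inf)."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def classify(r, prevalence):
    """Assign each measurement to the class maximizing prevalence * density."""
    r = np.atleast_1d(np.asarray(r, dtype=float))
    dens = gaussian_pdf(r[:, None], MEANS[None, :], STDS[None, :])
    return np.argmax(dens * np.asarray(prevalence)[None, :], axis=1)


def domain_mass():
    """Matrix P with P[j, k] = probability that class k lands in domain j."""
    return np.array([
        [gaussian_cdf(EDGES[j + 1], m, s) - gaussian_cdf(EDGES[j], m, s)
         for m, s in zip(MEANS, STDS)]
        for j in range(len(EDGES) - 1)
    ])


def estimate_prevalence(samples):
    """Solve P q = Q, where Q holds the observed fraction in each domain."""
    samples = np.asarray(samples, dtype=float)
    idx = np.searchsorted(EDGES[1:-1], samples)  # domain index of each sample
    fractions = np.bincount(idx, minlength=len(MEANS)) / samples.size
    return np.linalg.solve(domain_mass(), fractions)


prevalence = np.array([0.5, 0.3, 0.2])
labels = classify([-1.0, 3.2, 7.5], prevalence)
```

Note that the unconstrained linear solve can return values slightly outside [0, 1] for small samples; a practical version would likely project the estimate back onto the probability simplex.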


research
12/18/2020

Classification Under Uncertainty: Data Analysis for Diagnostic Antibody Testing

Formulating accurate and robust classification strategies is a key chall...
research
03/24/2022

Minimizing Uncertainty in Prevalence Estimates

Estimating prevalence, the fraction of a population with a certain medic...
research
08/30/2023

Minimal Assumptions for Optimal Serology Classification: Theory and Implications for Multidimensional Settings and Impure Training Data

Minimizing error in prevalence estimates and diagnostic classifiers rema...
research
08/03/2022

Prevalence Estimation and Optimal Classification Methods to Account for Time Dependence in Antibody Levels

Serology testing can identify past infection by quantifying the immune r...
research
04/29/2019

Prevalence of international migration: an alternative for small area estimation

This paper introduces an alternative procedure for estimating the preval...
research
01/03/2022

A sampling scheme for estimating the prevalence of a pandemic

The spread of COVID-19 makes it essential to investigate its prevalence....
