Confidence Bands and Hypothesis Test Methods for Recall and Precision Curves at Extremely Small Fractions with Applications to Drug Discovery

12/19/2019
by   Jeremy R. Ash, et al.
0

In virtual screening for drug discovery, recall curves are used to assess the performance of ranking algorithms, in which recall is a function of the fraction of data prioritized for experimental testing. Unfortunately, researchers almost never consider the uncertainty in the estimation of the recall curve when benchmarking algorithms. We confirm that a recently developed procedure for estimating pointwise confidence intervals for recall curves – and closely related variants, such as precision curves – can be applied to a variety of simulated data sets representative of those typically encountered in virtual screening. Since it is more desirable in benchmarks to present the uncertainty of performance over a range of testing fractions, we extend the pointwise confidence interval procedure to allow for the estimation of confidence bands for these curves. We also present hypothesis test methods to determine significant differences between the curves for competing algorithms. We show these methods have high power to detect significant differences at a range of small fractions typically tested, while maintaining control of type I error rate. These methods enable statistically rigorous comparisons of virtual screening algorithms using a metric that quantifies the aspect of performance that is of primary interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2023

Precision and Recall Reject Curves for Classification

For some classification scenarios, it is desirable to use only those cla...
research
06/27/2022

Honest Confidence Bands for Isotonic Quantile Curves

We provide confidence bands for isotonic quantile curves in nonparametri...
research
06/01/2020

Regression Enrichment Surfaces: a Simple Analysis Technique for Virtual Drug Screening Models

We present a new method for understanding the performance of a model in ...
research
05/14/2019

Revisiting Precision and Recall Definition for Generative Model Evaluation

In this article we revisit the definition of Precision-Recall (PR) curve...
research
06/18/2012

Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation

Precision-recall (PR) curves and the areas under them are widely used to...
research
06/11/2023

On the Confidence Intervals in Bioequivalence Studies

A bioequivalence study is a type of clinical trial designed to compare t...
research
11/02/2019

Correcting for attenuation due to measurement error

I present a frequentist method for quantifying uncertainty when correcti...

Please sign up or login with your details

Forgot password? Click here to reset