Tighter Confidence Intervals for Rating Systems

12/07/2019
by   Robert Nowak, et al.
0

Rating systems are ubiquitous, with applications ranging from product recommendation to teaching evaluations. Confidence intervals for functionals of rating data such as empirical means or quantiles are critical to decision-making in various applications including recommendation/ranking algorithms. Confidence intervals derived from standard Hoeffding and Bernstein bounds can be quite loose, especially in small sample regimes, since these bounds do not exploit the geometric structure of the probability simplex. We propose a new approach to deriving confidence intervals that are tailored to the geometry associated with multi-star/value rating systems using a combination of techniques from information theory, including Kullback-Leibler, Sanov, and Csiszár inequalities. The new confidence intervals are almost always as good or better than all standard methods and are significantly tighter in many situations. The standard bounds can require several times more samples than our new bounds to achieve specified confidence interval widths.

READ FULL TEXT

page 11

page 12

page 20

research
12/23/2022

Simple Buehler-optimal confidence intervals on the average success probability of independent Bernoulli trials

One-sided confidence intervals are presented for the average of non-iden...
research
01/20/2017

Multivariate Confidence Intervals

Confidence intervals are a popular way to visualize and analyze data dis...
research
06/04/2018

Confidence Interval Estimators for MOS Values

For the quantification of QoE, subjects often provide individual rating ...
research
07/27/2020

Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation

In reinforcement learning, it is typical to use the empirically observed...
research
03/05/2019

Concentration-based confidence intervals for U-statistics

Concentration inequalities have become increasingly popular in machine l...
research
08/15/2020

Accountable Off-Policy Evaluation With Kernel Bellman Statistics

We consider off-policy evaluation (OPE), which evaluates the performance...
research
07/09/2020

Fixed-time descriptive statistics underestimate extremes of epidemic curve ensembles

Across the world, scholars are racing to predict the spread of the novel...

Please sign up or login with your details

Forgot password? Click here to reset