Automatic Classifiers as Scientific Instruments: One Step Further Away from Ground-Truth

12/19/2018
by   Jacob Whitehill, et al.
0

Automatic detectors of facial expression, gesture, affect, etc., can serve as scientific instruments to measure many behavioral and social phenomena (e.g., emotion, empathy, stress, engagement, etc.), and this has great potential to advance basic science. However, when a detector d is trained to approximate an existing measurement tool (e.g., observation protocol, questionnaire), then care must be taken when interpreting measurements collected using d since they are one step further removed from the underlying construct. We examine how the accuracy of d, as quantified by the correlation q of d's outputs with the ground-truth construct U, impacts the estimated correlation between U (e.g., stress) and some other phenomenon V (e.g., academic performance). In particular: (1) We show that if the true correlation between U and V is r, then the expected sample correlation, over all vectors T^n whose correlation with U is q, is qr. (2) We derive a formula to compute the probability that the sample correlation (over n subjects) using d is positive, given that the true correlation between U and V is negative (and vice-versa). We show that this probability is non-negligible (around 10-15%) for values of n and q that have been used in recent affective computing studies. (3) With the goal to reduce the variance of correlations estimated by an automatic detector, we show empirically that training multiple neural networks d^(1),...,d^(m) using different training configurations (e.g., architectures, hyperparameters) for the same detection task provides only limited `coverage' of T^n.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2022

AutoCOR: Autonomous Condylar Offset Ratio Calculator on TKA-Postoperative Lateral Knee X-ray

The postoperative range of motion is one of the crucial factors indicati...
research
04/13/2022

Does depth estimation help object detection?

Ground-truth depth, when combined with color data, helps improve object ...
research
07/01/2021

Investigating the Reliability of Self-report Survey in the Wild: The Quest for Ground Truth

Inferring human mental state (e.g., emotion, depression, engagement) wit...
research
11/08/2017

Inference of signals with unknown correlation structure from non-linear measurements

We present a method to reconstruct auto-correlated signals together with...
research
06/13/2017

Provable Alternating Gradient Descent for Non-negative Matrix Factorization with Strong Correlations

Non-negative matrix factorization is a basic tool for decomposing data i...
research
07/01/2013

An Empirical Study into Annotator Agreement, Ground Truth Estimation, and Algorithm Evaluation

Although agreement between annotators has been studied in the past from ...

Please sign up or login with your details

Forgot password? Click here to reset