Mitigating sampling bias in risk-based active learning via an EM algorithm

06/25/2022
by   Aidan J. Hughes, et al.
0

Risk-based active learning is an approach to developing statistical classifiers for online decision-support. In this approach, data-label querying is guided according to the expected value of perfect information for incipient data points. For SHM applications, the value of information is evaluated with respect to a maintenance decision process, and the data-label querying corresponds to the inspection of a structure to determine its health state. Sampling bias is a known issue within active-learning paradigms; this occurs when an active learning process over- or undersamples specific regions of a feature-space, thereby resulting in a training set that is not representative of the underlying distribution. This bias ultimately degrades decision-making performance, and as a consequence, results in unnecessary costs incurred. The current paper outlines a risk-based approach to active learning that utilises a semi-supervised Gaussian mixture model. The semi-supervised approach counteracts sampling bias by incorporating pseudo-labels for unlabelled data via an EM algorithm. The approach is demonstrated on a numerical example representative of the decision processes found in SHM.

READ FULL TEXT
research
06/23/2022

Improving decision-making via risk-based active learning: Probabilistic discriminative classifiers

Gaining the ability to make informed decisions on operation and maintena...
research
01/07/2022

On robust risk-based active-learning algorithms for enhanced decision support

Classification models are a fundamental component of physical-asset mana...
research
05/12/2021

On risk-based active learning for structural health monitoring

A primary motivation for the development and implementation of structura...
research
06/08/2021

Targeted Active Learning for Bayesian Decision-Making

Active learning is usually applied to acquire labels of informative data...
research
10/18/2021

A-Optimal Active Learning

In this work we discuss the problem of active learning. We present an ap...
research
01/27/2021

On Statistical Bias In Active Learning: How and When To Fix It

Active learning is a powerful tool when labelling data is expensive, but...
research
06/28/2019

L*-Based Learning of Markov Decision Processes (Extended Version)

Automata learning techniques automatically generate system models from t...

Please sign up or login with your details

Forgot password? Click here to reset