Mitigating sampling bias in risk-based active learning via an EM algorithm

06/25/2022
by Aidan J. Hughes, et al.
The University of Sheffield

Risk-based active learning is an approach to developing statistical classifiers for online decision support. In this approach, data-label querying is guided by the expected value of perfect information for incipient data points. For structural health monitoring (SHM) applications, the value of information is evaluated with respect to a maintenance decision process, and data-label querying corresponds to inspecting a structure to determine its health state. Sampling bias is a known issue in active-learning paradigms: it occurs when an active-learning process over- or undersamples specific regions of the feature space, producing a training set that is not representative of the underlying distribution. This bias ultimately degrades decision-making performance and, as a consequence, incurs unnecessary costs. The current paper outlines a risk-based approach to active learning that utilises a semi-supervised Gaussian mixture model. The semi-supervised approach counteracts sampling bias by incorporating pseudo-labels for unlabelled data via an expectation-maximisation (EM) algorithm. The approach is demonstrated on a numerical example representative of the decision processes found in SHM.
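
To make the semi-supervised idea concrete, the following is a minimal sketch of an EM loop for a Gaussian mixture model with one component per class. It is not the authors' implementation: the function names (`log_gaussian`, `semi_supervised_gmm`), the toy two-class data, the regularisation term `reg`, and the choice of maximum-likelihood updates are all assumptions made for illustration. Labelled points keep fixed one-hot responsibilities, while unlabelled points receive soft pseudo-labels that are re-estimated in each E-step.

```python
import numpy as np

def log_gaussian(X, mean, cov):
    """Log-density of a multivariate normal at each row of X."""
    d = X.shape[1]
    diff = X - mean
    maha = np.sum(diff * np.linalg.solve(cov, diff.T).T, axis=1)
    _, logdet = np.linalg.slogdet(cov)
    return -0.5 * (maha + logdet + d * np.log(2 * np.pi))

def semi_supervised_gmm(X_lab, y_lab, X_unl, n_classes, n_iter=50, reg=1e-6):
    """Hypothetical helper: EM for a GMM with one component per class.

    Assumes every class has at least one labelled point, so the first
    M-step (which uses labelled data only) is well defined.
    """
    d = X_lab.shape[1]
    X_all = np.vstack([X_lab, X_unl])
    R_lab = np.eye(n_classes)[y_lab]            # fixed one-hot responsibilities
    R_unl = np.zeros((len(X_unl), n_classes))   # zero weight until first E-step
    for _ in range(n_iter):
        # M-step: weighted maximum-likelihood updates over all data
        R = np.vstack([R_lab, R_unl])
        Nk = R.sum(axis=0)
        weights = Nk / Nk.sum()
        means = (R.T @ X_all) / Nk[:, None]
        covs = np.empty((n_classes, d, d))
        for k in range(n_classes):
            diff = X_all - means[k]
            covs[k] = (R[:, k, None] * diff).T @ diff / Nk[k] + reg * np.eye(d)
        # E-step: soft pseudo-labels for unlabelled data only (log-space)
        log_r = np.column_stack([
            np.log(weights[k]) + log_gaussian(X_unl, means[k], covs[k])
            for k in range(n_classes)
        ])
        log_r -= log_r.max(axis=1, keepdims=True)
        R_unl = np.exp(log_r)
        R_unl /= R_unl.sum(axis=1, keepdims=True)
    return weights, means, covs, R_unl

# Toy data (assumed for illustration): two classes, few labels, many unlabelled
rng = np.random.default_rng(0)
true_means = np.array([[0.0, 0.0], [4.0, 4.0]])
X0 = rng.normal(true_means[0], 1.0, size=(200, 2))
X1 = rng.normal(true_means[1], 1.0, size=(200, 2))
X_lab = np.vstack([X0[:10], X1[:10]])
y_lab = np.array([0] * 10 + [1] * 10)
X_unl = np.vstack([X0[10:], X1[10:]])

weights, means, covs, resp_unl = semi_supervised_gmm(X_lab, y_lab, X_unl, n_classes=2)
```

Because the unlabelled points contribute to every M-step, the fitted means and covariances reflect the whole data set rather than only the queried points, which is the mechanism the abstract credits with counteracting sampling bias. In a risk-based setting, the fitted model would then supply the class probabilities used to compute the expected value of perfect information; that step is not shown here.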

06/23/2022

Improving decision-making via risk-based active learning: Probabilistic discriminative classifiers

Gaining the ability to make informed decisions on operation and maintena...
01/07/2022

On robust risk-based active-learning algorithms for enhanced decision support

Classification models are a fundamental component of physical-asset mana...
05/12/2021

On risk-based active learning for structural health monitoring

A primary motivation for the development and implementation of structura...
06/08/2021

Targeted Active Learning for Bayesian Decision-Making

Active learning is usually applied to acquire labels of informative data...
10/18/2021

A-Optimal Active Learning

In this work we discuss the problem of active learning. We present an ap...
12/26/2022

Online Active Learning for Soft Sensor Development using Semi-Supervised Autoencoders

Data-driven soft sensors are extensively used in industrial and chemical...
02/01/2023

Robust online active learning

In many industrial applications, obtaining labeled observations is not s...