Exploring Bayesian Surprise to Prevent Overfitting and to Predict Model Performance in Non-Intrusive Load Monitoring

09/16/2020
by   Richard Jones, et al.
0

Non-Intrusive Load Monitoring (NILM) is a field of research focused on segregating constituent electrical loads in a system based only on their aggregated signal. Significant computational resources and research time are spent training models, often using as much data as possible, perhaps driven by the preconception that more data equates to more accurate models and better performing algorithms. When has enough prior training been done? When has a NILM algorithm encountered new, unseen data? This work applies the notion of Bayesian surprise to answer these questions which are important for both supervised and unsupervised algorithms. We quantify the degree of surprise between the predictive distribution (termed postdictive surprise), as well as the transitional probabilities (termed transitional surprise), before and after a window of observations. We compare the performance of several benchmark NILM algorithms supported by NILMTK, in order to establish a useful threshold on the two combined measures of surprise. We validate the use of transitional surprise by exploring the performance of a popular Hidden Markov Model as a function of surprise threshold. Finally, we explore the use of a surprise threshold as a regularization technique to avoid overfitting in cross-dataset performance. Although the generality of the specific surprise threshold discussed herein may be suspect without further testing, this work provides clear evidence that a point of diminishing returns of model performance with respect to dataset size exists. This has implications for future model development, dataset acquisition, as well as aiding in model flexibility during deployment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/05/2019

Diversified Hidden Markov Models for Sequential Labeling

Labeling of sequential data is a prevalent meta-problem for a wide range...
research
12/12/2019

On Metrics to Assess the Transferability of Machine Learning Models in Non-Intrusive Load Monitoring

To assess the performance of load disaggregation algorithms it is common...
research
08/07/2023

Revealing the Underlying Patterns: Investigating Dataset Similarity, Performance, and Generalization

Supervised deep learning models require significant amount of labelled d...
research
08/31/2020

Optimal Bayesian Quickest Detection for Hidden Markov Models and Structured Generalisations

In this paper we consider the problem of quickly detecting changes in hi...
research
02/05/2018

On the Feasibility of Generic Deep Disaggregation for Single-Load Extraction

Recently, and with the growing development of big energy datasets, data-...
research
04/17/2012

On how percolation threshold affects PSO performance

Statistical evidence of the influence of neighborhood topology on the pe...

Please sign up or login with your details

Forgot password? Click here to reset