On the Reduction of Biases in Big Data Sets for the Detection of Irregular Power Usage

01/17/2018
by   Patrick Glauner, et al.
0

In machine learning, a bias occurs whenever training sets are not representative for the test data, which results in unreliable models. The most common biases in data are arguably class imbalance and covariate shift. In this work, we aim to shed light on this topic in order to increase the overall attention to this issue in the field of machine learning. We propose a scalable novel framework for reducing multiple biases in high-dimensional data sets in order to train more reliable predictors. We apply our methodology to the detection of irregular power usage from real, noisy industrial data. In emerging markets, irregular power usage, and electricity theft in particular, may range up to 40 of particular issue in this domain. We show that reducing these biases increases the accuracy of the trained predictors. Our models have the potential to generate significant economic value in a real world application, as they are being deployed in a commercial software for the detection of irregular power usage.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2017

Is Big Data Sufficient for a Reliable Detection of Non-Technical Losses?

Non-technical losses (NTL) occur during the distribution of electricity ...
research
03/02/2018

Impact of Biases in Big Data

The underlying paradigm of big data-driven machine learning reflects the...
research
09/09/2017

Identifying Irregular Power Usage by Turning Predictions into Holographic Spatial Visualizations

Power grids are critical infrastructure assets that face non-technical l...
research
02/18/2022

Testing the boundaries: Normalizing Flows for higher dimensional data sets

Normalizing Flows (NFs) are emerging as a powerful class of generative m...
research
06/15/2020

Societal biases reinforcement through machine learning: A credit scoring perspective

Does machine learning and AI ensure that social biases thrive ? This pap...
research
10/10/2022

A survey of Identification and mitigation of Machine Learning algorithmic biases in Image Analysis

The problem of algorithmic bias in machine learning has gained a lot of ...
research
04/11/2019

Reducing Lateral Visual Biases in Displays

The human visual system is composed of multiple physiological components...

Please sign up or login with your details

Forgot password? Click here to reset