Is Big Data Sufficient for a Reliable Detection of Non-Technical Losses?

02/13/2017
by Patrick Glauner, et al.

Non-technical losses (NTL) occur during the distribution of electricity in power grids and include, but are not limited to, electricity theft and faulty meters. In emerging countries, they may range up to 40% of the total electricity distributed. To detect NTL, machine learning methods are used that learn irregular consumption patterns from customer data and inspection results. The Big Data paradigm followed in modern machine learning reflects the desire to derive better conclusions simply by analyzing more data, without the need to look at theory and models. However, the sample of inspected customers may be biased, i.e. it does not represent the population of all customers. As a consequence, machine learning models trained on these inspection results are biased as well and therefore lead to unreliable predictions of whether customers cause NTL or not. In machine learning, this issue is called covariate shift, and it has not yet been addressed in the literature on NTL detection. In this work, we present a novel framework for quantifying and visualizing covariate shift. We apply it to a commercial data set from Brazil that consists of 3.6M customers and 820K inspection results. We show that some features have a stronger covariate shift than others, making predictions less reliable. In particular, previous inspections focused on certain neighborhoods or customer classes and were not sufficiently spread among the population of customers. This framework is about to be deployed in a commercial product for NTL detection.
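
To make the idea of a per-feature covariate shift concrete, the following is a minimal sketch that compares how a categorical feature is distributed among all customers versus among inspected customers only. It uses the total variation distance as the shift measure, which is an assumption chosen for illustration and not necessarily the metric used in the framework described above. The function name, the feature (neighborhood), and the toy data are all hypothetical.

```python
from collections import Counter


def total_variation_distance(population, sample):
    """Half the L1 distance between the category frequencies of two lists.

    Returns 0.0 when both lists have identical category proportions and
    1.0 when they share no category at all.
    """
    pop_counts = Counter(population)
    sample_counts = Counter(sample)
    categories = set(pop_counts) | set(sample_counts)
    return 0.5 * sum(
        abs(pop_counts[c] / len(population) - sample_counts[c] / len(sample))
        for c in categories
    )


# Hypothetical toy data: the neighborhood of every customer, and the
# neighborhood of the inspected customers under two inspection policies.
all_customers = ["north"] * 50 + ["south"] * 50
inspected_biased = ["north"] * 45 + ["south"] * 5    # inspections concentrated in one area
inspected_spread = ["north"] * 25 + ["south"] * 25   # inspections spread like the population

shift_biased = total_variation_distance(all_customers, inspected_biased)
shift_spread = total_variation_distance(all_customers, inspected_spread)
print(shift_biased, shift_spread)
```

In this toy example the concentrated inspection sample yields a shift of about 0.4, while the evenly spread sample yields 0. Computing such a score per feature is one simple way to see which features are most affected by biased inspections.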


research
07/04/2016

Neighborhood Features Help Detecting Non-Technical Losses in Big Data Sets

Electricity theft is a major problem around the world in both developed ...
research
01/17/2018

On the Reduction of Biases in Big Data Sets for the Detection of Irregular Power Usage

In machine learning, a bias occurs whenever training sets are not repres...
research
03/02/2018

Impact of Biases in Big Data

The underlying paradigm of big data-driven machine learning reflects the...
research
02/05/2019

Efficient Power Theft Detection for Residential Consumers Using Mean Shift Data Mining Knowledge Discovery Process

Energy theft constitutes an issue of great importance for electricity op...
research
09/09/2017

Identifying Irregular Power Usage by Turning Predictions into Holographic Spatial Visualizations

Power grids are critical infrastructure assets that face non-technical l...
research
02/26/2016

Large-Scale Detection of Non-Technical Losses in Imbalanced Data Sets

Non-technical losses (NTL) such as electricity theft cause significant h...
research
01/24/2019

Modelling the Demand and Uncertainty of Low Voltage Networks and the Effect of non-Domestic Consumers

The increasing use and spread of low carbon technologies are expected to...
