Waveform Signal Entropy and Compression Study of Whole-Building Energy Datasets

10/25/2018
by   Thomas Kriechbaumer, et al.
0

Electrical energy consumption has been an ongoing research area since the coming of smart homes and Internet of Things devices. Consumption characteristics and usages profiles are directly influenced by building occupants and their interaction with electrical appliances. Extracted information from these data can be used to conserve energy and increase user comfort levels. Data analysis together with machine learning models can be utilized to extract valuable information for the benefit of occupants themselves, power plants, and grid operators. Public energy datasets provide a scientific foundation to develop and benchmark these algorithms and techniques. With datasets exceeding tens of terabytes, we present a novel study of five whole-building energy datasets with high sampling rates, their signal entropy, and how a well-calibrated measurement can have a significant effect on the overall storage requirements. We show that some datasets do not fully utilize the available measurement precision, therefore leaving potential accuracy and space savings untapped. We benchmark a comprehensive list of 365 file formats, transparent data transformations, and lossless compression algorithms. The primary goal is to reduce the overall dataset size while maintaining an easy-to-use file format and access API. We show that with careful selection of file format and encoding scheme, we can reduce the size of some datasets by up to 73

READ FULL TEXT
research
11/23/2021

Health Detection on Cattle Compressed Images in Precision Livestock Farming

The constant population growth brings the needing to make up for food al...
research
05/27/2018

IoT for Green Building Management

Buildings consume 60 management systems (BMSs) are highly expensive and ...
research
09/17/2020

Building power consumption datasets: Survey, taxonomy and future directions

In the last decade, extended efforts have been poured into energy effici...
research
11/30/2021

RawArray: A Simple, Fast, and Extensible Archival Format for Numeric Data

Raw data sizes are growing and proliferating in scientific research, dri...
research
09/05/2022

Spatial Parquet: A Column File Format for Geospatial Data Lakes [Extended Version]

Modern data analytics applications prefer to use column-storage formats ...
research
08/03/2021

Ontology Modeling for Decentralized Household Energy Systems

In a decentralized household energy system consisting of various devices...

Please sign up or login with your details

Forgot password? Click here to reset