Balancing thermal comfort datasets: We GAN, but should we?

09/28/2020
by   Matias Quintana, et al.
0

Thermal comfort assessment for the built environment has become more available to analysts and researchers due to the proliferation of sensors and subjective feedback methods. These data can be used for modeling comfort behavior to support design and operations towards energy efficiency and well-being. By nature, occupant subjective feedback is imbalanced as indoor conditions are designed for comfort, and responses indicating otherwise are less common. This situation creates a scenario for the machine learning workflow where class balancing as a pre-processing step might be valuable for developing predictive thermal comfort classification models with high-performance. This paper investigates the various thermal comfort dataset class balancing techniques from the literature and proposes a modified conditional Generative Adversarial Network (GAN), comfortGAN, to address this imbalance scenario. These approaches are applied to three publicly available datasets, ranging from 30 and 67 participants to a global collection of thermal comfort datasets, with 1,474; 2,067; and 66,397 data points, respectively. This work finds that a classification model trained on a balanced dataset, comprised of real and generated samples from comfortGAN, has higher performance (increase between 4 than other augmentation methods tested. However, when classes representing discomfort are merged and reduced to three, better imbalanced performance is expected, and the additional increase in performance by comfortGAN shrinks to 1-2 comfort modeling is beneficial using advanced techniques such as GANs, but its value is diminished in certain scenarios. A discussion is provided to assist potential users in determining which scenarios this process is useful and which method works best.

READ FULL TEXT
research
03/26/2018

BAGAN: Data Augmentation with Balancing GAN

Image classification datasets are often imbalanced, characteristic that ...
research
10/23/2022

Imbalanced Class Data Performance Evaluation and Improvement using Novel Generative Adversarial Network-based Approach: SSG and GBO

Class imbalance in a dataset is one of the major challenges that can sig...
research
08/06/2021

SMOTified-GAN for class imbalanced pattern classification problems

Class imbalance in a dataset is a major problem for classifiers that res...
research
06/17/2021

Class Balancing GAN with a Classifier in the Loop

Generative Adversarial Networks (GANs) have swiftly evolved to imitate i...
research
03/10/2022

Conditional Synthetic Data Generation for Personal Thermal Comfort Models

Personal thermal comfort models aim to predict an individual's thermal c...
research
05/16/2023

BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions

Class imbalanced problems (CIP) are one of the potential challenges in d...

Please sign up or login with your details

Forgot password? Click here to reset