Personal Comfort Estimation in Partial Observable Environment using Reinforcement Learning

12/02/2021
by   Shashi Suman, et al.
0

The technology used in smart homes have improved to learn the user preferences from feedbacks in order to provide convenience to the user in the home environment. Most smart homes learn a uniform model to represent the thermal preference of user which generally fails when the pool of occupants includes people having different age, gender, and location. Having different thermal sensation for each user poses a challenge for the smart homes to learn a personalized preference for each occupant without forgetting the policy of others. A smart home with single optimal policy may fail to provide comfort when a new user with different preference is integrated in the home. In this paper, we propose POSHS, a Bayesian Reinforcement learning algorithm that can approximate the current occupant state in a partial observable environment using its thermal preference and then decide if its a new occupant or belongs to the pool of previously observed users. We then compare POSHS algorithm with an LSTM based algorithm to learn and estimate the current state of the occupant while also taking optimal actions to reduce the timesteps required to set the preferences. We perform these experiments with upto 5 simulated human models each based on hierarchical reinforcement learning. The results show that POSHS can approximate the current user state just from its temperature and humidity preference and also reduce the number of time-steps required to set optimal temperature and humidity by the human model in the presence of the smart home.

READ FULL TEXT

page 5

page 14

research
02/26/2021

Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach

We aim to investigate the potential impacts of smart homes on human beha...
research
09/13/2020

Identifying Grey-box Thermal Models with Bayesian Neural Networks

Smart thermostats are one of the most prevalent home automation products...
research
06/07/2019

Preference-based Interactive Multi-Document Summarisation

Interactive NLP is a promising paradigm to close the gap between automat...
research
06/16/2020

Preference-based Reinforcement Learning with Finite-Time Guarantees

Preference-based Reinforcement Learning (PbRL) replaces reward values in...
research
03/21/2019

Learning Personalized Thermal Preferences via Bayesian Active Learning with Unimodality Constraints

Thermal preferences vary from person to person and may change over time....
research
08/05/2022

Cohort comfort models – Using occupants' similarity to predict personal thermal preference with less data

We introduce Cohort Comfort Models, a new framework for predicting how n...
research
11/11/2020

Bayesian model of electrical heating disaggregation

Adoption of smart meters is a major milestone on the path of European tr...

Please sign up or login with your details

Forgot password? Click here to reset