Learning User Preferences via Reinforcement Learning with Spatial Interface Valuing

02/02/2019
by   Miguel Alonso Jr, et al.
0

Interactive Machine Learning is concerned with creating systems that operate in environments alongside humans to achieve a task. A typical use is to extend or amplify the capabilities of a human in cognitive or physical ways, requiring the machine to adapt to the users' intentions and preferences. Often, this takes the form of a human operator providing some type of feedback to the user, which can be explicit feedback, implicit feedback, or a combination of both. Explicit feedback, such as through a mouse click, carries a high cognitive load. The focus of this study is to extend the current state of the art in interactive machine learning by demonstrating that agents can learn a human user's behavior and adapt to preferences with a reduced amount of explicit human feedback in a mixed feedback setting. The learning agent perceives a value of its own behavior from hand gestures given via a spatial interface. This feedback mechanism is termed Spatial Interface Valuing. This method is evaluated experimentally in a simulated environment for a grasping task using a robotic arm with variable grip settings. Preliminary results indicate that learning agents using spatial interface valuing can learn a value function mapping spatial gestures to expected future rewards much more quickly as compared to those same agents just receiving explicit feedback, demonstrating that an agent perceiving feedback from a human user via a spatial interface can serve as an effective complement to existing approaches.

READ FULL TEXT

page 6

page 9

research
06/09/2016

Face valuing: Training user interfaces with facial expressions and reinforcement learning

An important application of interactive machine learning is extending or...
research
03/23/2020

On Interactive Machine Learning and the Potential of Cognitive Feedback

In order to increase productivity, capability, and data exploitation, nu...
research
05/16/2023

Continually Learned Pavlovian Signalling Without Forgetting for Human-in-the-Loop Robotic Control

Artificial limbs are sophisticated devices to assist people with tasks o...
research
03/10/2019

Exploration of Interesting Dense Regions in Spatial Data

Nowadays, spatial data are ubiquitous in various fields of science, such...
research
08/08/2021

Learning Proxemic Behavior Using Reinforcement Learning with Cognitive Agents

Proxemics is a branch of non-verbal communication concerned with studyin...
research
08/08/2014

Using Learned Predictions as Feedback to Improve Control and Communication with an Artificial Limb: Preliminary Findings

Many people suffer from the loss of a limb. Learning to get by without a...
research
06/06/2019

The Open Vault Challenge -- Learning how to build calibration-free interactive systems by cracking the code of a vault

This demo takes the form of a challenge to the IJCAI community. A physic...

Please sign up or login with your details

Forgot password? Click here to reset