Teaching Drones on the Fly: Can Emotional Feedback Serve as Learning Signal for Training Artificial Agents?

02/19/2022
by   Manuela Pollak, et al.
0

We investigate whether naturalistic emotional human feedback can be directly exploited as a reward signal for training artificial agents via interactive human-in-the-loop reinforcement learning. To answer this question, we devise an experimental setting inspired by animal training, in which human test subjects interactively teach an emulated drone agent their desired command-action-mapping by providing emotional feedback on the drone's action selections. We present a first empirical proof-of-concept study and analysis confirming that human facial emotion expression can be directly exploited as reward signal in such interactive learning settings. Thereby, we contribute empirical findings towards more naturalistic and intuitive forms of reinforcement learning especially designed for non-expert users.

READ FULL TEXT

page 1

page 6

research
04/15/2019

Improving interactive reinforcement learning: What makes a good teacher?

Interactive reinforcement learning has become an important apprenticeshi...
research
10/14/2022

Multi-trainer Interactive Reinforcement Learning System

Interactive reinforcement learning can effectively facilitate the agent ...
research
01/15/2017

Agent-Agnostic Human-in-the-Loop Reinforcement Learning

Providing Reinforcement Learning agents with expert advice can dramatica...
research
01/23/2020

Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework

Interactive reinforcement learning provides a way for agents to learn to...
research
02/10/2019

Live Emoji: Semantic Emotional Expressiveness of 2D Live Animation

Live animation of 2D characters has recently become a popular way for st...
research
08/02/2019

Improving Deep Reinforcement Learning in Minecraft with Action Advice

Training deep reinforcement learning agents complex behaviors in 3D virt...
research
07/20/2023

Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback

Exploration and reward specification are fundamental and intertwined cha...

Please sign up or login with your details

Forgot password? Click here to reset