Maximizing BCI Human Feedback using Active Learning

by   Zizhao Wang, et al.

Recent advancements in Learning from Human Feedback present an effective way to train robot agents via inputs from non-expert humans, without a need for a specially designed reward function. However, this approach needs a human to be present and attentive during robot learning to provide evaluative feedback. In addition, the amount of feedback needed grows with the level of task difficulty and the quality of human feedback might decrease over time because of fatigue. To overcome these limitations and enable learning more robot tasks with higher complexities, there is a need to maximize the quality of expensive feedback received and reduce the amount of human cognitive involvement required. In this work, we present an approach that uses active learning to smartly choose queries for the human supervisor based on the uncertainty of the robot and effectively reduces the amount of feedback needed to learn a given task. We also use a novel multiple buffer system to improve robustness to feedback noise and guard against catastrophic forgetting as the robot learning evolves. This makes it possible to learn tasks with more complexity using lesser amounts of human feedback compared to previous methods. We demonstrate the utility of our proposed method on a robot arm reaching task where the robot learns to reach a location in 3D without colliding with obstacles. Our approach is able to learn this task faster, with less human feedback and cognitive involvement, compared to previous methods that do not use active learning.


page 1

page 5


Learning Preferences for Interactive Autonomy

When robots enter everyday human environments, they need to understand t...

Synthesized Trust Learning from Limited Human Feedback for Human-Load-Reduced Multi-Robot Deployments

Human multi-robot system (MRS) collaboration is demonstrating potentials...

Learning Multimodal Rewards from Rankings

Learning from human feedback has shown to be a useful approach in acquir...

Automatic Playtesting for Game Parameter Tuning via Active Learning

Game designers use human playtesting to gather feedback about game desig...

Learning to Predict Robot Keypoints Using Artificially Generated Images

This work considers robot keypoint estimation on color images as a super...

Bayesian Active Learning for Collaborative Task Specification Using Equivalence Regions

Specifying complex task behaviours while ensuring good robot performance...

Reinforcement Learning with Feedback from Multiple Humans with Diverse Skills

A promising approach to improve the robustness and exploration in Reinfo...

Please sign up or login with your details

Forgot password? Click here to reset