Meta Preference Learning for Fast User Adaptation in Human-Supervisory Multi-Robot Deployments

by Chao Huang, et al.

As multi-robot systems (MRS) are increasingly used in tasks such as natural disaster response and social security, there is a growing expectation that an MRS should be easy for a general user to operate without extensive training. However, users differ in how they prefer to balance task performance against safety, which imposes different requirements on MRS control. Failing to comply with these preferences makes operation difficult and reduces users' willingness to use an MRS. Therefore, to improve both social acceptance and performance, there is an urgent need to adjust MRS behaviors according to human preferences before human corrections are triggered, since such corrections increase cognitive load. In this paper, a novel Meta Preference Learning (MPL) method is developed to enable an MRS to adapt quickly to user preferences. Building on a meta-learning mechanism, MPL quickly assesses human preferences from limited instructions; a neural-network-based preference model then adjusts MRS behaviors to match those preferences. To validate the method, a task scenario, "An MRS searches victims in an earthquake disaster site", was designed; 20 human users were involved to identify preferences as "aggressive", "medium", or "reserved"; and, based on user guidance and domain knowledge, about 20,000 preferences were simulated to cover different operations related to "task quality", "task progress", and "robot safety". The effectiveness of MPL in preference adaptation was validated by the reduced duration and frequency of human interventions.
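
The general idea of adapting to a new user from only a few instructions can be illustrated with a small sketch. This is not the MPL method from the paper: it swaps in a Reptile-style first-order meta-learning loop and a linear preference model in place of the neural network, and every name and number in it (BASE_WEIGHTS, sample_user, sample_feedback, the learning rates, the noise level) is a hypothetical assumption chosen for illustration. The three feature names are taken from the abstract; the data are synthetic.

```python
import numpy as np

rng = np.random.default_rng(0)

# Feature names come from the abstract; the weights are hypothetical.
FEATURES = ("task quality", "task progress", "robot safety")
BASE_WEIGHTS = np.array([2.0, -1.0, 0.5])  # assumed population-level preference


def sample_user():
    """Draw a synthetic user whose preference weights vary around the base."""
    return BASE_WEIGHTS + 0.1 * rng.standard_normal(len(FEATURES))


def sample_feedback(user_w, n):
    """Simulate n scored situations: features x and the user's score y = x . w."""
    x = rng.standard_normal((n, len(FEATURES)))
    return x, x @ user_w


def adapt(w, x, y, steps=3, lr=0.05):
    """A few gradient steps on half the mean squared error, from init w."""
    w = w.copy()
    for _ in range(steps):
        grad = x.T @ (x @ w - y) / len(y)
        w -= lr * grad
    return w


def meta_train(iterations=500, meta_lr=0.5):
    """Reptile-style loop: nudge the shared init toward each user's adapted weights."""
    theta = np.zeros(len(FEATURES))
    for _ in range(iterations):
        x, y = sample_feedback(sample_user(), n=10)
        adapted = adapt(theta, x, y, steps=5)
        theta += meta_lr * (adapted - theta)
    return theta


theta = meta_train()

# A new user provides only five pieces of feedback.
new_user = sample_user()
x_few, y_few = sample_feedback(new_user, n=5)

err_meta = np.linalg.norm(adapt(theta, x_few, y_few) - new_user)
err_zero = np.linalg.norm(adapt(np.zeros(len(FEATURES)), x_few, y_few) - new_user)
print(f"error starting from meta-learned init: {err_meta:.3f}")
print(f"error starting from zero init:         {err_zero:.3f}")
```

In this toy setting the meta-learned initialization already sits close to a typical user's weights, so the same few gradient steps leave a much smaller error than starting from zero. A real system would replace the synthetic scores with actual operator instructions and the linear model with a richer one.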




Towards Preference Learning for Autonomous Ground Robot Navigation Tasks

We are interested in the design of autonomous robot behaviors that learn...

Warmth and Competence to Predict Human Preference of Robot Behavior in Physical Human-Robot Interaction

A solid methodology to understand human perception and preferences in hu...

Balancing Shared Autonomy with Human-Robot Communication

Robotic agents that share autonomy with a human should leverage human do...

Construction and Adaptability Analysis of User's Preference Models Based on Check-in Data in LBSN

With the widespread use of mobile phones, users can share their location...

Learning from Physical Human Feedback: An Object-Centric One-Shot Adaptation Method

For robots to be effectively deployed in novel environments and tasks, t...

How to cluster nearest unique nodes from different classes using JJCluster in Wisp application?

The work of finding the best place according to user preference is a ted...

Two-Stage Clustering of Human Preferences for Action Prediction in Assembly Tasks

To effectively assist human workers in assembly tasks a robot must proac...