Socially-Aware Robot Planning via Bandit Human Feedback

by   Xusheng Luo, et al.

In this paper, we consider the problem of designing collision-free, dynamically feasible, and socially-aware trajectories for robots operating in environments populated by humans. We define trajectories to be social-aware if they do not interfere with humans in any way that causes discomfort. In this paper, discomfort is defined broadly and, depending on specific individuals, it can result from the robot being too close to a human or from interfering with human sight or tasks. Moreover, we assume that human feedback is a bandit feedback indicating a complaint or no complaint on the part of the robot trajectory that interferes with the humans, and it does not reveal any contextual information about the locations of the humans or the reason for a complaint. Finally, we assume that humans can move in the obstacle-free space and, as a result, human utility can change. We formulate this planning problem as an online optimization problem that minimizes the social value of the time-varying robot trajectory, defined by the total number of incurred human complaints. As the human utility is unknown, we employ zeroth order, or derivative-free, optimization methods to solve this problem, which we combine with off-the-shelf motion planners to satisfy the dynamic feasibility and collision-free specifications of the resulting trajectories. To the best of our knowledge, this is a new framework for socially-aware robot planning that is not restricted to avoiding collisions with humans but, instead, focuses on increasing the social value of the robot trajectories using only bandit human feedback.


page 1

page 2

page 3

page 4


Human-in-the-Loop Robot Planning with Non-Contextual Bandit Feedback

In this paper, we consider a robot navigation problem in environments po...

Game-theoretical trajectory planning enhances social acceptability for humans

Since humans and robots are increasingly sharing portions of their opera...

Perception-Aware Perching on Powerlines with Multirotors

Multirotor aerial robots are becoming widely used for the inspection of ...

Learning Safe Unlabeled Multi-Robot Planning with Motion Constraints

In this paper, we present a learning approach to goal assignment and tra...

HMPO: Human Motion Prediction in Occluded Environments for Safe Motion Planning

We present a novel approach to generate collision-free trajectories for ...

Object Gathering with a Tethered Robot Duo

We devise a cooperative planning framework to generate optimal trajector...

Interpretable Run-Time Prediction and Planning in Co-Robotic Environments

Mobile robots are traditionally developed to be reactive and avoid colli...

Please sign up or login with your details

Forgot password? Click here to reset