Object Permanence Through Audio-Visual Representations

10/20/2020
by Fanjun Bu, et al.

As robots perform manipulation tasks and interact with objects, they may accidentally drop objects that then bounce out of their visual fields (e.g., because of an inadequate grasp of an unfamiliar object). To enable robots to recover from such errors, we draw upon the concept of object permanence: objects continue to exist even when they are not being sensed (e.g., seen) directly. In particular, we developed a multimodal neural network model that takes a partial, observed bounce trajectory and the audio resulting from the drop impact as inputs and predicts the full bounce trajectory and the end location of a dropped object. We empirically show that (1) our multimodal method predicted end locations close to the actual locations (i.e., within the visual field of the robot's wrist camera) and (2) the robot was able to retrieve dropped objects by applying minimal vision-based pick-up adjustments. Additionally, we show that our multimodal method outperformed the vision-only and audio-only baselines in retrieving dropped objects. Our results provide insights into enabling object permanence for robots and offer foundations for ensuring robust robot autonomy in task execution.
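
The abstract names the model's inputs and outputs but not its architecture. As a rough illustration only, the sketch below shows one way such an interface could look: each modality is encoded separately, the two encodings are concatenated (late fusion), and a decoder emits a full trajectory whose last point is read as the end location. Everything here is an assumption made for the example, including the class name `FusionPredictor`, the layer sizes, the audio feature length, and the use of random, untrained weights; it is not the authors' network.

```python
import random

# Hypothetical sizes; the abstract does not specify any of these.
N_OBSERVED = 5    # observed (x, y, z) points before the object leaves view
N_PREDICTED = 8   # points in the predicted full trajectory
AUDIO_DIM = 16    # length of an impact-audio feature vector


def make_linear(n_in, n_out, rng):
    """Return (weights, bias) for a dense layer with small random weights."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    bias = [0.0] * n_out
    return weights, bias


def linear(layer, x):
    """Apply a dense layer: y = W x + b."""
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def relu(x):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in x]


class FusionPredictor:
    """Untrained late-fusion sketch: encode each modality, concatenate, decode."""

    def __init__(self, hidden=12, seed=0):
        rng = random.Random(seed)
        self.vision_enc = make_linear(N_OBSERVED * 3, hidden, rng)
        self.audio_enc = make_linear(AUDIO_DIM, hidden, rng)
        self.decoder = make_linear(2 * hidden, N_PREDICTED * 3, rng)

    def predict(self, observed_points, audio_features):
        # Flatten the partial trajectory into one vector of coordinates.
        flat = [c for point in observed_points for c in point]
        # Concatenate the two modality encodings (list "+" is concatenation).
        fused = relu(linear(self.vision_enc, flat)) + relu(
            linear(self.audio_enc, audio_features)
        )
        out = linear(self.decoder, fused)
        trajectory = [tuple(out[i:i + 3]) for i in range(0, len(out), 3)]
        # The final predicted point is taken as the end location.
        return trajectory, trajectory[-1]


# Usage with placeholder inputs (not real sensor data).
model = FusionPredictor()
observed = [(0.1 * i, 0.0, 1.0 - 0.2 * i) for i in range(N_OBSERVED)]
audio = [0.0] * AUDIO_DIM
trajectory, end_location = model.predict(observed, audio)
```

With untrained weights the predicted coordinates are meaningless; the example only shows the data flow from a partial trajectory plus an audio feature vector to a full trajectory and an end location.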
