MQA: Answering the Question via Robotic Manipulation

03/10/2020
by   Yuhong Deng, et al.

In this paper, we propose a novel task, Manipulation Question Answering (MQA), a class of Question Answering (QA) tasks in which the robot must find the answer to a question by actively interacting with the environment via manipulation. For the tabletop scenario, a heatmap of the scene is generated to give the robot a semantic understanding of the scene, and an imitation learning approach with a semantic understanding metric is proposed to generate manipulation actions that guide the manipulator in exploring the tabletop to find the answer. In addition, a novel dataset containing a variety of tabletop scenarios and corresponding question-answer pairs is established. Extensive experiments have been conducted to validate the effectiveness of the proposed framework.

research
07/16/2022

Scene Graph for Embodied Exploration in Cluttered Scenario

The ability to handle objects in cluttered environment has been long ant...
research
10/06/2022

Embodied Referring Expression for Manipulation Question Answering in Interactive Environment

Embodied agents are expected to perform more complicated tasks in an int...
research
05/25/2021

Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting

This paper explores the task of Difficulty-Controllable Question Generat...
research
09/22/2021

Audio-Visual Grounding Referring Expression for Robotic Manipulation

Referring expressions are commonly used when referring to a specific tar...
research
07/07/2023

Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation

What makes generalization hard for imitation learning in visual robotic ...
research
12/04/2015

Learning the Semantics of Manipulation Action

In this paper we present a formal computational framework for modeling m...
research
11/26/2020

Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction

In open-domain question answering, questions are highly likely to be amb...
