Grounded Language Understanding for Manipulation Instructions Using GAN-Based Classification

01/16/2018
by Komei Sugiura, et al.
The target task of this study is grounded language understanding for domestic service robots (DSRs). In particular, we focus on understanding short instructions in which verbs are missing. This task is critically important for building communicative DSRs because manipulation is essential to them. Existing instruction understanding methods usually estimate missing information only from non-grounded knowledge; therefore, it has been unclear whether the predicted action is physically executable. In this paper, we present a grounded instruction understanding method that estimates appropriate objects given an instruction and a situation. We extend Generative Adversarial Nets (GAN) and build a GAN-based classifier using latent representations. To quantitatively evaluate the proposed method, we developed a data set based on the standard data set used for Visual QA. Experimental results show that the proposed method outperforms the baseline methods.

research
06/17/2019

Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification

In this paper, we address multimodal language understanding for unconstr...
research
06/11/2018

A Multimodal Classifier Generative Adversarial Network for Carry and Place Tasks from Ambiguous Language Instructions

This paper focuses on a multimodal language understanding method for car...
research
09/05/2021

Modular Framework for Visuomotor Language Grounding

Natural language instruction following tasks serve as a valuable test-be...
research
05/12/2019

Improving Natural Language Interaction with Robots Using Advice

Over the last few years, there has been growing interest in learning mod...
research
02/23/2018

Interactive Image Manipulation with Natural Language Instruction Commands

We propose an interactive image-manipulation system with natural languag...
research
12/10/2022

OpenD: A Benchmark for Language-Driven Door and Drawer Opening

We introduce OPEND, a benchmark for learning how to use a hand to open c...
research
07/02/2021

Target-dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots

Currently, domestic service robots have an insufficient ability to inter...
