Multi-Modal Human-Machine Communication for Instructing Robot Grasping Tasks

05/24/2005
by P. C. McGuire, et al.

A major challenge for the realization of intelligent robots is to supply them with cognitive abilities in order to allow ordinary users to program them easily and intuitively. One approach to such programming is teaching work tasks by interactive demonstration. To make this effective and convenient for the user, the machine must be capable of establishing a common focus of attention and be able to use and integrate spoken instructions, visual perceptions, and non-verbal cues such as gestural commands. We report progress in building a hybrid architecture that combines statistical methods, neural networks, and finite state machines into an integrated system for instructing grasping tasks by man-machine interaction. The system combines the GRAVIS-robot for visual attention and gestural instruction with an intelligent interface for speech recognition and linguistic interpretation, and a modality fusion module to allow multi-modal task-oriented man-machine communication with respect to dextrous robot manipulation of objects.


