HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks

08/24/2023
by   ZiChao Dong, et al.
0

Human robot interaction is an exciting task, which aimed to guide robots following instructions from human. Since huge gap lies between human natural language and machine codes, end to end human robot interaction models is fair challenging. Further, visual information receiving from sensors of robot is also a hard language for robot to perceive. In this work, HuBo-VLM is proposed to tackle perception tasks associated with human robot interaction including object detection and visual grounding by a unified transformer based vision language model. Extensive experiments on the Talk2Car benchmark demonstrate the effectiveness of our approach. Code would be publicly available in https://github.com/dzcgaara/HuBo-VLM.

READ FULL TEXT

page 4

page 7

research
04/30/2019

Learning from Implicit Information in Natural Language Instructions for Robotic Manipulations

Human-robot interaction often occurs in the form of instructions given f...
research
08/25/2023

Formalising Natural Language Quantifiers for Human-Robot Interactions

We present a method for formalising quantifiers in natural language in t...
research
12/12/2018

Towards Understanding Language through Perception in Situated Human-Robot Interaction: From Word Grounding to Grammar Induction

Robots are widely collaborating with human users in diferent tasks that ...
research
08/30/2023

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Enabling robots to understand language instructions and react accordingl...
research
03/23/2023

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding

Aiming to link natural language descriptions to specific regions in a 3D...
research
09/08/2023

Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models

Natural-language dialog is key for intuitive human-robot interaction. It...
research
12/09/2020

Proactive Interaction Framework for Intelligent Social Receptionist Robots

Proactive human-robot interaction (HRI) allows the receptionist robots t...

Please sign up or login with your details

Forgot password? Click here to reset