Mobile-Env: A Universal Platform for Training and Evaluation of Mobile Interaction

05/14/2023
by   Danyang Zhang, et al.
0

The interaction platform plays a crucial role in the recent advancement of the control and decision domains like game playing and embodied intelligence. However, there is still a lack of a satisfactory platform for the information user interface (InfoUI) interaction. The proposed InfoUI comprises not only the plain text information, but the multimodal contents and a few spatial structures with styles as well. To help the research of InfoUI interaction, a novel platform Mobile-Env is presented in this paper. The Mobile-Env platform is designed to be flexible, adaptable, and easily-extended. Based on Mobile-Env, an InfoUI task set is then built for a demonstration and evaluation. An agent based on the large-scale language model (LLM) is tested on the task set. The experiment results demonstrate the great potential of the LLM to do text understanding and matching and, meanwhile, reveal the necessity of a better mechanism of interaction feedback and exploration. Several new discussions are conducted as well. A demo video is available at https://youtu.be/gKV6KZYwxGY. The code repository is available at https://github.com/X-LANCE/Mobile-Env. The proposed WikiHow task set is made public at https://huggingface.co/datasets/zdy023/WikiHow-taskset.

READ FULL TEXT

page 2

page 6

research
10/29/2021

ARviz – An Augmented Reality-enabled Visualization Platform for ROS Applications

Current robot interfaces such as teach pendants and 2D screen displays u...
research
03/14/2023

CB2: Collaborative Natural Language Interaction Research Platform

CB2 is a multi-agent platform to study collaborative natural language in...
research
08/06/2023

LARCH: Large Language Model-based Automatic Readme Creation with Heuristics

Writing a readme is a crucial aspect of software development as it plays...
research
07/15/2021

A Multimodal Machine Learning Framework for Teacher Vocal Delivery Evaluation

The quality of vocal delivery is one of the key indicators for evaluatin...
research
05/24/2023

HuatuoGPT, towards Taming Language Model to Be a Doctor

In this paper, we present HuatuoGPT, a large language model (LLM) for me...
research
10/21/2021

LOA: Logical Optimal Actions for Text-based Interaction Games

We present Logical Optimal Actions (LOA), an action decision architectur...
research
09/14/2023

Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Human-Scene Interaction (HSI) is a vital component of fields like embodi...

Please sign up or login with your details

Forgot password? Click here to reset