VOICE: Visual Oracle for Interaction, Conversation, and Explanation

04/08/2023
by Donggang Jia, et al.

We present VOICE, a novel approach for connecting the conversational capabilities of large language models (LLMs) with interactive exploratory visualization. VOICE introduces several technical contributions that drive our conversational visualization framework. Our foundation is a pack-of-bots in which each bot performs a specific task, such as assigning tasks, extracting instructions, or generating coherent content. We employ fine-tuning and prompt engineering to tailor each bot's performance to its role so that it responds accurately to user queries, and a new prompt-based iterative scene-tree generation establishes a coupling with a structural model. Our text-to-visualization method generates a flythrough sequence that matches the content explanation. Finally, 3D natural language interaction provides capabilities to navigate and manipulate the 3D models in real time. The VOICE framework can receive arbitrary voice commands from the user and responds verbally, with the spoken response tightly coupled to the corresponding visual representation, at low latency and high accuracy. We demonstrate the effectiveness of our approach and its potential to generalize by applying it to two distinct domains: analyzing three 3D molecular models with multi-scale and multi-instance attributes, and a cartographic map visualization. A free copy of this paper and all supplemental materials are available at https://osf.io/g7fbr/.
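
The pack-of-bots idea described above can be illustrated with a minimal sketch. This is not the VOICE implementation: the bot names, the `COMMAND_VERBS` list, and the keyword-based routing are all hypothetical stand-ins for the fine-tuned and prompted LLMs that the abstract mentions. The sketch only shows the control flow, in which a manager bot assigns each query to a specialist bot.

```python
from typing import Callable, Dict, Tuple

# Hypothetical command vocabulary; a real system would let an LLM decide.
COMMAND_VERBS = ("zoom", "rotate", "hide", "show")


def manager_bot(query: str) -> str:
    """Assign the query to a role (keyword stand-in for an LLM classifier)."""
    words = query.lower().split()
    return "interaction" if any(v in words for v in COMMAND_VERBS) else "explanation"


def instruction_bot(query: str) -> str:
    """Extract a 'verb:target' instruction from a navigation command."""
    words = query.lower().split()
    for verb in COMMAND_VERBS:
        if verb in words:
            target = " ".join(words[words.index(verb) + 1:])
            return f"{verb}:{target}"
    return "noop:"


def explanation_bot(query: str) -> str:
    """Placeholder for LLM-generated explanatory content."""
    return f"[explanation for: {query}]"


# Registry mapping each role to the bot that handles it.
BOTS: Dict[str, Callable[[str], str]] = {
    "interaction": instruction_bot,
    "explanation": explanation_bot,
}


def respond(query: str) -> Tuple[str, str]:
    """Route a query through the manager bot to the matching specialist bot."""
    role = manager_bot(query)
    return role, BOTS[role](query)
```

Under these assumptions, `respond("Rotate the capsid")` is routed to the instruction bot, while a question such as `respond("What is a capsid?")` goes to the explanation bot.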

research
03/08/2023

Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

ChatGPT is attracting a cross-field interest as it provides a language i...
research
05/22/2023

Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language Models

The recent success of large language models (LLMs) has shown great poten...
research
10/09/2020

Plug-and-Play Conversational Models

There has been considerable progress made towards conversational models ...
research
12/19/2022

MIGA: A Unified Multi-task Generation Framework for Conversational Text-to-SQL

Conversational text-to-SQL is designed to translate multi-turn natural l...
research
08/30/2023

Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction

Recently, the remarkable capabilities of large language models (LLMs) ha...
research
06/16/2023

Rewriting the Script: Adapting Text Instructions for Voice Interaction

Voice assistants have sharply risen in popularity in recent years, but t...
research
04/19/2023

Affective social anthropomorphic intelligent system

Human conversational styles are measured by the sense of humor, personal...