Multimodal Systems: Taxonomy, Methods, and Challenges

06/06/2020
by Muhammad Zeeshan Baig, et al.

Humans naturally use multiple modalities to convey information. The human brain processes these modalities both sequentially and in parallel during communication, but this changes when humans interact with computers. Empowering computers to process input multimodally is a major area of investigation in Human-Computer Interaction (HCI). Advances in technology (powerful mobile devices, advanced sensors, new forms of output, etc.) have opened new gateways for researchers to design systems that allow multimodal interaction. It is only a matter of time before multimodal input overtakes traditional ways of interaction. This paper introduces the domain of multimodal systems, gives a brief history, describes the advantages of multimodal systems over unimodal systems, and discusses various modalities. Input modeling, fusion, and data collection are also discussed. Finally, the challenges in multimodal systems research are listed. The analysis of the literature shows that multimodal interface systems improve the task completion rate and reduce errors compared to unimodal systems. The most commonly used inputs for multimodal interaction are speech and gestures. For multimodal inputs, researchers prefer late integration of input modalities because it allows modalities and their corresponding vocabularies to be updated easily.
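
To make the idea of late integration concrete, the following is a minimal sketch and not the method of any system surveyed in the paper. It assumes each modality has its own recognizer that independently scores command hypotheses, and that the scores are combined only at the decision level by a weighted average. The function names (`late_fusion`, `best_command`), the example commands, and the weights are all hypothetical.

```python
from typing import Dict, Mapping


def late_fusion(
    scores_by_modality: Mapping[str, Mapping[str, float]],
    weights: Mapping[str, float],
) -> Dict[str, float]:
    """Combine per-modality command scores with a weighted average.

    Assumption: every recognizer runs independently and reports a
    confidence per command; fusion happens only on these outputs.
    """
    # Normalize over the modalities that actually produced output.
    total_weight = sum(weights[m] for m in scores_by_modality)
    fused: Dict[str, float] = {}
    for modality, scores in scores_by_modality.items():
        w = weights[modality] / total_weight
        for command, confidence in scores.items():
            fused[command] = fused.get(command, 0.0) + w * confidence
    return fused


def best_command(fused: Mapping[str, float]) -> str:
    """Return the command with the highest fused score."""
    return max(fused, key=lambda command: fused[command])


# Hypothetical recognizer outputs for one user action.
speech = {"move": 0.6, "delete": 0.4}
gesture = {"move": 0.3, "delete": 0.7}
weights = {"speech": 0.5, "gesture": 0.5, "gaze": 0.5}

fused = late_fusion({"speech": speech, "gesture": gesture}, weights)
print(best_command(fused))

# Adding a modality means adding one more entry; the other
# recognizers and their vocabularies stay untouched.
gaze = {"move": 0.9, "delete": 0.1}
fused_with_gaze = late_fusion(
    {"speech": speech, "gesture": gesture, "gaze": gaze}, weights
)
print(best_command(fused_with_gaze))
```

Because the recognizers never share intermediate features in this sketch, a modality or its vocabulary can be swapped without retraining the others, which is the kind of easy update the abstract attributes to late integration.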

research
01/29/2019

Guidelines for creating man-machine multimodal interfaces

Understanding details of human multimodal interaction can elucidate many...
research
05/13/2022

Multimodal Conversational AI: A Survey of Datasets and Approaches

As humans, we experience the world with all our senses or modalities (so...
research
05/25/2023

MPE4G: Multimodal Pretrained Encoder for Co-Speech Gesture Generation

When virtual agents interact with humans, gestures are crucial to delive...
research
12/22/2021

Understanding and Measuring Robustness of Multimodal Learning

The modern digital world is increasingly becoming multimodal. Although m...
research
09/28/2011

Cognitive Principles in Robust Multimodal Interpretation

Multimodal conversational interfaces provide a natural means for users t...
research
01/26/2023

Emotional Interaction Qualities: Vocabulary, Modalities, Actions, And Mapping

Have you ever typed particularly powerful on your keyboard, maybe even h...
research
08/26/2020

Conversations On Multimodal Input Design With Older Adults

Multimodal input systems can help bridge the wide range of physical abil...
