Addressing the Blind Spots in Spoken Language Processing

09/06/2023
by Amit Moryossef, et al.

This paper explores the critical but often overlooked role of non-verbal cues, including co-speech gestures and facial expressions, in human communication and their implications for Natural Language Processing (NLP). We argue that understanding human communication requires a more holistic approach that goes beyond textual or spoken words to include non-verbal elements. Borrowing from advances in sign language processing, we propose the development of universal automatic gesture segmentation and transcription models to transcribe these non-verbal cues into textual form. Such a methodology aims to address the blind spots in spoken language understanding, enhancing the scope and applicability of NLP models. Through motivating examples, we demonstrate the limitations of relying solely on text-based models. We propose a computationally efficient and flexible approach for incorporating non-verbal cues, which can seamlessly integrate with existing NLP pipelines. We conclude by calling upon the research community to contribute to the development of universal transcription methods and to validate their effectiveness in capturing the complexities of real-world, multi-modal interactions.

research
05/08/2023

Putting Natural in Natural Language Processing

Human language is firstly spoken and only secondarily written. Text, h...
research
02/15/2022

textless-lib: a Library for Textless Spoken Language Processing

Textless spoken language processing research aims to extend the applicab...
research
02/10/2020

Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery

Text-based representations of chemicals and proteins can be thought of a...
research
12/10/2021

DEBACER: a method for slicing moderated debates

Subjects change frequently in moderated debates with several participant...
research
11/10/2022

An Inclusive Notion of Text

Natural language processing researchers develop models of grammar, meani...
research
05/15/2018

Marrying up Regular Expressions with Neural Networks: A Case Study for Spoken Language Understanding

The success of many natural language processing (NLP) tasks is bound by ...
research
06/10/2023

Universal Language Modelling agent

Large Language Models are designed to understand complex Human Language....
