Using Pause Information for More Accurate Entity Recognition

09/27/2021
by   Sahas Dendukuri, et al.
0

Entity tags in human-machine dialog are integral to natural language understanding (NLU) tasks in conversational assistants. However, current systems struggle to accurately parse spoken queries with the typical use of text input alone, and often fail to understand the user intent. Previous work in linguistics has identified a cross-language tendency for longer speech pauses surrounding nouns as compared to verbs. We demonstrate that the linguistic observation on pauses can be used to improve accuracy in machine-learnt language understanding tasks. Analysis of pauses in French and English utterances from a commercial voice assistant shows the statistically significant difference in pause duration around multi-token entity span boundaries compared to within entity spans. Additionally, in contrast to text-based NLU, we apply pause duration to enrich contextual embeddings to improve shallow parsing of entities. Results show that our proposed novel embeddings improve the relative error rate by up to 8 three domains for French, without any added annotation or alignment costs to the parser.

READ FULL TEXT

page 3

page 4

page 5

research
09/07/2019

Dependency Parsing for Spoken Dialog Systems

Dependency parsing of conversational input can play an important role in...
research
03/13/2019

Benchmarking Natural Language Understanding Services for building Conversational Agents

We have recently seen the emergence of several publicly available Natura...
research
02/26/2022

Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems

The lack of speech data annotated with labels required for spoken langua...
research
12/23/2018

Water quality information dissemination at real-time in South Africa using language modelling

We present a conversational model to apprise users with limited access t...
research
04/05/2021

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding

Word Error Rate (WER) has been the predominant metric used to evaluate t...
research
05/22/2020

Givenness Hierarchy Theoretic Cognitive Status Filtering

For language-capable interactive robots to be effectively introduced int...
research
12/03/2019

Fast Intent Classification for Spoken Language Understanding

Spoken Language Understanding (SLU) systems consist of several machine l...

Please sign up or login with your details

Forgot password? Click here to reset