A Study on Dialog Act Recognition using Character-Level Tokenization

05/18/2018
by   Eugénio Ribeiro, et al.

Dialog act recognition is an important step for dialog systems, since it reveals the intention behind the uttered words. Most approaches to the task use word-level tokenization. In contrast, this paper explores the use of character-level tokenization. This is relevant because information at the sub-word level is related to the function of the words and, thus, to the intention they convey. We also explore the use of different context windows around each token, which are able to capture important elements such as affixes. Furthermore, we assess the importance of punctuation and capitalization. We performed experiments on both the Switchboard Dialog Act Corpus and the DIHANA Corpus. In both cases, the experiments show not only that character-level tokenization leads to better performance than the typical word-level approaches, but also that the two approaches capture complementary information. Thus, the best results are achieved by combining tokenization at both levels.
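To make the contrast between tokenization levels concrete, the following is a minimal sketch, not the authors' implementation. The function names (char_tokens, context_windows), the keep_case and keep_punct flags, and the example utterance are all hypothetical and chosen for illustration. The sketch only shows how an utterance can be split into characters, how punctuation and capitalization can be kept or dropped, and how fixed-width windows over characters expose affix-like units; it does not reproduce the recognition model itself.

```python
from typing import List


def char_tokens(utterance: str, keep_case: bool = True,
                keep_punct: bool = True) -> List[str]:
    """Split an utterance into single-character tokens (hypothetical helper)."""
    text = utterance if keep_case else utterance.lower()
    if not keep_punct:
        # Assumption: "punctuation" means anything that is neither
        # alphanumeric nor whitespace.
        text = "".join(ch for ch in text if ch.isalnum() or ch.isspace())
    return list(text)


def context_windows(tokens: List[str], width: int) -> List[str]:
    """Return every contiguous window of `width` tokens, joined as a string."""
    if width < 1:
        raise ValueError("width must be at least 1")
    return ["".join(tokens[i:i + width])
            for i in range(len(tokens) - width + 1)]


utterance = "Are you going?"
word_level = utterance.split()          # typical word-level tokenization
char_level = char_tokens(utterance)     # character-level tokenization
windows = context_windows(char_level, 3)
```

With a window of width 3, the suffix "ing" of "going" appears as its own unit, which is the kind of affix information a word-level tokenizer does not expose directly. Calling char_tokens with keep_case=False or keep_punct=False gives the variants needed to compare inputs with and without capitalization and punctuation.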


