Span Classification with Structured Information for Disfluency Detection in Spoken Utterances

03/30/2022
by Sreyan Ghosh, et al.

Existing approaches to disfluency detection focus on token-level classification to identify and remove disfluencies in text. Moreover, most work leverages only the contextual information captured by the linear token sequence, ignoring the structured information that dependency trees capture efficiently. In this paper, building on the span classification paradigm of entity recognition, we propose a novel architecture for detecting disfluencies in transcripts of spoken utterances. It incorporates both contextual information, through transformers, and long-distance structured information captured by dependency trees, through graph convolutional networks (GCNs). Experimental results show that our proposed model achieves state-of-the-art results on the widely used English Switchboard corpus for disfluency detection and outperforms prior art by a significant margin. We make all our code publicly available on GitHub (https://github.com/Sreyan88/Disfluency-Detection-with-Span-Classification).
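To make the two ingredients named in the abstract concrete, the following is a minimal, dependency-free sketch of span enumeration plus one graph-convolution step over a dependency tree. It is not the authors' implementation: the function names, the toy one-hot features (standing in for transformer outputs), the hypothetical head indices, the unweighted mean aggregation, and the start/end concatenation used as a span representation are all assumptions made for illustration.

```python
from typing import List, Tuple


def enumerate_spans(n_tokens: int, max_len: int) -> List[Tuple[int, int]]:
    """Return all (start, end) spans, end inclusive, of at most max_len tokens."""
    return [
        (i, j)
        for i in range(n_tokens)
        for j in range(i, min(i + max_len, n_tokens))
    ]


def dependency_adjacency(heads: List[int]) -> List[List[float]]:
    """Build a symmetric adjacency matrix with self-loops.

    heads[i] is the index of token i's head; -1 marks the root.
    """
    n = len(heads)
    adj = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for child, head in enumerate(heads):
        if head >= 0:
            adj[child][head] = 1.0
            adj[head][child] = 1.0
    return adj


def gcn_layer(adj: List[List[float]], feats: List[List[float]]) -> List[List[float]]:
    """One propagation step: average each token's neighbours, then ReLU.

    A real GCN layer would also multiply by a learned weight matrix;
    that is omitted here to keep the sketch self-contained.
    """
    dim = len(feats[0])
    out = []
    for row in adj:
        degree = sum(row)
        agg = [
            sum(row[j] * feats[j][d] for j in range(len(feats))) / degree
            for d in range(dim)
        ]
        out.append([max(0.0, v) for v in agg])
    return out


def span_representation(hidden: List[List[float]], span: Tuple[int, int]) -> List[float]:
    """Represent a span by concatenating its start and end token vectors (an assumption)."""
    start, end = span
    return hidden[start] + hidden[end]


# Toy input: three tokens with hypothetical head indices (not a real parse).
tokens = ["boston", "uh", "denver"]
heads = [2, 2, -1]
feats = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # stand-in embeddings

spans = enumerate_spans(len(tokens), max_len=2)
hidden = gcn_layer(dependency_adjacency(heads), feats)
span_vectors = {span: span_representation(hidden, span) for span in spans}
```

In a full model each vector in `span_vectors` would be passed to a classifier that labels the span as disfluent or fluent; that classifier, like the learned GCN weights, is left out of this sketch.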


