Improving coreference resolution with automatically predicted prosodic information

07/28/2017
by Ina Rösiger, et al.

Adding manually annotated prosodic information, specifically pitch accents and phrasing, to the typical text-based feature set for coreference resolution has previously been shown to have a positive effect on German data. Practical applications on spoken language, however, would rely on automatically predicted prosodic information. In this paper, we predict pitch accents (and phrase boundaries) from acoustic features extracted from the speech signal, using a convolutional neural network (CNN) model. After assessing the quality of these automatic prosodic annotations, we show that they also significantly improve coreference resolution.

research
03/15/2019

Automatic assessment of spoken language proficiency of non-native children

This paper describes technology developed to automatically grade Italian...
research
04/24/2017

Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing

In conversational speech, the acoustic signal provides cues that help li...
research
06/15/2021

Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis

Text does not fully specify the spoken form, so text-to-speech models mu...
research
10/26/2016

Automatic measurement of vowel duration via structured prediction

A key barrier to making phonetic studies scalable and replicable is the ...
research
06/02/2017

Prosodic Event Recognition using Convolutional Neural Networks with Context Information

This paper demonstrates the potential of convolutional neural networks (...
research
06/16/2019

Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary Detection

The common practice in coreference resolution is to identify and evaluat...
