Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations

08/06/2019
by   Aarne Talman, et al.
0

In this paper we introduce a new natural language processing dataset and benchmark for predicting prosodic prominence from written text. To our knowledge this will be the largest publicly available dataset with prosodic labels. We describe the dataset construction and the resulting benchmark dataset in detail and train a number of different models ranging from feature-based classifiers to neural network systems for the prediction of discretized prosodic prominence. We show that pre-trained contextualized word representations from BERT outperform the other models even with less than 10 of the training data. Finally we discuss the dataset in light of the results and point to future research and plans for further improving both the dataset and methods of predicting prosodic prominence from text. The dataset and the code for the models are publicly available.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 6

page 8

page 9

page 10

research
12/26/2017

Advances in Pre-Training Distributed Word Representations

Many Natural Language Processing applications nowadays rely on pre-train...
research
01/23/2018

The WiLI benchmark dataset for written language identification

This paper describes the WiLI-2018 benchmark dataset for monolingual wri...
research
07/14/2023

TVPR: Text-to-Video Person Retrieval and a New Benchmark

Most existing methods for text-based person retrieval focus on text-to-i...
research
03/03/2019

Predicting Algorithm Classes for Programming Word Problems

We introduce the task of algorithm class prediction for programming word...
research
07/28/2021

MWP-BERT: A Strong Baseline for Math Word Problems

Math word problem (MWP) solving is the task of transforming a sequence o...
research
06/01/2019

"President Vows to Cut <Taxes> Hair": Dataset and Analysis of Creative Text Editing for Humorous Headlines

We introduce, release, and analyze a new dataset, called Humicroedit, fo...
research
04/10/2022

Pushing on Personality Detection from Verbal Behavior: A Transformer Meets Text Contours of Psycholinguistic Features

Research at the intersection of personality psychology, computer science...

Please sign up or login with your details

Forgot password? Click here to reset