Sequence Segmentation Using Joint RNN and Structured Prediction Models

10/25/2016
by Yossi Adi, et al.

We describe and analyze a simple and effective algorithm for sequence segmentation applied to speech processing tasks. We propose a neural architecture composed of two jointly trained modules: a recurrent neural network (RNN) module and a structured prediction model. The RNN outputs serve as feature functions for the structured model. The overall model is trained with a structured loss function, which can be tailored to the given segmentation task. We demonstrate the effectiveness of our method by applying it to two simple tasks commonly used in phonetic studies: word segmentation and voice onset time segmentation. Results suggest the proposed model is superior to previous methods, obtaining state-of-the-art results on the tested datasets.
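To make the idea of a task-specific structured loss concrete, the following is a minimal sketch and not the authors' implementation. It assumes the RNN has already produced two per-frame score lists (`on_scores` and `off_scores`, both hypothetical names), that a segmentation is a single `(onset, offset)` pair, and that the task cost is the summed boundary deviation in frames beyond a tolerance. The loss is a standard structured hinge computed with loss-augmented inference; the abstract does not specify any of these details.

```python
from itertools import combinations


def boundary_cost(gold, pred, tolerance=0):
    """Assumed task cost: frames of boundary error beyond a tolerance."""
    return sum(max(0, abs(g - p) - tolerance) for g, p in zip(gold, pred))


def predict(on_scores, off_scores, gold=None, cost=None):
    """Exhaustive search for the best (onset, offset) with onset < offset.

    When `gold` and `cost` are given, the cost is added to each candidate's
    score, which turns the search into loss-augmented inference.
    """
    best, best_val = None, float("-inf")
    for on, off in combinations(range(len(on_scores)), 2):
        val = on_scores[on] + off_scores[off]
        if gold is not None:
            val += cost(gold, (on, off))
        if val > best_val:
            best, best_val = (on, off), val
    return best, best_val


def structured_hinge(on_scores, off_scores, gold):
    """Structured hinge loss: max_y [score(y) + cost(gold, y)] - score(gold)."""
    _, augmented_val = predict(on_scores, off_scores, gold, boundary_cost)
    gold_val = on_scores[gold[0]] + off_scores[gold[1]]
    return max(0.0, augmented_val - gold_val)


# Toy per-frame scores standing in for RNN outputs (illustrative values only).
on_scores = [0.0, 2.0, 0.0, 0.0]
off_scores = [0.0, 0.0, 0.0, 3.0]
segmentation, score = predict(on_scores, off_scores)
loss = structured_hinge(on_scores, off_scores, gold=(1, 3))
```

In a jointly trained system the gradient of such a loss would flow back through the scores into the RNN parameters. The exhaustive search is quadratic in the number of frames and is used here only for clarity.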

research
01/02/2018

Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer

We investigate training end-to-end speech recognition models with the re...
research
10/27/2019

Dr.VOT : Measuring Positive and Negative Voice Onset Time in the Wild

Voice Onset Time (VOT), a key measurement of speech for basic research a...
research
09/27/2017

An attentive neural architecture for joint segmentation and parsing and its application to real estate ads

In this paper we develop a relatively simple and effective neural joint ...
research
08/01/2016

Blind phoneme segmentation with temporal prediction errors

Phonemic segmentation of speech is a critical step of speech recognition...
research
11/28/2017

Recurrent Segmentation for Variable Computational Budgets

State-of-the-art systems for semantic image segmentation utilize feed-fo...
research
09/25/2022

Towards Stable Co-saliency Detection and Object Co-segmentation

In this paper, we present a novel model for simultaneous stable co-salie...
research
04/05/2017

Automatic Measurement of Pre-aspiration

Pre-aspiration is defined as the period of glottal friction occurring in...
