Speaker-Utterance Dual Attention for Speaker and Utterance Verification

08/20/2020
by   Tianchi Liu, et al.
0

In this paper, we study a novel technique that exploits the interaction between speaker traits and linguistic content to improve both speaker verification and utterance verification performance. We implement an idea of speaker-utterance dual attention (SUDA) in a unified neural network. The dual attention refers to an attention mechanism for the two tasks of speaker and utterance verification. The proposed SUDA features an attention mask mechanism to learn the interaction between the speaker and utterance information streams. This helps to focus only on the required information for respective task by masking the irrelevant counterparts. The studies conducted on RSR2015 corpus confirm that the proposed SUDA outperforms the framework without attention mask as well as several competitive systems for both speaker and utterance verification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/03/2017

End-to-End Attention based Text-Dependent Speaker Verification

A new type of End-to-End system for text-dependent speaker verification ...
research
10/17/2019

H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model

In this paper, a hierarchical attention network to generate utterance-le...
research
11/03/2020

Small footprint Text-Independent Speaker Verification for Embedded Systems

Deep neural network approaches to speaker verification have proven succe...
research
02/03/2022

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

The time delay neural network (TDNN) represents one of the state-of-the-...
research
11/03/2019

Interpreting Verbal Irony: Linguistic Strategies and the Connection to theType of Semantic Incongruity

Human communication often involves the use of verbal irony or sarcasm, w...
research
11/03/2019

Interpreting Verbal Irony: Linguistic Strategies and the Connection to the Type of Semantic Incongruity

Human communication often involves the use of verbal irony or sarcasm, w...
research
10/22/2020

Graph Attention Networks for Speaker Verification

This work presents a novel back-end framework for speaker verification u...

Please sign up or login with your details

Forgot password? Click here to reset