Graph attentive feature aggregation for text-independent speaker verification

12/23/2021
by   Hye-Jin Shim, et al.
0

The objective of this paper is to combine multiple frame-level features into a single utterance-level representation considering pairwise relationship. For this purpose, we propose a novel graph attentive feature aggregation module by interpreting each frame-level feature as a node of a graph. The inter-relationship between all possible pairs of features, typically exploited indirectly, can be directly modeled using a graph. The module comprises a graph attention layer and a graph pooling layer followed by a readout operation. The graph attention layer first models the non-Euclidean data manifold between different nodes. Then, the graph pooling layer discards less informative nodes considering the significance of the nodes. Finally, the readout operation combines the remaining nodes into a single representation. We employ two recent systems, SE-ResNet and RawNet2, with different input features and architectures and demonstrate that the proposed feature aggregation module consistently shows a relative improvement over 10

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2022

Transport-Oriented Feature Aggregation for Speaker Embedding Learning

Pooling is needed to aggregate frame-level features into utterance-level...
research
01/30/2020

Structure-Feature based Graph Self-adaptive Pooling

Various methods to deal with graph data have been proposed in recent yea...
research
06/25/2021

Phoneme-aware and Channel-wise Attentive Learning for Text DependentSpeaker Verification

This paper proposes a multi-task learning network with phoneme-aware and...
research
05/14/2020

ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification

Current speaker verification techniques rely on a neural network to extr...
research
09/23/2022

Multi-Granularity Graph Pooling for Video-based Person Re-Identification

The video-based person re-identification (ReID) aims to identify the giv...
research
05/21/2022

Micro-video recommendation model based on graph neural network and attention mechanism

With the rapid development of Internet technology and the comprehensive ...
research
10/05/2020

Graph Cross Networks with Vertex Infomax Pooling

We propose a novel graph cross network (GXN) to achieve comprehensive fe...

Please sign up or login with your details

Forgot password? Click here to reset