Speaker attribution with voice profiles by graph-based semi-supervised learning

02/06/2021
by   Jixuan Wang, et al.
0

Speaker attribution is required in many real-world applications, such as meeting transcription, where speaker identity is assigned to each utterance according to speaker voice profiles. In this paper, we propose to solve the speaker attribution problem by using graph-based semi-supervised learning methods. A graph of speech segments is built for each session, on which segments from voice profiles are represented by labeled nodes while segments from test utterances are unlabeled nodes. The weight of edges between nodes is evaluated by the similarities between the pretrained speaker embeddings of speech segments. Speaker attribution then becomes a semi-supervised learning problem on graphs, on which two graph-based methods are applied: label propagation (LP) and graph neural networks (GNNs). The proposed approaches are able to utilize the structural information of the graph to improve speaker attribution performance. Experimental results on real meeting data show that the graph based approaches reduce speaker attribution error by up to 68 compared to a baseline speaker identification approach that processes each utterance independently.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2021

Graph-based Label Propagation for Semi-Supervised Speaker Identification

Speaker identification in the household scenario (e.g., for smart speake...
research
03/01/2022

Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR

Graph-based temporal classification (GTC), a generalized form of the con...
research
07/08/2022

Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification

Speaker identification (SID) in the household scenario (e.g., for smart ...
research
02/23/2020

End-To-End Graph-based Deep Semi-Supervised Learning

The quality of a graph is determined jointly by three key factors of the...
research
07/07/2023

Improving Automatic Quotation Attribution in Literary Novels

Current models for quotation attribution in literary novels assume varyi...
research
06/08/2020

Speaker Diarization as a Fully Online Learning Problem in MiniVox

We proposed a novel AI framework to conduct real-time multi-speaker diar...
research
07/23/2020

Grale: Designing Networks for Graph Learning

How can we find the right graph for semi-supervised learning? In real wo...

Please sign up or login with your details

Forgot password? Click here to reset