Joint speaker diarisation and tracking in switching state-space model

09/23/2021
by   Jeremy H. M. Wong, et al.
0

Speakers may move around while diarisation is being performed. When a microphone array is used, the instantaneous locations of where the sounds originated from can be estimated, and previous investigations have shown that such information can be complementary to speaker embeddings in the diarisation task. However, these approaches often assume that speakers are fairly stationary throughout a meeting. This paper relaxes this assumption, by proposing to explicitly track the movements of speakers while jointly performing diarisation within a unified model. A state-space model is proposed, where the hidden state expresses the identity of the current active speaker and the predicted locations of all speakers. The model is implemented as a particle filter. Experiments on a Microsoft rich meeting transcription task show that the proposed joint location tracking and diarisation approach is able to perform comparably with other methods that use location information.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2021

Diarisation using location tracking with agglomerative clustering

Previous works have shown that spatial location information can be compl...
research
10/28/2017

Jointly Tracking and Separating Speech Sources Using Multiple Features and the generalized labeled multi-Bernoulli Framework

This paper proposes a novel joint multi-speaker tracking-and-separation ...
research
07/20/2021

A Real-time Speaker Diarization System Based on Spatial Spectrum

In this paper we describe a speaker diarization system that enables loca...
research
07/13/2020

DNN Speaker Tracking with Embeddings

In multi-speaker applications is common to have pre-computed models from...
research
12/04/2018

Intensity Particle Flow SMC-PHD Filter For Audio Speaker Tracking

Non-zero diffusion particle flow Sequential Monte Carlo probability hypo...
research
02/20/2023

Differentiable Bootstrap Particle Filters for Regime-Switching Models

Differentiable particle filters are an emerging class of particle filter...
research
12/11/2018

A cascaded multiple-speaker localization and tracking system

This paper presents an online multiple-speaker localization and tracking...

Please sign up or login with your details

Forgot password? Click here to reset