Large-scale Speaker Retrieval on Random Speaker Variability Subspace

11/27/2018
by   Suwon Shon, et al.
0

This paper describes a fast speaker search system to retrieve segments of the same voice identity in the large-scale data. Locality Sensitive Hashing (LSH) is a fast nearest neighbor search algorithm and the recent study shows that LSH enables quick retrieval of a relevant voice in the large-scale data in conjunction with i-vector while maintaining accuracy. In this paper, we proposed Random Speaker-variability Subspace (RSS) projection to map a data into hash tables. We hypothesized that rather than projecting on random subspace, projecting on speaker variability space would give more chance to put the same speaker representation into the same hash bins, so we can use less number of hash tables. We use Linear Discriminant Analysis (LDA) to generate speaker variability subspace projection matrix. Additionally, a random subset of the speaker in the training data was chosen for speaker label for LDA to produce multiple RSS. From the experimental result, the proposed approach shows 100 times and 7 times faster than the linear search and LSH, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2018

Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition

Linear Discriminant Analysis (LDA) has been used as a standard post-proc...
research
11/15/2022

Rapid Connectionist Speaker Adaptation

We present SVCnet, a system for modelling speaker variability. Encoder N...
research
04/18/2019

Query-Adaptive Hash Code Ranking for Large-Scale Multi-View Visual Search

Hash based nearest neighbor search has become attractive in many applica...
research
03/10/2021

MP-RW-LSH: An Efficient Multi-Probe LSH Solution to ANNS in L_1 Distance

Approximate Nearest Neighbor Search (ANNS) is a fundamental algorithmic ...
research
06/05/2023

Large-Scale Distributed Learning via Private On-Device Locality-Sensitive Hashing

Locality-sensitive hashing (LSH) based frameworks have been used efficie...
research
10/30/2020

Deep generative LDA

Linear discriminant analysis (LDA) is a popular tool for classification ...
research
11/08/2018

Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search

The popularization of science can often be disregarded by scientists as ...

Please sign up or login with your details

Forgot password? Click here to reset