An Environmental Feature Representation in I-vector Space for Room Verification and Metadata Estimation

03/09/2022
by   Desmond Caulley, et al.

This paper investigates environmental feature representations for room verification and acoustic metadata estimation. Audio recordings contain both speaker and non-speaker information. We refer to representations of the non-speaker information, including channel and other environmental factors, as e-vectors. I-vectors, commonly used in speaker identification, are extracted in the total variability space and capture both speaker and channel-environment information without discrimination. Accordingly, e-vectors can be extracted from i-vectors using methods such as linear discriminant analysis. In this paper, we first demonstrate that e-vectors can be applied to room verification tasks with a low equal error rate. Second, we propose two methods for estimating metadata, namely signal-to-noise ratio (SNR) and reverberation time (T60), from these e-vectors. Compared with contemporary global SNR estimation methods, our system performs favorably in terms of accuracy, even with low-dimensional i-vectors. Lastly, we show that room verification can be improved if e-vectors are augmented with the extracted metadata.
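
As a rough illustration of the pipeline the abstract describes, the sketch below projects i-vector-like features with linear discriminant analysis using room labels, scores pairs of the projected vectors, and computes an equal error rate. It is not the authors' implementation: the data are synthetic, the names `fit_lda` and `equal_error_rate` are hypothetical, and cosine scoring is an assumption, since the abstract does not state which scoring method is used.

```python
import numpy as np

def fit_lda(X, y, n_components):
    """Return a projection that maximizes between-class over within-class scatter."""
    overall_mean = X.mean(axis=0)
    dim = X.shape[1]
    Sw = np.zeros((dim, dim))  # within-class scatter
    Sb = np.zeros((dim, dim))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        class_mean = Xc.mean(axis=0)
        Sw += (Xc - class_mean).T @ (Xc - class_mean)
        diff = (class_mean - overall_mean)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    Sw += 1e-6 * np.eye(dim)  # small ridge for numerical stability
    evals, evecs = np.linalg.eig(np.linalg.solve(Sw, Sb))
    order = np.argsort(-evals.real)[:n_components]
    return evecs[:, order].real

def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over the score range."""
    lo = min(target_scores.min(), nontarget_scores.min())
    hi = max(target_scores.max(), nontarget_scores.max())
    thresholds = np.linspace(lo, hi, 1001)
    far = np.array([(nontarget_scores >= t).mean() for t in thresholds])
    frr = np.array([(target_scores < t).mean() for t in thresholds])
    i = np.argmin(np.abs(far - frr))
    return (far[i] + frr[i]) / 2

# Synthetic stand-in for i-vectors (assumption): each recording is a room
# offset plus random variability that plays the role of speaker content.
rng = np.random.default_rng(0)
n_rooms, n_per_room, dim = 5, 40, 20
room_means = rng.normal(0.0, 1.0, (n_rooms, dim))
labels = np.repeat(np.arange(n_rooms), n_per_room)
ivectors = room_means[labels] + rng.normal(0.0, 1.0, (len(labels), dim))

# "e-vectors": i-vectors projected onto room-discriminative directions.
W = fit_lda(ivectors, labels, n_components=n_rooms - 1)
e_vectors = (ivectors - ivectors.mean(axis=0)) @ W

# Cosine scoring of all recording pairs (assumed scoring method).
normed = e_vectors / np.linalg.norm(e_vectors, axis=1, keepdims=True)
scores = normed @ normed.T
same_room = labels[:, None] == labels[None, :]
upper = np.triu_indices(len(labels), k=1)
target = scores[upper][same_room[upper]]
nontarget = scores[upper][~same_room[upper]]

eer = equal_error_rate(target, nontarget)
print(f"room verification EER on synthetic data: {eer:.3f}")
```

Because the projection is fitted and scored on the same synthetic recordings, the printed value says nothing about real-world performance; a realistic evaluation would use held-out rooms and real i-vectors.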


research
09/13/2019

Probing the Information Encoded in x-vectors

Deep neural network based speaker embeddings, such as x-vectors, have be...
research
01/28/2022

Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System

Audio analysis for forensic speaker verification offers unique challenge...
research
11/16/2022

Speaker Adaptation for End-To-End Speech Recognition Systems in Noisy Environments

We analyze the impact of speaker adaptation in end-to-end architectures ...
research
02/10/2020

An empirical analysis of information encoded in disentangled neural speaker representations

The primary characteristic of robust speaker representations is that the...
research
06/12/2013

Robust Support Vector Machines for Speaker Verification Task

An important step in speaker verification is extracting features that be...
research
11/24/2021

A Study on Decoupled Probabilistic Linear Discriminant Analysis

Probabilistic linear discriminant analysis (PLDA) has broad application ...
research
09/29/2017

PLDA-Based Diarization of Telephone Conversations

This paper investigates the application of the probabilistic linear disc...
