Speaker discrimination in humans and machines: Effects of speaking style variability

08/08/2020
by   Amber Afshan, et al.
0

Does speaking style variation affect humans' ability to distinguish individuals from their voices? How do humans compare with automatic systems designed to discriminate between voices? In this paper, we attempt to answer these questions by comparing human and machine speaker discrimination performance for read speech versus casual conversations. Thirty listeners were asked to perform a same versus different speaker task. Their performance was compared to a state-of-the-art x-vector/PLDA-based automatic speaker verification system. Results showed that both humans and machines performed better with style-matched stimuli, and human performance was better when listeners were native speakers of American English. Native listeners performed better than machines in the style-matched conditions (EERs of 6.96 14.35 style-mismatched conditions, there was no significant difference between native listeners and machines. In all conditions, fusing human responses with machine results showed improvements compared to each alone, suggesting that humans and machines have different approaches to speaker discrimination tasks. Differences in the approaches were further confirmed by examining results for individual speakers which showed that the perception of distinct and confused speakers differed between human listeners and machines.

READ FULL TEXT

Authors

page 1

page 2

page 3

page 4

06/28/2022

Learning from human perception to improve automatic speaker verification in style-mismatched conditions

Our prior experiments show that humans and machines seem to employ diffe...
08/15/2019

Speaker Verification Using Simple Temporal Features and Pitch Synchronous Cepstral Coefficients

Speaker verification is the process by which a speakers claim of identit...
10/12/2020

Perceptimatic: A human speech perception benchmark for unsupervised subword modelling

In this paper, we present a data set and methods to compare speech proce...
11/13/2017

Gerrymandering and Computational Redistricting

Partisan gerrymandering poses a threat to democracy. Moreover, the compl...
07/18/2016

Is spoken language all-or-nothing? Implications for future speech-based human-machine interaction

Recent years have seen significant market penetration for voice-based pe...
03/13/2016

Learning Typographic Style

Typography is a ubiquitous art form that affects our understanding, perc...
01/31/2019

Rhythm Zone Theory: Speech Rhythms are Physical after all

Speech rhythms have been dealt with in three main ways: from the introsp...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.