Investigation of Synthetic Speech Detection Using Frame- and Segment-Specific Importance Weighting

10/10/2016
by   Ali Khodabakhsh, et al.
0

Speaker verification systems are vulnerable to spoofing attacks which presents a major problem in their real-life deployment. To date, most of the proposed synthetic speech detectors (SSDs) have weighted the importance of different segments of speech equally. However, different attack methods have different strengths and weaknesses and the traces that they leave may be short or long term acoustic artifacts. Moreover, those may occur for only particular phonemes or sounds. Here, we propose three algorithms that weigh likelihood-ratio scores of individual frames, phonemes, and sound-classes depending on their importance for the SSD. Significant improvement over the baseline system has been obtained for known attack methods that were used in training the SSDs. However, improvement with unknown attack types was not substantial. Thus, the type of distortions that were caused by the unknown systems were different and could not be captured better with the proposed SSD compared to the baseline SSD.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2018

Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems

Automatic speaker verification (ASV) systems use a playback detector to ...
research
10/28/2022

Universal speaker recognition encoders for different speech segments duration

Creating universal speaker encoders which are robust for different acous...
research
02/28/2022

Explainable deepfake and spoofing detection: an attack analysis using SHapley Additive exPlanations

Despite several years of research in deepfake and spoofing detection for...
research
07/24/2015

The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge

Many existing speaker verification systems are reported to be vulnerable...
research
05/24/2017

Audio-replay attack detection countermeasures

This paper presents the Speech Technology Center (STC) replay attack det...
research
11/03/2020

Training Wake Word Detection with Synthesized Speech Data on Confusion Words

Confusing-words are commonly encountered in real-life keyword spotting a...

Please sign up or login with your details

Forgot password? Click here to reset