Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

11/12/2019
by   Guillermo Cámbara, et al.
0

The use of photoplethysmogram signal (PPG) for heart and sleep monitoring is commonly found nowadays in smartphones and wrist wearables. Besides common usages, it has been proposed and reported that person information can be extracted from PPG for other uses, like biometry tasks. In this work, we explore several end-to-end convolutional neural network architectures for detection of human's characteristics such as gender or person identity. In addition, we evaluate whether speech/non-speech events may be inferred from PPG signal, where speech might translate in fluctuations into the pulse signal. The obtained results are promising and clearly show the potential of fully end-to-end topologies for automatic extraction of meaningful biomarkers, even from a noisy signal sampled by a low-cost PPG sensor. The AUCs for best architectures put forward PPG wave as biological discriminant, reaching 79% and 89.0%, respectively for gender and person verification tasks. Furthermore, speech detection experiments reporting AUCs around 69% encourage us for further exploration about the feasibility of PPG for speech processing tasks.

READ FULL TEXT
research
06/17/2019

Robust End to End Speaker Verification Using EEG

In this paper we demonstrate that performance of a speaker verification ...
research
10/03/2021

PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction

Speech enhancement aims to improve the perceptual quality of the speech ...
research
04/22/2021

Protecting gender and identity with disentangled speech representations

Besides its linguistic content, our speech is rich in biometric informat...
research
08/01/2017

Improved Speech Reconstruction from Silent Video

Speechreading is the task of inferring phonetic information from visuall...
research
01/02/2017

Vid2speech: Speech Reconstruction from Silent Video

Speechreading is a notoriously difficult task for humans to perform. In ...
research
12/15/2020

Automatic Speech Verification Spoofing Detection

Automatic speech verification (ASV) is the technology to determine the i...
research
11/29/2021

Speech Tasks Relevant to Sleepiness Determined with Deep Transfer Learning

Excessive sleepiness in attention-critical contexts can lead to adverse ...

Please sign up or login with your details

Forgot password? Click here to reset