Deep Neural Network Voice Activity Detector for Downsampled Audio Data: An Experiment Report

08/12/2021
by Mikael Ovaska, et al.

Sociometric badges are an emerging technology for studying how teams interact in physical places. Audio data recorded by sociometric badges is often downsampled to avoid recording the badge holders' discussions. To gain more information about interactions inside teams wearing sociometric badges, a Voice Activity Detector (VAD) is deployed to measure the verbal activity of the interaction. Detecting voice activity from downsampled audio data is challenging because downsampling destroys information in the data. We developed a VAD using deep learning techniques that achieves only moderate accuracy in a low-noise meeting setting and across variable noise levels, despite excellent validation performance. Experiences and lessons learned while developing the VAD are discussed.

research
03/09/2020

Crossmodal learning for audio-visual speech event localization

An objective understanding of media depictions, such as about inclusive ...
research
11/28/2016

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

We propose a single neural network architecture for two tasks: on-line k...
research
12/04/2017

Precision Scaling of Neural Networks for Efficient Audio Processing

While deep neural networks have shown powerful performance in many audio...
research
08/21/2020

RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns

Voice Activity Detection (VAD) refers to the task of identification of r...
research
11/08/2019

Voice Activity Detection in presence of background noise using EEG

In this paper we demonstrate that performance of voice activity detectio...
research
03/03/2023

SottoVoce: An Ultrasound Imaging-Based Silent Speech Interaction Using Deep Neural Networks

The availability of digital devices operated by voice is expanding rapid...
research
08/09/2021

Experiences with the Introduction of AI-based Tools for Moderation Automation of Voice-based Participatory Media Forums

Voice-based discussion forums where users can record audio messages whic...
