Speech Enhancement for Virtual Meetings on Cellular Networks

02/02/2023
by   Hojeong Lee, et al.
0

We study speech enhancement using deep learning (DL) for virtual meetings on cellular devices, where transmitted speech has background noise and transmission loss that affects speech quality. Since the Deep Noise Suppression (DNS) Challenge dataset does not contain practical disturbance, we collect a transmitted DNS (t-DNS) dataset using Zoom Meetings over T-Mobile network. We select two baseline models: Demucs and FullSubNet. The Demucs is an end-to-end model that takes time-domain inputs and outputs time-domain denoised speech, and the FullSubNet takes time-frequency-domain inputs and outputs the energy ratio of the target speech in the inputs. The goal of this project is to enhance the speech transmitted over the cellular networks using deep learning models.

READ FULL TEXT

page 6

page 11

page 12

research
01/22/2023

Cellular Network Speech Enhancement: Removing Background and Transmission Noise

The primary objective of speech enhancement is to reduce background nois...
research
06/04/2021

A Database for Research on Detection and Enhancement of Speech Transmitted over HF links

In this paper we present an open database for the development of detecti...
research
10/03/2021

Progressive Transmission and Inference of Deep Learning Models

Modern image files are usually progressively transmitted and provide a p...
research
08/24/2020

AMRConvNet: AMR-Coded Speech Enhancement Using Convolutional Neural Networks

Speech is converted to digital signals using speech coding for efficient...
research
01/22/2021

Towards efficient models for real-time deep noise suppression

With recent research advancements, deep learning models are becoming att...
research
10/20/2020

Investigating Cross-Domain Losses for Speech Enhancement

Recent years have seen a surge in the number of available frameworks for...
research
11/05/2019

Speech Enhancement via Deep Spectrum Image Translation Network

Quality and intelligibility of speech signals are degraded under additiv...

Please sign up or login with your details

Forgot password? Click here to reset