INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

04/02/2021
by   Wei Rao, et al.
0

The ConferencingSpeech 2021 challenge is proposed to stimulate research on far-field multi-channel speech enhancement for video conferencing. The challenge consists of two separate tasks: 1) Task 1 is multi-channel speech enhancement with single microphone array and focusing on practical application with real-time requirement and 2) Task 2 is multi-channel speech enhancement with multiple distributed microphone arrays, which is a non-real-time track and does not have any constraints so that participants could explore any algorithms to obtain high speech quality. Targeting the real video conferencing room application, the challenge database was recorded from real speakers and all recording facilities were located by following the real setup of conferencing room. In this challenge, we open-sourced the list of open source clean speech and noise datasets, simulation scripts, and a baseline system for participants to develop their own system. The final ranking of the challenge will be decided by the subjective evaluation which is performed using Absolute Category Ratings (ACR) to estimate Mean Opinion Score (MOS), speech MOS (S-MOS), and noise MOS (N-MOS). This paper describes the challenge, tasks, datasets, and subjective evaluation. The baseline system which is a complex ratio mask based neural network and its experimental results are also presented.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2021

SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing

This paper presents the details of the SRIB-LEAP submission to the Confe...
research
06/16/2022

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

In real life, room effect, also known as room reverberation, and the pre...
research
03/11/2023

TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation

This paper describes aecX team's entry to the ICASSP 2023 acoustic echo ...
research
11/04/2020

IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines

The IEEE Spoken Language Technology Workshop (SLT) 2021 Alpha-mini Speec...
research
09/17/2019

A scalable noisy speech dataset and online subjective test framework

Background noise is a major source of quality impairments in Voice over ...
research
04/08/2021

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

In this paper, we present AISHELL-4, a sizable real-recorded Mandarin sp...
research
02/21/2022

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

The L3DAS22 Challenge is aimed at encouraging the development of machine...

Please sign up or login with your details

Forgot password? Click here to reset