In this paper, we present a statistical beamforming algorithm as a pre-p...
With the advent of general-purpose speech representations from large-sca...
Inspired by humans comprehending speech in a multi-modal manner, various...
Multilingual speech data often suffer from long-tailed language distribu...
This paper addresses the noisy label issue in audio event detection (AED...
In general, the performance of automatic speech recognition (ASR) system...
We propose a novel reflection color model consisting of body essence and...