Weakly-supervised learning has emerged as a promising approach to levera...
Learning meaningful frame-wise features on a partially labeled dataset i...
Text-based audio generation models have limitations as they cannot encom...
In this paper, we describe in detail our system for DCASE 2022 Task4. Th...
Recently, an event-based end-to-end model (SEDT) has been proposed for s...
The recently proposed Mean Teacher has achieved state-of-the-art results...
Sound event detection (SED) has gained increasing attention with its wid...