Most existing audio-text retrieval (ATR) methods focus on constructing c...
In text-audio retrieval (TAR) tasks, due to the heterogeneity of content...
Existing weakly supervised sound event detection (WSSED) work has not ex...
Existing deep-learning-based speech enhancement (SE) methods either use ...