Much research effort is being applied to the task of compressing the
kno...
Diffusion models have shown promising results in speech enhancement, usi...
Speech emotion conversion is the task of converting the expressed emotio...
Several recent contributions in the field of iterative STFT phase retrie...
In this paper we present a method for single-channel wind noise reductio...
We present in this paper an informed single-channel dereverberation meth...
Since its inception, the field of deep speech enhancement has been domin...
Speech emotion conversion aims to convert the expressed emotion of a spo...
This paper introduces an audio-visual speech enhancement system that
lev...
We propose Audio-Visual Lightweight ITerative model (AVLIT), an effectiv...
In large part due to their implicit semantic modeling, self-supervised
l...
Supervised masking approaches in the time-frequency domain aim to employ...
In a multi-channel separation task with multiple speakers, we aim to rec...
Human-robot interaction relies on a noise-robust audio processing module...
In this paper, we present a causal speech signal improvement system that...
In this paper, we present a scheme for extending deep neural network-bas...
Recently, score-based generative models have been successfully employed ...
Diffusion models have shown a great ability at bridging the performance ...
Single-channel deep speech enhancement approaches often estimate a singl...
In this work, we utilize the high-fidelity generation abilities of diffu...
Diffusion probabilistic models have been recently used in a variety of t...
In a scenario with multiple persons talking simultaneously, the spatial
...
Diffusion-based generative models have had a high impact on the computer...
Recently, diffusion-based generative models have been introduced to the ...
As different people perceive others' emotional expressions differently, ...
The key advantage of using multiple microphones for speech enhancement i...
The SepFormer architecture shows very good results in speech separation....
Employing deep neural networks (DNNs) to directly learn filters for
mult...
Phase retrieval is a problem encountered not only in speech and audio
pr...
In this paper, a neural network-augmented algorithm for noise-robust onl...
A two-stage online dereverberation algorithm for hearing devices is pres...
This work focuses on online dereverberation for hearing devices using th...
Score-based generative models (SGMs) have recently shown impressive resu...
While phase-aware speech processing has been receiving increasing attent...
Speech enhancement in the time-frequency domain is often performed by
es...
Recent advances in the design of neural network architectures, in partic...
Emotions are subjective constructs. Recent end-to-end speech emotion
rec...
Recently, the standard variational autoencoder has been successfully use...
The majority of multichannel speech enhancement algorithms are two-step
...
Recently, a generative variational autoencoder (VAE) has been proposed f...
Recently, variational autoencoders have been successfully used to learn ...
Reinforcement learning is a promising method to accomplish robotic contr...
This paper analyzes the generalization of speech enhancement algorithms ...
Robust and accurate estimation of liquid height lies as an essential par...
In this work, we investigate if the learned encoder of the end-to-end
co...
In this paper, we focus on the challenging perception problem in robotic...
Enhancing noisy speech is an important task to restore its quality and t...
Enhancing noisy speech is an important task to restore its quality and t...
This report presents our audio event detection system submitted for Task...