The image source method (ISM) is often used to simulate room acoustics d...
Knowing the room geometry may be very beneficial for many audio applicat...
In TV services, dialogue level personalization is key to meeting user
pr...
We propose a Beamformer-guided Target Speaker Extraction (BG-TSE) method...
We consider the task of region-based source separation of reverberant
mu...
A study is presented in which a contrastive learning approach is used to...
Research into multi-modal perception, human cognition, behavior, and
att...
A dataset of anechoic recordings of various sound sources encountered in...
Verifying the identity of a speaker is crucial in modern human-machine
i...
The estimation of reverberation time from real-world signals plays a cen...
In recent years, researchers have become increasingly interested in spea...
The direction-of-arrival (DOA) of sound sources is an essential acoustic...
State-of-the-art separation of desired signal components from a mixture ...
Deep learning (DL) based direction of arrival (DOA) estimation is an act...
Audio-visual speech enhancement (AVSE) methods use both audio and visual...
The performance of machine learning algorithms is known to be negatively...
Feedback delay networks (FDNs) are recursive filters, which are widely u...
Signal extraction from a single-channel mixture with additional undesire...
In a recent work on direction-of-arrival (DOA) estimation of multiple
sp...
The difference-to-sum power ratio was proposed and used to suppress wind...
Supervised learning based methods for source localization, being data dr...
A novel multi-channel artificial wind noise generator based on a fluid
d...
The task of estimating the maximum number of concurrent speakers from si...
The problem of multi-speaker localization is formulated as a multi-class...
A convolutional neural network (CNN) based classification method for broad...