Robot vision often involves a large computational load due to large imag...
Spatial filters can exploit deep-learning-based speech enhancement model...
Transformers have recently achieved state-of-the-art performance in spee...
This paper introduces the Fast Cross-Correlation (FCC) method for Time
D...
This paper introduces SMP-PHAT, which performs direction of arrival (DoA...
This paper introduces SmartBelt, a wearable microphone array on a belt t...
Transformers have enabled major improvements in deep learning. They ofte...
Recent work on monaural source separation has shown that performance can...
In recent years, deep learning based source separation has achieved
impr...
This paper introduces a new method referred to as KISS-GEV (for Keep It ...
We propose a novel low-complexity lidar gesture recognition system for m...
SpeechBrain is an open-source and all-in-one speech toolkit. It is desig...
As telecommunications technology progresses, telehealth frameworks are
b...
Artificial audition aims at providing hearing capabilities to machines,
...
We present a system for localizing sound sources in a room with several
...
This paper introduces BIRD, the Big Impulse Response Dataset. This open
...
In dynamic environments, performance of visual SLAM techniques can be
im...
A microphone array can provide a mobile robot with the capability of
loc...
This paper proposes a straightforward 2-D method to spatially calibrate ...
This paper proposes sound event localization and detection methods from
...
This paper introduces a variant of the Singular Value Decomposition with...
Human-robot interaction in natural settings requires filtering out the
d...
This paper investigates the accuracy of various Generalized Cross-Correl...
This paper introduces a new localization method called SVD-PHAT. The SVD...
Speech recognizers trained on close-talking speech do not generalize to
...