We introduce M3-AUDIODEC, an innovative neural spatial audio codec desig...
We introduce region-customizable sound extraction (ReZero), a general an...
Echo cancellation and noise reduction are essential for full-duplex
comm...
This paper summarizes the cinematic demixing (CDX) track of the Sound
De...
This paper summarizes the music demixing (MDX) track of the Sound Demixi...
Thanks to the rapid development of diffusion models, unprecedented progr...
Node classifiers are required to comprehensively reduce prediction error...
Large language models (LLMs) can achieve highly effective performance on...
Modern neural-network-based speech processing systems are typically requ...
A universal and flexible design method for freeform surface that can mod...
For node classification, Graph Neural Networks (GNN) assign predefined l...
Usually, lesions are not isolated but are associated with the surroundin...
The performance of music source separation (MSS) models has been greatly...
Choral music separation refers to the task of extracting tracks of voice...
Exposure to bio-aerosols such as mold spores and pollen can lead to adve...
We report deep learning-based design of a massively parallel broadband
d...
The training of modern speech processing systems often requires a large
...
Classification of an object behind a random and unknown scattering mediu...
Most of the recent neural source separation systems rely on a masking-ba...
While the performance of offline neural speech separation systems has be...
High-resolution synthesis/projection of images over a large field-of-vie...
Privacy protection is a growing concern in the digital era, with machine...
Imaging through diffusive media is a challenging problem, where the exis...
Quantum key distribution (QKD) networks is expected to provide
informati...
The key towards learning informative node representations in graphs lies...
Frequency-domain neural beamformers are the mainstream methods for recen...
Recent advances in the design of neural network architectures, in partic...
Owing to its potential advantages such as scalability, low latency and p...
Nowadays, Graph Neural Networks (GNNs) following the Message Passing par...
Various volatile aerosols have been associated with adverse health effec...
The continuous speech separation (CSS) is a task to separate the speech
...
Conventional Supervised Learning approaches focus on the mapping from in...
Leveraging additional speaker information to facilitate speech separatio...
Ultra-lightweight model design is an important topic for the deployment ...
Various neural network architectures have been proposed in recent years ...
Modules in all existing speech separation networks can be categorized in...
Model size and complexity remain the biggest challenges in the deploymen...
This work presented a new drone-based face detection dataset Drone LAMS ...
Multi-speaker speech recognition of unsegmented recordings has diverse
a...
Multi-speaker speech recognition has been one of the keychallenges in
co...
Polarized light microscopy provides high contrast to birefringent specim...
Recent advances in deep learning have been providing non-intuitive solut...
Machine vision systems mostly rely on lens-based optical imaging
archite...
Many recent source separation systems are designed to separate a fixed n...
Deep learning speech separation algorithms have achieved great success i...
This paper describes a dataset and protocols for evaluating continuous s...
An important problem in ad-hoc microphone speech separation is how to
gu...
Recent studies in deep learning-based speech separation have proven the
...
Beamforming has been extensively investigated for multi-channel audio
pr...
Reconciliation is a crucial procedure in post-processing of continuous
v...