We present Spatial LibriSpeech, a spatial audio dataset with over 650 ho...
Synthesizing natural head motion to accompany speech for an embodied
con...
Generating realistic lip motions to simulate speech production is key fo...
Robust speech recognition is a key prerequisite for semantic feature
ext...
We present an introspection of an audiovisual speech enhancement model. ...
Bipolar disorder, a severe chronic mental illness characterized by
patho...
Various psychological factors affect how individuals express emotions. Y...
Emotion recognition algorithms rely on data annotated with high quality
...
This work focuses on the use of acoustic cues for modeling turn-taking i...
The goal of continuous emotion recognition is to assign an emotion value...