While depression has been studied via multimodal non-verbal behavioural ...
Driver Monitoring Systems (DMSs) are crucial for safe hand-over actions ...
Heterogeneous graphs provide a compact, efficient, and scalable way to m...
We explore the efficacy of multimodal behavioral cues for explainable
pr...
Driver distractions are known to be the dominant cause of road accidents...
Perception of auditory events is inherently multimodal relying on both a...
Active speaker detection (ASD) in videos with multiple speakers is a
cha...
State-of-the-art crowd counting models follow an encoder-decoder approac...
Large scale databases with high-quality manual annotations are scarce in...
We demonstrate the utility of elementary head-motion units termed kineme...
We address the problem of active speaker detection through a new framewo...
We introduce the problem of multi-camera trajectory forecasting (MCTF), ...
The mainstream image captioning models rely on Convolutional Neural Netw...
An effective approach to automated movie content analysis involves build...
We propose a computational framework for ranking images (group photos in...
We introduce the task of multi-camera trajectory forecasting (MCTF), whe...
We present an audio-visual multimodal approach for the task of zeroshot
...
This paper introduces the problem of multiple object forecasting (MOF), ...
We introduce the problem of learning affective correspondence between au...
We investigate the effect and usefulness of spontaneity in speech (i.e.
...
Public speaking is an important aspect of human communication and
intera...
A successful approach to image quality assessment involves comparing the...
A new line of research uses compression methods to measure the similarit...