We present the Multi-Modal Discussion Transformer (mDT), a novel multi-m...
Recent advances in deep learning research, such as transformers, have bo...
Electronic music artists and sound designers have unique workflow practi...
Data augmentation is a widely employed technique to alleviate the proble...
In this paper, we present an approach for predicting trust links between...
We describe a real-time system that receives a live audio stream from a ...
This paper envisions a multi-agent system for detecting the presence of ...
We present a system for generating novel lyrics lines conditioned on mus...
Effective fusion of data from multiple modalities, such as video, speech...
Generating relevant/conditioned responses in dialog is challenging, and ...
Identifying emotion from speech is a non-trivial task pertaining to the ...
The configurational information in sentences of a free word order langua...