Robust video scene classification models should capture the spatial
(pix...
Deep neural networks have been shown to perform poorly on adversarial
ex...
The introduction of Transformer model has led to tremendous advancements...
Multi-modal machine learning (ML) models can process data in multiple
mo...
Multimodal ML models can process data in multiple modalities (e.g., vide...
Emotion recognition is a classic field of research with a typical setup
...
Generative Adversarial Networks (GANs) have gained a lot of attention fr...
Sentiment classification involves quantifying the affective reaction of ...
Recently, generative adversarial networks and adversarial autoencoders h...