b'Shah Nawaz'

research

∙ 09/18/2023

Frame-to-Utterance Convergence: A Spectra-Temporal Approach for Unified Spoofing Detection

Voice spoofing attacks pose a significant threat to automated speaker ve...

0 Awais Khan, et al. ∙

research

∙ 07/31/2023

DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement Estimation in Conversation

Conversational engagement estimation is posed as a regression problem, e...

0 Vu Ngoc Tu, et al. ∙

research

∙ 03/10/2023

Single-branch Network for Multimodal Training

With the rapid growth of social media platforms, users are sharing billi...

0 Muhammad Saad Saeed, et al. ∙

research

∙ 02/25/2023

Speaker Recognition in Realistic Scenario Using Multimodal Data

In recent years, an association is established between faces and voices ...

0 Saqlain Hussain Shah, et al. ∙

research

∙ 08/22/2022

Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

Recent years have seen an increased interest in establishing association...

0 Muhammad Saad Saeed, et al. ∙

research

∙ 04/15/2022

Guiding Attention using Partial-Order Relationships for Image Captioning

The use of attention models for automated image captioning has enabled m...

0 Murad Popattia, et al. ∙

research

∙ 01/03/2022

Semantically Grounded Visual Embeddings for Zero-Shot Learning

Zero-shot learning methods rely on fixed visual and semantic embeddings,...

0 Shah Nawaz, et al. ∙

research

∙ 12/20/2021

Fusion and Orthogonal Projection for Improved Face-Voice Association

We study the problem of learning association between face and voice, whi...

12 Muhammad Saad Saeed, et al. ∙

research

∙ 04/28/2020

Cross-modal Speaker Verification and Recognition: A Multilingual Perspective

Recent years have seen a surge in finding association between faces and ...

10 Muhammad Saad Saeed, et al. ∙

research

∙ 09/18/2019

Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals

We propose a novel deep training algorithm for joint representation of a...

0 Shah Nawaz, et al. ∙

research

∙ 09/09/2019

Picture What you Read

Visualization refers to our ability to create an image in our head based...

7 Ignazio Gallo, et al. ∙

research

∙ 09/03/2019

Do Cross Modal Systems Leverage Semantic Relationships?

Current cross-modal retrieval systems are evaluated using R@K measure wh...

0 Shah Nawaz, et al. ∙

research

∙ 04/02/2019

Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition

With massive explosion of social media such as Twitter and Instagram, pe...

0 Omer Arshad, et al. ∙

research

∙ 10/16/2018

Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions

Majority of the current dimensionality reduction or retrieval techniques...

0 Muhammad Kamran Janjua, et al. ∙

research

∙ 10/03/2018

Image and Encoded Text Fusion for Multi-Modal Classification

Multi-modal approaches employ data from multiple input streams such as t...

2 Ignazio Gallo, et al. ∙

research

∙ 08/31/2018

Seeing Colors: Learning Semantic Text Encoding for Classification

The question we answer with this work is: can we convert a text document...

8 Shah Nawaz, et al. ∙

research

∙ 07/23/2018

Git Loss for Deep Face Recognition

Convolutional Neural Networks (CNNs) have been widely used in computer v...

0 Alessandro Calefati, et al. ∙

research

∙ 07/19/2018

Revisiting Cross Modal Retrieval

This paper proposes a cross-modal retrieval system that leverages on ima...

0 Shah Nawaz, et al. ∙

Shah Nawaz

Featured Co-authors

Sign in with Google

Consider DeepAI Pro