Gaurav Sahu

research

∙ 07/18/2023

Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

We present the Multi-Modal Discussion Transformer (mDT), a novel multi-m...

0 Liam Hebert, et al. ∙

research

∙ 12/20/2022

Future Sight: Dynamic Story Generation with Large Pretrained Language Models

Recent advances in deep learning research, such as transformers, have bo...

0 Brian D. Zimmerman, et al. ∙

research

∙ 10/27/2022

LyricJam Sonic: A Generative System for Real-Time Composition and Musical Improvisation

Electronic music artists and sound designers have unique workflow practi...

0 Olga Vechtomova, et al. ∙

research

∙ 04/05/2022

Data Augmentation for Intent Classification with Off-the-shelf Large Language Models

Data augmentation is a widely employed technique to alleviate the proble...

13 Gaurav Sahu, et al. ∙

research

∙ 11/11/2021

Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management

In this paper, we present an approach for predicting trust links between...

0 Alexandre Parmentier, et al. ∙

research

∙ 06/03/2021

LyricJam: A system for generating lyrics for live instrumental music

We describe a real-time system that receives a live audio stream from a ...

15 Olga Vechtomova, et al. ∙

research

∙ 05/03/2021

Towards A Multi-agent System for Online Hate Speech Detection

This paper envisions a multi-agent system for detecting the presence of ...

0 Gaurav Sahu, et al. ∙

research

∙ 09/30/2020

Generation of lyrics lines conditioned on music audio clips

We present a system for generating novel lyrics lines conditioned on mus...

0 Olga Vechtomova, et al. ∙

research

∙ 11/10/2019

Dynamic Fusion for Multimodal Data

Effective fusion of data from multiple modalities, such as video, speech...

0 Gaurav Sahu, et al. ∙

research

∙ 11/10/2019

Conditional Response Generation Using Variational Alignment

Generating relevant/conditioned responses in dialog is challenging, and ...

0 Kashif Khan, et al. ∙

research

∙ 04/12/2019

Multimodal Speech Emotion Recognition and Ambiguity Resolution

Identifying emotion from speech is a non-trivial task pertaining to the ...

0 Gaurav Sahu, et al. ∙

research

∙ 09/05/2018

Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit

The configurational information in sentences of a free word order langua...

0 Amrith Krishna, et al. ∙

