Sreyan Ghosh

research

∙ 09/18/2023

RECAP: Retrieval-Augmented Audio Captioning

We present RECAP (REtrieval-Augmented Audio CAPtioning), a novel and eff...

0 Sreyan Ghosh, et al. ∙

research

∙ 08/23/2023

AdVerb: Visually Guided Audio Dereverberation

We present AdVerb, a novel audio-visual dereverberation framework that u...

0 Sanjoy Chowdhury, et al. ∙

research

∙ 08/19/2023

ASPIRE: Language-Guided Augmentation for Robust Image Classification

Neural image classifiers can often learn to make predictions by overly r...

0 Sreyan Ghosh, et al. ∙

research

∙ 06/01/2023

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER

Complex Named Entity Recognition (NER) is the task of detecting linguist...

0 Sreyan Ghosh, et al. ∙

research

∙ 05/18/2023

BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER

Biomedical Named Entity Recognition (BioNER) is the fundamental task of ...

0 Sreyan Ghosh, et al. ∙

research

∙ 03/10/2023

UNFUSED: UNsupervised Finetuning Using SElf supervised Distillation

In this paper, we introduce UnFuSeD, a novel approach to leverage self-s...

0 Ashish Seth, et al. ∙

research

∙ 03/02/2023

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network

The tremendous growth of social media users interacting in online conver...

0 Sreyan Ghosh, et al. ∙

research

∙ 11/27/2022

A novel multimodal dynamic fusion network for disfluency detection in spoken utterances

Disfluency, though originating from human spoken utterances, is primaril...

0 Sreyan Ghosh, et al. ∙

research

∙ 11/02/2022

SLICER: Learning universal audio representations using low-resource self-supervised pre-training

We present a new Self-Supervised Learning (SSL) approach to pre-train en...

0 Ashish Seth, et al. ∙

research

∙ 11/02/2022

MAST: Multiscale Audio Spectrogram Transformers

We present Multiscale Audio Spectrogram Transformer (MAST) for audio cla...

0 Sreyan Ghosh, et al. ∙

research

∙ 11/02/2022

data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

In this paper, we propose a new Self-Supervised Learning (SSL) algorithm...

0 Vasista Sai Lodagala, et al. ∙

research

∙ 10/05/2022

CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations

While Self-Supervised Learning has helped reap the benefit of the scale ...

0 Vasista Sai Lodagala, et al. ∙

research

∙ 03/31/2022

Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition

Self-supervised learning (SSL) to learn high-level speech representation...

0 Lodagala V S V Durga Prasad, et al. ∙

research

∙ 03/31/2022

PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations

While self-supervised speech representation learning (SSL) models serve ...

0 Lodagala V S V Durga Prasad, et al. ∙

research

∙ 03/31/2022

A Discourse Aware Sequence Learning Approach for Emotion Recognition in Conversations

The expression of emotions is a crucial part of daily human communicatio...

0 Sreyan Ghosh, et al. ∙

research

∙ 03/31/2022

MMER: Multimodal Multi-task learning for Emotion Recognition in Spoken Utterances

Emotion Recognition (ER) aims to classify human utterances into differen...

0 Harshvardhan Srivastava, et al. ∙

research

∙ 03/30/2022

Span Classification with Structured Information for Disfluency Detection in Spoken Utterances

Existing approaches in disfluency detection focus on solving a token-lev...

0 Sreyan Ghosh, et al. ∙

research

∙ 03/25/2022

DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning

Inspired by the recent progress in self-supervised learning for computer...

0 Sreyan Ghosh, et al. ∙

research

∙ 12/18/2021

Leveraging Transformers for Hate Speech Detection in Conversational Code-Mixed Tweets

In the current era of the internet, where social media platforms are eas...

13 Zaki Mustafa Farooqi, et al. ∙

research

∙ 10/17/2021

Deep Clustering For General-Purpose Audio Representations

We introduce DECAR, a self-supervised pre-training approach for learning...

0 Sreyan Ghosh, et al. ∙

research

∙ 10/14/2021

Speech Toxicity Analysis: A New Spoken Language Processing Task

Toxic speech, also known as hate speech, is regarded as one of the cruci...

27 Sreyan Ghosh, et al. ∙

research

∙ 05/28/2021

Cisco at SemEval-2021 Task 5: What's Toxic?: Leveraging Transformers for Multiple Toxic Span Extraction from Online Comments

Social network platforms are generally used to share positive, construct...

0 Sreyan Ghosh, et al. ∙

research

∙ 01/10/2021

Cisco at AAAI-CAD21 shared task: Predicting Emphasis in Presentation Slides using Contextualised Embeddings

This paper describes our proposed system for the AAAI-CAD21 shared task:...

0 Sreyan Ghosh, et al. ∙

research

∙ 05/22/2020

End-to-end Named Entity Recognition from English Speech

Named entity recognition (NER) from text has been a widely studied probl...

0 Hemant Yadav, et al. ∙

Sreyan Ghosh

Featured Co-authors

Sign in with Google

Consider DeepAI Pro