Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context

06/11/2020
by Hankyol Lee, et al.

We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which uses conversational context to generate meaningful training samples. We also mitigate the problem of unbalanced context lengths by changing the model's input-output format so that it handles contexts of varying length effectively. Our proposed model, trained with this augmentation technique, participated in the sarcasm detection task of FigLang2020 and won, achieving the best performance on both the Reddit and Twitter datasets.
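The abstract does not spell out the input-output format, so the following is only a minimal sketch of one common way to present conversations of varying length to a classifier: keep the most recent turns and join them with the response into a single string. The function name `build_input`, the separator string `SEP`, and the `max_turns` parameter are all hypothetical and are not taken from the paper.

```python
from typing import List

# Hypothetical separator; real models define their own special tokens.
SEP = " [SEP] "


def build_input(context: List[str], response: str, max_turns: int = 3) -> str:
    """Join the most recent context turns and the response into one string.

    This is an illustrative assumption, not the authors' actual format.
    """
    if max_turns < 0:
        raise ValueError("max_turns must be non-negative")
    # Keep only the last `max_turns` turns so long and short threads
    # produce inputs of comparable size. Slicing with -0 would return
    # the whole list, so the zero case is handled explicitly.
    recent = context[-max_turns:] if max_turns > 0 else []
    return SEP.join(recent + [response])


short_thread = build_input(
    ["Nice weather today."], "Oh sure, I love getting soaked."
)
long_thread = build_input(["a", "b", "c", "d", "e"], "f", max_turns=2)
```

Under this assumed format, a one-turn thread and a five-turn thread both reduce to a bounded number of segments, which is one simple way to keep context length from dominating the input.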


