Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization

09/17/2021
by   Ahmed Magooda, et al.
11

This paper explores three simple data manipulation techniques (synthesis, augmentation, curriculum) for improving abstractive summarization models without the need for any additional data. We introduce a method of data synthesis with paraphrasing, a data augmentation technique with sample mixing, and curriculum learning with two new difficulty metrics based on specificity and abstractiveness. We conduct experiments to show that these three techniques can help improve abstractive summarization across two summarization models and two different small datasets. Furthermore, we show that these techniques can improve performance when applied in isolation and when combined.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2021

Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning

Few-shot text classification is a fundamental NLP task in which a model ...
research
09/10/2021

Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning

We introduce EfficientCL, a memory-efficient continual pretraining metho...
research
12/20/2022

End to End Generative Meta Curriculum Learning For Medical Data Augmentation

Current medical image synthetic augmentation techniques rely on intensiv...
research
03/31/2022

SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy

Deep learning based singing voice synthesis (SVS) systems have been demo...
research
08/17/2022

PCC: Paraphrasing with Bottom-k Sampling and Cyclic Learning for Curriculum Data Augmentation

Curriculum Data Augmentation (CDA) improves neural models by presenting ...
research
02/02/2023

Curriculum-Guided Abstractive Summarization

Recent Transformer-based summarization models have provided a promising ...
research
06/01/2023

Improving the Robustness of Summarization Systems with Dual Augmentation

A robust summarization system should be able to capture the gist of the ...

Please sign up or login with your details

Forgot password? Click here to reset