SimCSum: Joint Learning of Simplification and Cross-lingual Summarization for Cross-lingual Science Journalism

04/04/2023
by   Mehwish Fatima, et al.
0

Cross-lingual science journalism generates popular science stories of scientific articles different from the source language for a non-expert audience. Hence, a cross-lingual popular summary must contain the salient content of the input document, and the content should be coherent, comprehensible, and in a local language for the targeted audience. We improve these aspects of cross-lingual summary generation by joint training of two high-level NLP tasks, simplification and cross-lingual summarization. The former task reduces linguistic complexity, and the latter focuses on cross-lingual abstractive summarization. We propose a novel multi-task architecture - SimCSum consisting of one shared encoder and two parallel decoders jointly learning simplification and cross-lingual summarization. We empirically investigate the performance of SimCSum by comparing it with several strong baselines over several evaluation metrics and by human evaluation. Overall, SimCSum demonstrates statistically significant improvements over the state-of-the-art on two non-synthetic cross-lingual scientific datasets. Furthermore, we conduct an in-depth investigation into the linguistic properties of generated summaries and an error analysis.

READ FULL TEXT

page 14

page 15

page 16

page 17

page 18

research
04/23/2022

WikiMulti: a Corpus for Cross-Lingual Summarization

Cross-lingual summarization (CLS) is the task to produce a summary in on...
research
07/08/2023

Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation

Most existing cross-lingual summarization (CLS) work constructs CLS corp...
research
03/31/2021

A Neighbourhood Framework for Resource-Lean Content Flagging

We propose a novel interpretable framework for cross-lingual content fla...
research
05/23/2023

μPLAN: Summarizing using a Content Plan as Cross-Lingual Bridge

Cross-lingual summarization consists of generating a summary in one lang...
research
11/06/2022

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

With the recent developments in cross-lingual Text-to-Speech (TTS) syste...
research
06/22/2023

Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation

While summarization has been extensively researched in natural language ...
research
03/23/2022

A Survey on Cross-Lingual Summarization

Cross-lingual summarization is the task of generating a summary in one l...

Please sign up or login with your details

Forgot password? Click here to reset