Pointer over Attention: An Improved Bangla Text Summarization Approach Using Hybrid Pointer Generator Network

11/19/2021
by   Nobel Dhar, et al.
0

Despite the success of the neural sequence-to-sequence model for abstractive text summarization, it has a few shortcomings, such as repeating inaccurate factual details and tending to repeat themselves. We propose a hybrid pointer generator network to solve the shortcomings of reproducing factual details inadequately and phrase repetition. We augment the attention-based sequence-to-sequence using a hybrid pointer generator network that can generate Out-of-Vocabulary words and enhance accuracy in reproducing authentic details and a coverage mechanism that discourages repetition. It produces a reasonable-sized output text that preserves the conceptual integrity and factual information of the input article. For evaluation, we primarily employed "BANSData" - a highly adopted publicly available Bengali dataset. Additionally, we prepared a large-scale dataset called "BANS-133" which consists of 133k Bangla news articles associated with human-generated summaries. Experimenting with the proposed model, we achieved ROUGE-1 and ROUGE-2 scores of 0.66, 0.41 for the "BANSData" dataset and 0.67, 0.42 for the BANS-133k" dataset, respectively. We demonstrated that the proposed system surpasses previous state-of-the-art Bengali abstractive summarization techniques and its stability on a larger dataset. "BANS-133" datasets and code-base will be publicly available for research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2017

Get To The Point: Summarization with Pointer-Generator Networks

Neural sequence-to-sequence models have provided a viable new approach f...
research
01/20/2019

Improving generation quality of pointer networks via guided attention

Pointer generator networks have been used successfully for abstractive s...
research
05/31/2021

Reinforced Generative Adversarial Network for Abstractive Text Summarization

Sequence-to-sequence models provide a viable new approach to generative ...
research
10/20/2018

Abstractive Summarization Using Attentive Neural Techniques

In a world of proliferating data, the ability to rapidly summarize text ...
research
05/21/2022

UVA Resources for the Biomedical Vocabulary Alignment at Scale in the UMLS Metathesaurus

The construction and maintenance process of the UMLS (Unified Medical La...
research
09/16/2019

Automatic Generation of Pull Request Descriptions

Enabled by the pull-based development model, developers can easily contr...
research
02/25/2020

A more abstractive summarization model

Pointer-generator network is an extremely popular method of text summari...

Please sign up or login with your details

Forgot password? Click here to reset