DeepTitle – Leveraging BERT to generate Search Engine Optimized Headlines

07/22/2021
by   Cristian Anastasiu, et al.

Automated headline generation for online news articles is not a trivial task: machine-generated titles need to be grammatically correct and informative, capture attention, and generate search traffic without being "clickbait" or "fake news". In this paper we showcase how a pre-trained language model can be leveraged to create an abstractive news headline generator for the German language. We incorporate state-of-the-art fine-tuning techniques for abstractive text summarization, i.e., we use different optimizers for the encoder and decoder, where the former is pre-trained and the latter is trained from scratch. We modify the headline generation to incorporate frequently sought keywords relevant for search engine optimization. We conduct experiments on a German news data set and achieve a ROUGE-L-gram F-score of 40.02. Furthermore, we address the limitations of ROUGE for measuring the quality of text summarization by introducing a sentence similarity metric and human evaluation.
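For readers unfamiliar with the metric, the following is a minimal sketch of a ROUGE-L F-score based on the longest common subsequence (LCS) between a generated headline and a reference headline. It is an illustration under stated assumptions, not the evaluation code used in the paper: whitespace tokenization, lowercasing, and the balanced F1 weighting are assumptions, the function names `lcs_length` and `rouge_l_f1` are hypothetical, and the example headlines are made up. The result lies in [0, 1]; reported scores such as 40.02 are presumably on a 0 to 100 scale.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Tokens match: extend the LCS found without either token.
                curr.append(prev[j - 1] + 1)
            else:
                # No match: carry over the better of the two neighbours.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate, reference):
    """ROUGE-L F1 between two strings (assumed: lowercase, whitespace tokens)."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)  # share of candidate tokens in the LCS
    recall = lcs / len(ref)      # share of reference tokens in the LCS
    return 2 * precision * recall / (precision + recall)


# Made-up example headlines, for illustration only.
generated = "Bundestag beschliesst neues Gesetz"
reference = "Bundestag beschliesst neues Klimagesetz"
score = rouge_l_f1(generated, reference)
print(f"ROUGE-L F1: {score:.2f}")
```

Because the score only rewards token overlap in order, two headlines with the same meaning but different wording score low, which is the kind of limitation the sentence similarity metric and human evaluation are meant to address.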

