Stay on Topic, Please: Aligning User Comments to the Content of a News Article

03/03/2021
by   Jumanah Alshehri, et al.
0

Social scientists have shown that up to 50 article have no relation to its journalistic content. In this study we propose a classification algorithm to categorize user comments posted to a new article base don their alignment to its content. The alignment seek to match user comments to an article based on similarity off content, entities in discussion, and topic. We proposed a BERTAC, BAERT-based approach that learn jointly article-comment embeddings and infers the relevance class of comments. We introduce an ordinal classification loss that penalizes the difference between the predicted and true label. We conduct a thorough study to show influence of the proposed loss on the learning process. The results on five representative news outlets show that our approach can learn the comment class with up to 36 average accuracy improvement compering to the baselines, and up to 25 compering to the BA-BC model. BA-BC is out approach that consists of two models aimed to capture dis-jointly the formal language of news articles and the informal language of comments. We also conduct a user study to evaluate human labeling performance to understand the difficulty of the classification task. The user agreement on comment-article alignment is "moderate" per Krippendorff's alpha score, which suggests that the classification task is difficult.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

08/14/2020

Cannot Predict Comment Volume of a News Article before (a few) Users Read It

Many news outlets allow users to contribute comments on topics about dai...
07/28/2018

Analyzing Uncivil Speech Provocation and Implicit Topics in Online Political News

Online news has made dissemination of information a faster and more effi...
09/13/2018

Unsupervised Machine Commenting with Neural Variational Topic Model

Article comments can provide supplementary opinions and facts for reader...
06/04/2019

Coherent Comment Generation for Chinese Articles with a Graph-to-Sequence Model

Automatic article commenting is helpful in encouraging user engagement a...
08/11/2017

Improved Abusive Comment Moderation with User Embeddings

Experimenting with a dataset of approximately 1.6M user comments from a ...
02/13/2021

Generating Diversified Comments via Reader-Aware Topic Modeling and Saliency Detection

Automatic comment generation is a special and challenging task to verify...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.