Comment Ranking Diversification in Forum Discussions

02/27/2020
by   Curtis G. Northcutt, et al.
0

Viewing consumption of discussion forums with hundreds or more comments depends on ranking because most users only view top-ranked comments. When comments are ranked by an ordered score (e.g. number of replies or up-votes) without adjusting for semantic similarity of near-ranked comments, top-ranked comments are more likely to emphasize the majority opinion and incur redundancy. In this paper, we propose a top K comment diversification re-ranking model using Maximal Marginal Relevance (MMR) and evaluate its impact in three categories: (1) semantic diversity, (2) inclusion of the semantics of lower-ranked comments, and (3) redundancy, within the context of a HarvardX course discussion forum. We conducted a double-blind, small-scale evaluation experiment requiring subjects to select between the top 5 comments of a diversified ranking and a baseline ranking ordered by score. For three subjects, across 100 trials, subjects selected the diversified (75 diversification) ranking as significantly (1) more diverse, (2) more inclusive, and (3) less redundant. Within each category, inter-rater reliability showed moderate consistency, with typical Cohen-Kappa scores near 0.2. Our findings suggest that our model improves (1) diversification, (2) inclusion, and (3) redundancy, among top K ranked comments in online discussion forums.

READ FULL TEXT
research
04/01/2019

Does the hiding mechanism for Stack Overflow comments work well? No!

Stack Overflow has accumulated millions of answers. Informative comments...
research
03/28/2021

Extractive Summarization of Related Bug-fixing Comments in Support of Bug Repair

When developers investigate a new bug report, they search for similar pr...
research
09/18/2018

Improving Moderation of Online Discussions via Interpretable Neural Models

Growing amount of comments make online discussions difficult to moderate...
research
07/08/2015

Talking to the crowd: What do people react to in online discussions?

This paper addresses the question of how language use affects community ...
research
06/12/2018

Deep Learning to Detect Redundant Method Comments

Comments in software are critical for maintenance and reuse. But apart f...
research
03/27/2022

Budget-Constrained Reinforcement of Ranked Objects

Commercial entries, such as hotels, are ranked according to score by a s...
research
04/20/2017

Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads

This paper addresses the problem of predicting popularity of comments in...

Please sign up or login with your details

Forgot password? Click here to reset