Duplicate Question Retrieval and Confirmation Time Prediction in Software Communities

09/10/2023
by Rima Hazra, et al.

Community Question Answering (CQA) across domains is growing rapidly, driven by the availability of several platforms and the large volume of information shared among users. With this growth, the massive amount of archived data makes it difficult for moderators to retrieve possible duplicates for a new question and to identify and confirm existing question pairs as duplicates in a timely manner. The problem is even more critical in CQA platforms for large software systems such as askubuntu, where moderators need to be experts to recognize a question as a duplicate. The prime challenge on such platforms is that the moderators, being experts themselves, are usually extremely busy and their time is expensive. To assist the moderators, in this work we tackle two significant problems for the askubuntu CQA platform: (1) retrieval of duplicate questions given a new question and (2) duplicate question confirmation time prediction. In the first task, we focus on retrieving duplicate questions from a question pool for a newly posted question. In the second task, we solve a regression problem to rank question pairs that could potentially take a long time to be confirmed as duplicates. For duplicate question retrieval, we propose a Siamese neural network based approach that exploits both text and network-based features and outperforms several state-of-the-art baseline techniques. Our method outperforms DupPredictor and DUPE by 5 [...]. For confirmation time prediction, we use both standard machine learning models and a neural network, along with text and graph-based features. We obtain Spearman's rank correlations of 0.20 and 0.213 (statistically significant) for text and graph-based features, respectively.
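
The abstract reports Spearman's rank correlation as the evaluation measure for confirmation time prediction. As a minimal sketch of how that measure is computed (this is not the authors' code; the function names and the sample values below are hypothetical and chosen only for illustration), the correlation is the Pearson correlation of the two rank vectors, with tied values sharing their average rank:

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation computed on the ranks of xs and ys."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical data: observed confirmation times (days) for five question
# pairs and the times a regression model predicted for the same pairs.
actual_days = [2.0, 30.0, 7.0, 120.0, 14.0]
predicted_days = [5.0, 20.0, 3.0, 60.0, 25.0]
rho = spearman(actual_days, predicted_days)
print(f"Spearman's rho = {rho:.3f}")
```

Because only the ordering matters, a model can score well on this measure even when its predicted times are far off in absolute terms, which fits the stated goal of ranking pairs by how long they may take to be confirmed.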


