Identifying Experts in Question & Answer Portals: A Case Study on Data Science Competencies in Reddit

04/08/2022
by Sofia Strukova, et al.

The irreplaceable key to the triumph of Question & Answer (Q&A) platforms is their users providing high-quality answers to the challenging questions posted across various topics of interest. Recently, the expert finding problem has attracted much attention in information retrieval research. In this work, we inspect the feasibility of a supervised learning model to identify data science experts in Reddit. Our method is based on manual coding results in which two data science experts labelled comments as expert, non-expert or out-of-scope. We present a semi-supervised approach using the activity behaviour of every user, including Natural Language Processing (NLP), crowdsourced and user feature sets. We conclude that the NLP and user feature sets contribute the most to the better identification of these three classes. This suggests that the method can generalise well within the domain. Moreover, we present different types of users, which can help to detect such user types in the future.
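As a rough illustration of how the three feature sets named above might be assembled per comment, the following is a minimal sketch and not the authors' implementation. The `Comment` fields, the individual feature names and the `build_vector` helper are all hypothetical; the abstract does not specify which concrete features each set contains.

```python
from dataclasses import dataclass

# The three classes named in the abstract.
LABELS = ("expert", "non-expert", "out-of-scope")


@dataclass
class Comment:
    text: str
    score: int                 # hypothetical crowdsourced signal (net votes)
    author_comment_count: int  # hypothetical user-activity signal


def nlp_features(c: Comment) -> dict:
    # Hypothetical text-derived features; real NLP features would be richer.
    tokens = c.text.split()
    avg_len = sum(len(t) for t in tokens) / len(tokens) if tokens else 0.0
    return {"nlp_token_count": float(len(tokens)), "nlp_avg_token_len": avg_len}


def crowd_features(c: Comment) -> dict:
    return {"crowd_score": float(c.score)}


def user_features(c: Comment) -> dict:
    return {"user_comment_count": float(c.author_comment_count)}


FEATURE_SETS = {
    "nlp": nlp_features,
    "crowdsourced": crowd_features,
    "user": user_features,
}


def build_vector(c: Comment, enabled=("nlp", "crowdsourced", "user")) -> dict:
    """Merge the enabled feature sets into one named feature vector."""
    features = {}
    for name in enabled:
        features.update(FEATURE_SETS[name](c))
    return features
```

Keeping each feature set behind its own function makes it straightforward to train a classifier on a subset (for example `enabled=("nlp", "user")`) and compare how much each set contributes to separating the three classes.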

