A strong baseline for question relevancy ranking

The best systems at the SemEval-16 and SemEval-17 community question answering shared tasks (a task that amounts to question relevancy ranking) involve complex pipelines and manual feature engineering. Despite this, many of them still fail to beat the IR baseline, i.e., the rankings provided by Google's search engine. We present a strong baseline for question relevancy ranking by training a simple multi-task feed-forward network on a bag of 14 distance measures for the input question pair. This baseline model, which is fast to train and uses only language-independent features, outperforms the best shared task systems on the task of retrieving relevant previously asked questions.
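To make the idea of a "bag of distance measures" for a question pair concrete, the following is a minimal sketch of how such a feature vector could be built. The abstract does not list the 14 measures, so the three used here (Jaccard distance, cosine distance over token counts, and relative length difference) are illustrative assumptions, as are the function names and the whitespace tokenizer. The feed-forward network that would consume these features is not shown.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, chosen only because it
    # needs no language-specific resources.
    return text.lower().split()


def jaccard_distance(a, b):
    # 1 - |intersection| / |union| over the two token sets.
    sa, sb = set(a), set(b)
    union = sa | sb
    if not union:
        return 0.0
    return 1.0 - len(sa & sb) / len(union)


def cosine_distance(a, b):
    # 1 - cosine similarity between raw token-count vectors.
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca)
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def length_difference(a, b):
    # Absolute difference in token count, scaled to [0, 1].
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return abs(len(a) - len(b)) / longest


def distance_features(q1, q2):
    # Hypothetical helper: returns the feature vector for one question pair.
    # Illustrative subset only, not the 14 measures used in the paper.
    a, b = tokenize(q1), tokenize(q2)
    return [
        jaccard_distance(a, b),
        cosine_distance(a, b),
        length_difference(a, b),
    ]
```

Calling `distance_features(new_question, candidate_question)` for each candidate yields one fixed-length vector per pair, which is the kind of input a small feed-forward ranker can be trained on.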


