Match^2: A Matching over Matching Model for Similar Question Identification

06/21/2020
by Zizhen Wang, et al.

Community Question Answering (CQA) has become a primary means for people to acquire knowledge: anyone is free to ask questions or submit answers. To make the service more efficient, similar question identification has become a core task in CQA; it aims to find a similar question in the archived repository whenever a new question is asked. However, properly measuring the similarity between two questions has long been a challenge because of the inherent variation of natural language: the same question can be asked in different ways, and different questions can share similar expressions. To alleviate this problem, it is natural to use the existing answers to enrich the archived questions. Traditional methods typically make one-sided use of the answer, treating it as an expanded representation of the corresponding question. Unfortunately, because answers are often long and diverse, this may introduce unexpected noise into the similarity computation and lead to inferior performance. In this work, we propose a two-sided use, which leverages the answer as a bridge between the two questions. The key idea is based on our observation that similar questions can be addressed by similar parts of the answer, while different questions may not. In other words, we can compare the matching patterns of the two questions over the same answer to measure their similarity. We therefore propose a novel matching-over-matching model, namely Match^2, which compares the matching patterns between two question-answer pairs for similar question identification. Empirical experiments on two benchmark datasets demonstrate that our model significantly outperforms previous state-of-the-art methods on the similar question identification task.
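The idea of comparing two questions through their matching patterns over a shared answer can be illustrated with a toy sketch. This is not the Match^2 model itself: exact token overlap stands in for a learned question-answer matching function, cosine similarity stands in for the pattern comparison, and every function name here (`tokenize`, `matching_pattern`, `pattern_similarity`) is a hypothetical helper introduced only for illustration.

```python
import math


def tokenize(text):
    """Lowercase, split on whitespace, and strip basic punctuation."""
    tokens = (t.strip(".,?!").lower() for t in text.split())
    return [t for t in tokens if t]


def matching_pattern(question, answer_tokens):
    """One score per answer token: 1.0 if the token occurs in the question.

    Assumption: exact overlap is a crude stand-in for a learned matcher.
    """
    question_tokens = set(tokenize(question))
    return [1.0 if tok in question_tokens else 0.0 for tok in answer_tokens]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def pattern_similarity(question_a, question_b, answer):
    """Compare how two questions match the same answer (the 'bridge')."""
    answer_tokens = tokenize(answer)
    pattern_a = matching_pattern(question_a, answer_tokens)
    pattern_b = matching_pattern(question_b, answer_tokens)
    return cosine(pattern_a, pattern_b)


if __name__ == "__main__":
    # Made-up example texts, not taken from any benchmark dataset.
    answer = "Restart the router then reset the password in the admin panel"
    archived = "How do I reset my router password?"
    paraphrase = "How can I reset the password of my router?"
    different = "Where is the admin panel?"
    print(pattern_similarity(archived, paraphrase, answer))
    print(pattern_similarity(archived, different, answer))
```

In this toy setting, the two paraphrased questions touch the same parts of the answer and so score higher than the unrelated pair. Common words such as "the" still add noise to the overlap patterns, which hints at why a learned matching function would be preferable to exact overlap.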

08/04/2020

Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs

People increasingly search online for answers to their medical questions...
11/04/2020

Answer Identification in Collaborative Organizational Group Chat

We present a simple unsupervised approach for answer identification in o...
04/22/2018

Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching

Community-based question answering (CQA) websites represent an important...
05/21/2021

GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers

Toward a computer-assisted marking for descriptive math questions, this p...
10/07/2017

Group Sparse CNNs for Question Classification with Answer Sets

Question classification is an important task with wide applications. How...
10/09/2019

Domain-Relevant Embeddings for Medical Question Similarity

The rate at which medical questions are asked online significantly excee...
12/17/2019

Knowledge-Enhanced Attentive Learning for Answer Selection in Community Question Answering Systems

In the community question answering (CQA) system, the answer selection t...