Word Embedding based Correlation Model for Question/Answer Matching

11/15/2015
by   Yikang Shen, et al.
0

With the development of community based question answering (Q&A) services, a large scale of Q&A archives have been accumulated and are an important information and knowledge resource on the web. Question and answer matching has been attached much importance to for its ability to reuse knowledge stored in these systems: it can be useful in enhancing user experience with recurrent questions. In this paper, we try to improve the matching accuracy by overcoming the lexical gap between question and answer pairs. A Word Embedding based Correlation (WEC) model is proposed by integrating advantages of both the translation model and word embedding, given a random pair of words, WEC can score their co-occurrence probability in Q&A pairs and it can also leverage the continuity and smoothness of continuous space word representation to deal with new pairs of words that are rare in the training parallel text. An experimental study on Yahoo! Answers dataset and Baidu Zhidao dataset shows this new method's promising potential.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2019

Are we asking the right questions in MovieQA?

Joint vision and language tasks like visual question answering are fasci...
research
04/22/2018

Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching

Community-based question answering (CQA) websites represent an important...
research
11/12/2016

Training IBM Watson using Automatically Generated Question-Answer Pairs

IBM Watson is a cognitive computing system capable of question answering...
research
11/22/2019

Joint Learning of Answer Selection and Answer Summary Generation in Community Question Answering

Community question answering (CQA) gains increasing popularity in both a...
research
04/21/2020

Word Embedding-based Text Processing for Comprehensive Summarization and Distinct Information Extraction

In this paper, we propose two automated text processing frameworks speci...
research
01/23/2023

Breaking the Boundaries of Knowledge Space: Analyzing the Knowledge Spanning on the Q A Website through Word Embeddings

The challenge of raising a creative question exists in recombining diffe...
research
03/30/2021

Representing ELMo embeddings as two-dimensional text online

We describe a new addition to the WebVectors toolkit which is used to se...

Please sign up or login with your details

Forgot password? Click here to reset