Soft Seeded SSL Graphs for Unsupervised Semantic Similarity-based Retrieval

12/15/2017
by Avikalp Srivastava, et al.

Semantic similarity-based retrieval plays an increasingly important role in many IR systems, such as modern web search, question answering, and similar-document retrieval. Improvements in retrieving semantically similar content matter directly to applications such as Quora, Stack Overflow, and Siri. We propose a novel unsupervised model for semantic similarity-based content retrieval: we construct a semantic flow graph for each query and introduce the concept of "soft seeding" in graph-based semi-supervised learning (SSL), which converts the approach into an unsupervised model. We demonstrate the model's effectiveness on an equivalent-question retrieval problem on the Stack Exchange QA dataset, where our unsupervised approach significantly outperforms state-of-the-art unsupervised models and produces results comparable to the best supervised models. Our research provides a method for tackling semantic similarity-based retrieval without any training data, and it extends seamlessly to QA communities in different domains as well as to other semantic equivalence tasks.
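The abstract does not spell out how the semantic flow graphs are built or how soft seeds are assigned, so the following is only a generic illustration of graph-based score propagation with non-binary seed values, not the authors' method. The function name `propagate`, the damping parameter `alpha`, the toy edge weights, and the seed values are all assumptions made for this sketch. The idea it shows: instead of hard labels from training data, each node starts from a graded confidence score, and scores flow along weighted edges until they settle.

```python
def propagate(weights, seeds, alpha=0.8, iters=50):
    """Generic label propagation (illustrative, not the paper's algorithm).

    Repeats f <- alpha * P f + (1 - alpha) * seeds, where P is the
    row-normalized version of the edge-weight matrix `weights`.
    """
    n = len(seeds)
    # Row-normalize the edge weights so each row of P sums to 1 (or 0).
    P = []
    for row in weights:
        total = sum(row)
        P.append([w / total if total else 0.0 for w in row])
    f = list(seeds)
    for _ in range(iters):
        f = [
            alpha * sum(P[i][j] * f[j] for j in range(n))
            + (1 - alpha) * seeds[i]
            for i in range(n)
        ]
    return f


# Hypothetical toy graph: node 0 is the query, nodes 1-3 are candidates.
weights = [
    [0.0, 0.9, 0.1, 0.0],
    [0.9, 0.0, 0.5, 0.0],
    [0.1, 0.5, 0.0, 0.2],
    [0.0, 0.0, 0.2, 0.0],
]
# "Soft" seeds: graded confidences rather than hard 0/1 labels (assumed values).
seeds = [1.0, 0.6, 0.1, 0.0]

scores = propagate(weights, seeds)
# Rank the candidates (nodes 1-3) by their propagated score, best first.
ranking = sorted(range(1, 4), key=lambda i: scores[i], reverse=True)
print(ranking)
```

In this toy setup, node 3 starts with a seed of zero and is connected only to node 2, yet it still ends up with a positive score because mass reaches it through the graph. That is the general property that makes propagation useful when no labeled training data is available.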


research
05/08/2019

FAQ Retrieval using Query-Question Similarity and BERT-Based Query-Answer Relevance

Frequently Asked Question (FAQ) retrieval is an important task where the...
research
04/23/2020

TCNN: Triple Convolutional Neural Network Models for Retrieval-based Question Answering System in E-commerce

Automatic question-answering (QA) systems have boomed during last few ye...
research
07/05/2018

Sanity Check: A Strong Alignment and Information Retrieval Baseline for Question Answering

While increasingly complex approaches to question answering (QA) have be...
research
05/04/2020

Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering

Evidence retrieval is a critical stage of question answering (QA), neces...
research
03/09/2018

An Unsupervised Model with Attention Autoencoders for Question Retrieval

Question retrieval is a crucial subtask for community question answering...
research
11/16/2021

QA4PRF: A Question Answering based Framework for Pseudo Relevance Feedback

Pseudo relevance feedback (PRF) automatically performs query expansion b...
research
05/04/2020

Semi-supervised lung nodule retrieval

Content based image retrieval (CBIR) provides the clinician with visual ...
