Evaluating Elements of Web-based Data Enrichment for Pseudo-Relevance Feedback Retrieval

03/10/2022
by Timo Breuer, et al.

In this work, we analyze a pseudo-relevance feedback retrieval method based on the results of web search engines. By enriching topics with text data from web search engine result pages and linked contents, we train topic-specific and cost-efficient classifiers that can be used to search test collections for relevant documents. Building upon attempts initially made at TREC Common Core 2018 by Grossman and Cormack, we address questions of system performance over time considering different search engines, queries, and test collections. Our experimental results show how and to what extent the considered components affect the retrieval performance. Overall, the analyzed method is robust in terms of average retrieval performance and a promising way to use web content for the data enrichment of relevance feedback methods.
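
To make the general idea concrete, the following is a minimal sketch and not the implementation evaluated in the paper. It assumes that texts gathered from the web for a topic serve as pseudo-positive examples, that a handful of collection documents serve as pseudo-negative examples, and that a small logistic regression over normalized term counts scores the collection. All names (`tokenize`, `features`, `train_logreg`, `rank`), the toy texts, and the choice of features and hyperparameters are hypothetical; fetching search engine result pages is left out entirely.

```python
import math
import random
from collections import Counter


def tokenize(text):
    # Lowercased whitespace tokens; purely alphabetic tokens only.
    return [t for t in text.lower().split() if t.isalpha()]


def features(text):
    # L2-normalized term counts as a sparse dict (an assumed feature choice).
    counts = Counter(tokenize(text))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {term: c / norm for term, c in counts.items()}


def score(feats, weights, bias):
    return bias + sum(weights.get(t, 0.0) * v for t, v in feats.items())


def train_logreg(examples, epochs=50, lr=0.5, seed=0):
    # Plain stochastic gradient descent on the logistic loss.
    rng = random.Random(seed)
    examples = list(examples)
    weights, bias = {}, 0.0
    for _ in range(epochs):
        rng.shuffle(examples)
        for feats, label in examples:
            z = max(-30.0, min(30.0, score(feats, weights, bias)))
            grad = 1.0 / (1.0 + math.exp(-z)) - label
            bias -= lr * grad
            for t, v in feats.items():
                weights[t] = weights.get(t, 0.0) - lr * grad * v
    return weights, bias


def rank(collection, weights, bias):
    # Order document ids by descending classifier score.
    scored = [(score(features(text), weights, bias), doc_id)
              for doc_id, text in collection.items()]
    return [doc_id for _, doc_id in sorted(scored, reverse=True)]


# Hypothetical stand-in for text scraped from result pages for one topic.
web_texts = [
    "wind turbines generate renewable electricity",
    "offshore wind farms supply clean power",
]

# Hypothetical toy test collection.
collection = {
    "d1": "new offshore wind farms generate electricity for coastal towns",
    "d2": "the football club won the league title",
    "d3": "stock markets fell after the bank report",
    "d4": "recipe for tomato soup with basil",
}

# Assumption: pseudo-negatives are picked by hand here; a real setup
# would more likely draw a random sample from the collection.
negatives = [collection["d2"], collection["d3"]]

training = [(features(t), 1) for t in web_texts]
training += [(features(t), 0) for t in negatives]

weights, bias = train_logreg(training)
ranking = rank(collection, weights, bias)
print(ranking)
```

In this toy setup `d1` is ranked first because it is the only document sharing vocabulary with the web texts. One such classifier would be trained per topic, which is what keeps the approach cheap: no relevance judgments from the test collection are needed for training.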

research
12/23/2021

Biased or Not?: The Story of Two Search Engines

Search engines can be considered as a gate to the world of WEB, and they...
research
11/02/2022

Relevance Assessments for Web Search Evaluation: Should We Randomise or Prioritise the Pooled Documents? (CORRECTED VERSION)

In the context of depth-k pooling for constructing web search test colle...
research
01/31/2023

Archive TimeLine Summarization (ATLS): Conceptual Framework for Timeline Generation over Historical Document Collections

Archive collections are nowadays mostly available through search engines...
research
01/04/2013

Similarity Assessment through blocking and affordance assignment in Textual CBR

It has been conceived that children learn new objects through their affo...
research
12/03/2018

Modeling Temporal Evidence from External Collections

Newsworthy events are broadcast through multiple mediums and prompt the ...
research
03/01/2019

On Application of Learning to Rank for E-Commerce Search

E-Commerce (E-Com) search is an emerging important new application of in...
research
08/10/2012

Analysis of Statistical Hypothesis based Learning Mechanism for Faster Crawling

The growth of world-wide-web (WWW) spreads its wings from an intangible ...
