Studying Ranking-Incentivized Web Dynamics

05/28/2020
by Ziv Vasilisky, et al.

The ranking incentives of many Web page authors play an important role in Web dynamics. Authors who want their pages to be highly ranked for queries of interest often respond to the rankings for these queries by modifying their pages, with the goal of improving the pages' future rankings. Various theoretical aspects of these dynamics have recently been studied using game theory. However, empirical analysis of the dynamics is highly constrained by the lack of publicly available datasets. We present an initial such dataset, based on TREC's ClueWeb09 dataset. Specifically, we used the Wayback Machine of the Internet Archive to build a document collection that contains past snapshots of ClueWeb documents that are highly ranked by an initial search performed for ClueWeb queries. Temporal analysis of document changes in this dataset reveals that findings recently presented for small-scale controlled ranking competitions between documents' authors also hold for Web data. Specifically, documents' authors tend to mimic the content of documents that were highly ranked in the past, and this practice can result in improved ranking.
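
To make the mimicking finding concrete, the following is a minimal sketch of one way to test whether a document moved toward a previously top-ranked document between two snapshots. The abstract does not say which similarity measure the authors used, so the cosine similarity over term-frequency vectors, the function names, and the toy snapshot texts here are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Lowercased bag-of-words term-frequency vector (assumed representation)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_shift(doc_before, doc_after, past_winner):
    """Change in similarity to a past top-ranked document across two snapshots.

    A positive value means the later snapshot is closer to the past winner
    than the earlier snapshot was.
    """
    winner_vec = term_vector(past_winner)
    before_sim = cosine(term_vector(doc_before), winner_vec)
    after_sim = cosine(term_vector(doc_after), winner_vec)
    return after_sim - before_sim


# Toy, made-up snapshots; these are not taken from the dataset.
winner = "cheap flights to paris book cheap flights"
before = "travel blog about europe"
after = "travel blog cheap flights to paris"

shift = similarity_shift(before, after, winner)
print(f"similarity shift toward past winner: {shift:.3f}")
```

A positive shift for a document, followed by a better rank in the next snapshot, would be consistent with the pattern the abstract reports. A real analysis would also need to control for changes unrelated to the past winner.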

