A Long Way to the Top: Significance, Structure, and Stability of Internet Top Lists

05/29/2018
by   Quirin Scheitle, et al.
0

A broad range of research areas including Internet measurement, privacy, and network security rely on lists of target domains to be analysed; researchers make use of target lists for reasons of necessity or efficiency. The popular Alexa list of one million domains is a widely used example. Despite their prevalence in research papers, the soundness of top lists has seldom been questioned by the community: little is known about the lists' creation, representativity, potential biases, stability, or overlap between lists. In this study we survey the extent, nature, and evolution of top lists used by research communities. We assess the structure and stability of these lists, and show that rank manipulation is possible for some lists. We also reproduce the results of several scientific studies to assess the impact of using a top list at all, which list specifically, and the date of list creation. We find that (i) top lists generally overestimate results compared to the general population by a significant margin, often even a magnitude, and (ii) some top lists have surprising change characteristics, causing high day-to-day fluctuation and leading to result instability. We conclude our paper with specific recommendations on the use of top lists, and how to interpret results based on top lists with caution.

READ FULL TEXT
research
02/07/2018

Structure and Stability of Internet Top Lists

Active Internet measurement studies rely on a list of targets to be scan...
research
05/12/2020

List homomorphism problems for signed graphs

We consider homomorphisms of signed graphs from a computational perspect...
research
03/01/2006

Towards a better list of citation superstars: compiling a multidisciplinary list of highly cited researchers

A new approach to producing multidisciplinary lists of highly cited rese...
research
01/10/2018

Buying Online - A Characterization of Rational Buying Procedures

In decision theory, an agent chooses from a set of alternatives. When bu...
research
09/14/2021

Optimal To-Do List Gamification for Long Term Planning

Most people struggle with prioritizing work. While inexact heuristics ha...
research
04/12/2021

Measurements of the Most Significant Software Security Weaknesses

In this work, we provide a metric to calculate the most significant soft...
research
06/04/2018

CensorSeeker: Generating a Large, Culture-Specific Blocklist for China

Internet censorship measurements rely on lists of websites to be tested,...

Please sign up or login with your details

Forgot password? Click here to reset