Structure and Stability of Internet Top Lists

02/07/2018
by   Quirin Scheitle, et al.
0

Active Internet measurement studies rely on a list of targets to be scanned. While probing the entire IPv4 address space is feasible for scans of limited complexity, more complex scans do not scale to measuring the full Internet. Thus, a sample of the Internet can be used instead, often in form of a "top list". The most widely used list is the Alexa Global Top1M list. Despite their prevalence, use of top lists is seldomly questioned. Little is known about their creation, representativity, potential biases, stability, or overlap between lists. As a result, potential consequences of applying top lists in research are not known. In this study, we aim to open the discussion on top lists by investigating the aptness of frequently used top lists for empirical Internet scans, including stability, correlation, and potential biases of such lists.

READ FULL TEXT

page 1

page 2

research
05/29/2018

A Long Way to the Top: Significance, Structure, and Stability of Internet Top Lists

A broad range of research areas including Internet measurement, privacy,...
research
11/30/2021

Lists of Top Artists to Watch computed algorithmically

Lists of top artists to watch are periodically published by various art ...
research
05/12/2020

List homomorphism problems for signed graphs

We consider homomorphisms of signed graphs from a computational perspect...
research
06/04/2018

Automatically Generating a Large, Culture-Specific Blocklist for China

Internet censorship measurements rely on lists of websites to be tested,...
research
09/14/2021

Optimal To-Do List Gamification for Long Term Planning

Most people struggle with prioritizing work. While inexact heuristics ha...
research
04/13/2015

Egyptian Dialect Stopword List Generation from Social Network Data

This paper proposes a methodology for generating a stopword list from on...
research
02/22/2017

Guided Deep List: Automating the Generation of Epidemiological Line Lists from Open Sources

Real-time monitoring and responses to emerging public health threats rel...

Please sign up or login with your details

Forgot password? Click here to reset