Wikipedia graph mining: dynamic structure of collective memory

10/01/2017
by   Volodymyr Miz, et al.
0

Wikipedia is the biggest encyclopedia ever created and the fifth most visited website in the world. Tens of millions of people surf it every day, seeking answers to various questions. Collective user activity on its pages leaves publicly available footprints of human behavior, making Wikipedia an excellent source for analysis of collective behavior. In this work, we propose a distributed graph-based event extraction model, inspired by the Hebbian learning theory. The model exploits collective effect of the dynamics to discover events. We focus on data-streams with underlying graph structure and perform several large-scale experiments on the Wikipedia visitor activity data. We show that the presented model is scalable regarding time-series length and graph density, providing a distributed implementation of the proposed algorithm. We extract dynamical patterns of collective activity and demonstrate that they correspond to meaningful clusters of associated events, reflected in the Wikipedia articles. We also illustrate evolutionary dynamics of the graphs over time to highlight changing nature of visitors' interests. Finally, we discuss clusters of events that model collective recall process and represent collective memories - common memories shared by a group of people.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2021

Modeling Collective Anticipation and Response on Wikipedia

The dynamics of popularity in online media are driven by a combination o...
research
02/17/2020

What is Trending on Wikipedia? Capturing Trends and Language Biases Across Wikipedia Editions

In this work, we propose an automatic evaluation and comparison of the b...
research
01/15/2017

The Birth of Collective Memories: Analyzing Emerging Entities in Text Streams

We study how collective memories are formed online. We do so by tracking...
research
06/12/2021

Learngene: From Open-World to Your Learning Task

Although deep learning has made significant progress on fixed large-scal...
research
11/14/2022

Between News and History: Identifying Networked Topics of Collective Attention on Wikipedia

The digital information landscape has introduced a new dimension to unde...
research
03/30/2018

Building-up the subject classification system from the collective intelligence

Systematized subject classification is essential for funding and assessi...
research
03/30/2018

Build up of a subject classification system from collective intelligence

Systematized subject classification is essential for funding and assessi...

Please sign up or login with your details

Forgot password? Click here to reset