A general method for estimating the prevalence of Influenza-Like-Symptoms with Wikipedia data

10/28/2020
by   Giovanni De Toni, et al.
0

Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Being able to estimate in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now possible by leveraging unconventional data sources like web searches and visits. In this study, we show the feasibility of exploiting information about Wikipedia's page views of a selected group of articles and machine learning models to obtain accurate estimates of influenza-like illnesses incidence in four European countries: Italy, Germany, Belgium, and the Netherlands. We propose a novel language-agnostic method, based on two algorithms, Personalized PageRank and CycleRank, to automatically select the most relevant Wikipedia pages to be monitored without the need for expert supervision. We then show how our model is able to reach state-of-the-art results by comparing it with previous solutions.

READ FULL TEXT
research
03/26/2019

Detecting and Gauging Impact on Wikipedia Page Views

Understanding how various external campaigns or events affect readership...
research
10/14/2020

NwQM: A neural quality assessment framework for Wikipedia

Millions of people irrespective of socioeconomic and demographic backgro...
research
10/09/2017

Inspiration, Captivation, and Misdirection: Emergent Properties in Networks of Online Navigation

The World Wide Web (WWW) has fundamentally changed the ways billions of ...
research
09/01/2023

A Comparative Study of Reference Reliability in Multiple Language Editions of Wikipedia

Information presented in Wikipedia articles must be attributable to reli...
research
03/20/2019

A Graph-structured Dataset for Wikipedia Research

Wikipedia is a rich and invaluable source of information. Its central pl...
research
08/30/2023

Publishing Wikipedia usage data with strong privacy guarantees

For almost 20 years, the Wikimedia Foundation has been publishing statis...

Please sign up or login with your details

Forgot password? Click here to reset