Improving Wikipedia Verifiability with AI

07/08/2022
by   Fabio Petroni, et al.
12

Verifiability is a core content policy of Wikipedia: claims that are likely to be challenged need to be backed by citations. There are millions of articles available online and thousands of new articles are released each month. For this reason, finding relevant sources is a difficult task: many claims do not have any references that support them. Furthermore, even existing citations might not support a given claim or become obsolete once the original source is updated or deleted. Hence, maintaining and improving the quality of Wikipedia references is an important challenge and there is a pressing need for better tools to assist humans in this effort. Here, we show that the process of improving references can be tackled with the help of artificial intelligence (AI). We develop a neural network based system, called Side, to identify Wikipedia citations that are unlikely to support their claims, and subsequently recommend better ones from the web. We train this model on existing Wikipedia references, therefore learning from the contributions and combined wisdom of thousands of Wikipedia editors. Using crowd-sourcing, we observe that for the top 10 humans prefer our system's suggested alternatives compared to the originally cited reference 70 we built a demo to engage with the English-speaking Wikipedia community and find that Side's first citation recommendation collects over 60 preferences than existing Wikipedia citations for the same top 10 unverifiable claims according to Side. Our results indicate that an AI-based system could be used, in tandem with humans, to improve the verifiability of Wikipedia. More generally, we hope that our work can be used to assist fact checking efforts and increase the general trustworthiness of information online.

READ FULL TEXT

page 2

page 6

page 14

research
01/23/2020

Quantifying Engagement with Citations on Wikipedia

Wikipedia, the free online encyclopedia that anyone can edit, is one of ...
research
02/24/2021

References in Wikipedia: The Editors' Perspective

References are an essential part of Wikipedia. Each statement in Wikiped...
research
10/06/2020

'I Updated the <ref>': The Evolution of References in the English Wikipedia and the Implications for Altmetrics

With this work, we present a publicly available dataset of the history o...
research
08/07/2023

What has ChatGPT read? The origins of archaeological citations used by a generative artificial intelligence application

The public release of ChatGPT has resulted in considerable publicity and...
research
09/20/2021

Assessing the quality of sources in Wikidata across languages: a hybrid approach

Wikidata is one of the most important sources of structured data on the ...
research
12/13/2021

Surfer100: Generating Surveys From Web Resources on Wikipedia-style

Fast-developing fields such as Artificial Intelligence (AI) often outpac...
research
02/28/2019

Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia's Verifiability

Wikipedia is playing an increasingly central role on the web,and the pol...

Please sign up or login with your details

Forgot password? Click here to reset