Measuring the Importance of User-Generated Content to Search Engines

06/20/2019
by   Nicholas Vincent, et al.
0

Search engines are some of the most popular and profitable intelligent technologies in existence. Recent research, however, has suggested that search engines may be surprisingly dependent on user-created content like Wikipedia articles to address user information needs. In this paper, we perform a rigorous audit of the extent to which Google leverages Wikipedia and other user-generated content to respond to queries. Analyzing results for six types of important queries (e.g. most popular, trending, expensive advertising), we observe that Wikipedia appears in over 80 types and is by far the most prevalent individual content source across all query types. More generally, our results provide empirical information to inform a nascent but rapidly-growing debate surrounding a highly-consequential question: Do users provide enough value to intelligent technologies that they should receive more of the economic benefits from intelligent technologies?

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2020

A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search Engines

A growing body of work has highlighted the important role that Wikipedia...
research
02/15/2021

On the Value of Wikipedia as a Gateway to the Web

By linking to external websites, Wikipedia can act as a gateway to the W...
research
02/17/2017

Why We Read Wikipedia

Wikipedia is one of the most popular sites on the Web, with millions of ...
research
05/22/2023

The Dimensions of Data Labor: A Road Map for Researchers, Activists, and Policymakers to Empower Data Producers

Many recent technological advances (e.g. ChatGPT and search engines) are...
research
10/31/2022

Learning to Navigate Wikipedia by Taking Random Walks

A fundamental ability of an intelligent web-based agent is seeking out a...
research
10/21/2021

Driving the Herd: Search Engines as Content Influencers

In competitive search settings such as the Web, many documents' authors ...
research
12/19/2017

A Production Oriented Approach for Vandalism Detection in Wikidata - The Buffaloberry Vandalism Detector at WSDM Cup 2017

Wikidata is a free and open knowledge base from the Wikimedia Foundation...

Please sign up or login with your details

Forgot password? Click here to reset