Demographic differences in search engine use with implications for cohort selection

05/15/2018
by Elad Yom-Tov, et al.

The correlation between the demographics of users and the text they write has been investigated through literary texts and, more recently, social media. However, differences in language use in search engines have not been thoroughly analyzed, especially with respect to age and gender. Such differences are especially important given the growing use of search engine data in the study of human health, where queries are used to identify patient populations. Using data from multiple general-purpose Internet search engines gathered over a period of one month, we investigate the correlation between demography (age, gender, and income) and the text of queries submitted to search engines. Our results show that females and younger people use longer queries. This difference is such that females make approximately 25 more words. In the case of queries which identify users as having specific medical conditions we find that females make 50 and that this results in patient cohorts which are highly skewed in gender and age, compared to known gender balance. Our results indicate that in studies where demographic representation is important, such as studies of the health aspects of users or evaluations of search engine fairness, care should be taken in the selection of search engine data so as to create a representative dataset.

research
05/03/2021

The Matter of Chance: Auditing Web Search Results Related to the 2020 U.S. Presidential Primary Elections Across Six Search Engines

We examine how six search engines filter and rank information in relatio...
research
12/11/2017

Interactions between Health Searchers and Search Engines

The Web is an important resource for understanding and diagnosing medica...
research
10/26/2020

The Age-related Differences in Web Information Search Process

Older adults' need for quality health information has never been more cr...
research
12/02/2019

An Investigation of Biases in Web Search Engine Query Suggestions

Survey-based studies suggest that search engines are trusted more than s...
research
08/23/2022

Don't Take it Personally: Analyzing Gender and Age Differences in Ratings of Online Humor

Computational humor detection systems rarely model the subjectivity of h...
research
04/25/2023

Patterns of gender-specializing query reformulation

Users of search systems often reformulate their queries by adding query ...
research
06/28/2022

Fire Dragon and Unicorn Princess; Gender Stereotypes and Children's Products in Search Engine Responses

Search engines in e-commerce settings allow users to search, browse, and...
