Cookie Synchronization: Everything You Always Wanted to Know But Were Afraid to Ask

05/26/2018
by   Panagiotis Papadopoulos, et al.
0

User data is the primary input of digital advertising, the fuel of free Internet as we know it. As a result, web entities invest a lot in elaborate tracking mechanisms to acquire more and more user data that can sell to data markets and advertisers. The primary identification mechanism of web is through cookies, where each entity assigns a userID on the user's side. However, each tracker knows the same user with a different ID. So how can the collected data be sold and merged with the associated user data of the buyer? To address this, Cookie Synchronization (CSync) came to the rescue. CSync facilitates an information sharing channel between third parties that may or may not have direct access to the website the user visits. With CSync, they merge the user data they own in the background, but also reconstruct the browsing history of a user bypassing the same origin policy. In this paper, we perform a first to our knowledge in-depth study of CSync in the wild, using a year-long dataset that includes web browsing activity from 850 real mobile users. Through our study, we aim to understand the characteristics of the CSync protocol and the impact it has to the users privacy. Our results show that 97 CSync: most of them within the first week of their browsing. In addition, the average user receives 1 synchronization per 68 GET requests, and the median userID gets leaked, on average, to 3.5 different online entities. In addition, we see that CSync increases the number of entities that track the user by a factor of 6.7. Finally, we propose a novel, machine learning-based method for CSync detection, which can be effective when the synced IDs are obscured.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/29/2018

Cross-Device Tracking: Systematic Method to Detect and Measure CDT

Online advertising, the backbone of the free Web, has transformed the ma...
research
02/18/2019

Another Brick in the Paywall: The Popularity and Privacy Implications of Paywalls

Funding the production and distribution of quality online content is an ...
research
01/31/2022

Privacy Limitations Of Interest-based Advertising On The Web: A Post-mortem Empirical Analysis Of Google's FLoC

In 2020, Google announced they would disable third-party cookies in the ...
research
11/23/2018

Protecting User Privacy: An Approach for Untraceable Web Browsing History and Unambiguous User Profiles

The overturning of the Internet Privacy Rules by the Federal Communicati...
research
07/30/2019

Clash of the Trackers: Measuring the Evolution of the Online Tracking Ecosystem

Websites are constantly adapting the methods used, and intensity with wh...
research
06/08/2023

On the Robustness of Topics API to a Re-Identification Attack

Web tracking through third-party cookies is considered a threat to users...

Please sign up or login with your details

Forgot password? Click here to reset