Analysing and strengthening OpenWPM's reliability

05/18/2022
by   Benjamin Krumnow, et al.
0

Automated browsers are widely used to study the web at scale. Their premise is that they measure what regular browsers would encounter on the web. In practice, deviations due to detection of automation have been found. To what extent automated browsers can be improved to reduce such deviations has so far not been investigated in detail. In this paper, we investigate this for a specific web automation framework: OpenWPM, a popular research framework specifically designed to study web privacy. We analyse (1) detectability of OpenWPM, (2) prevalence of OpenWPM detection, and (3) integrity of OpenWPM's data recording. Our analysis reveals OpenWPM is easily detectable. We measure to what extent fingerprint-based detection is already leveraged against OpenWPM clients on 100,000 sites and observe that it is commonly detected ( 14 Moreover, we discover integrated routines in scripts to specifically detect OpenWPM clients. Our investigation of OpenWPM's data recording integrity identifies novel evasion techniques and previously unknown attacks against OpenWPM's instrumentation. We investigate and develop mitigations to address the identified issues. In conclusion, we find that reliability of automation frameworks should not be taken for granted. Identifiability of such frameworks should be studied, and mitigations deployed, to improve reliability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/03/2023

The Rise of Disappearing Frameworks in Web Development

The evolution of the web can be characterized as an emergence of framewo...
research
12/21/2019

Cached and Confused: Web Cache Deception in the Wild

Web cache deception (WCD) is an attack proposed in 2017, where an attack...
research
06/21/2019

Joint Detection of Malicious Domains and Infected Clients

Detection of malware-infected computers and detection of malicious web d...
research
08/03/2022

Image-based Detection of Surface Defects in Concrete during Construction

Defects increase the cost and duration of construction projects. Automat...
research
03/14/2018

Machine learning-assisted virtual patching of web applications

Web applications are permanently being exposed to attacks that exploit t...
research
08/10/2023

DiLogics: Creating Web Automation Programs With Diverse Logics

Knowledge workers frequently encounter repetitive web data entry tasks, ...

Please sign up or login with your details

Forgot password? Click here to reset