WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

01/30/2023
by Jason Wu, et al.

Modeling user interfaces (UIs) from visual information allows systems to make inferences about the functionality and semantics needed to support use cases in accessibility, app automation, and testing. Current datasets for training machine learning models are limited in size because manually collecting and annotating UIs is costly and time-consuming. We crawled the web to construct WebUI, a large dataset of 400,000 rendered web pages associated with automatically extracted metadata. We analyze the composition of WebUI and show that while automatically extracted data is noisy, most examples meet basic criteria for visual UI modeling. We applied several strategies for incorporating semantics found in web pages to increase the performance of visual UI understanding models in the mobile domain, where less labeled data is available, on three tasks: (i) element detection, (ii) screen classification, and (iii) screen similarity.


