Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

01/13/2021
by   Xiaoyi Zhang, et al.
0

Many accessibility features available on mobile platforms require applications (apps) to provide complete and accurate metadata describing user interface (UI) components. Unfortunately, many apps do not provide sufficient metadata for accessibility features to work as expected. In this paper, we explore inferring accessibility metadata for mobile apps from their pixels, as the visual interfaces often best reflect an app's full functionality. We trained a robust, fast, memory-efficient, on-device model to detect UI elements using a dataset of 77,637 screens (from 4,068 iPhone apps) that we collected and annotated. To further improve UI detections and add semantic information, we introduced heuristics (e.g., UI grouping and ordering) and additional models (e.g., recognize UI content, state, interactivity). We built Screen Recognition to generate accessibility metadata to augment iOS VoiceOver. In a study with 9 screen reader users, we validated that our approach improves the accessibility of existing mobile apps, enabling even previously inaccessible apps to be used.

READ FULL TEXT

page 2

page 8

page 10

research
08/17/2023

Never-ending Learning of User Interfaces

Machine learning models have been trained to predict semantic informatio...
research
09/15/2017

AndroZoo++: Collecting Millions of Android Apps and Their Metadata for the Research Community

We present a growing collection of Android apps collected from several s...
research
01/03/2019

Guigle: A GUI Search Engine for Android Apps

The process of developing a mobile application typically starts with the...
research
06/07/2023

Enhancing Virtual Assistant Intelligence: Precise Area Targeting for Instance-level User Intents beyond Metadata

Virtual assistants have been widely used by mobile phone users in recent...
research
11/07/2020

Identifying interception possibilities for WhatsApp communication

On a daily basis, law enforcement officers struggle with suspects using ...
research
02/28/2019

Modeling Mobile Interface Tappability Using Crowdsourcing and Deep Learning

Tapping is an immensely important gesture in mobile touchscreen interfac...
research
12/11/2022

Authoring Platform for Mobile Citizen Science Apps with Client-side ML

Data collection is an integral part of any citizen science project. Give...

Please sign up or login with your details

Forgot password? Click here to reset