Guess What's on my Screen? Clustering Smartphone Screenshots with Active Learning

01/09/2019
by Agnese Chiatti, et al.

A significant proportion of individuals' daily activities is experienced through digital devices. Smartphones in particular have become one of the preferred interfaces for content consumption and social interaction. Identifying the content embedded in frequently captured smartphone screenshots is thus a crucial prerequisite to studies of media behavior and health intervention planning that analyze activity interplay and content switching over time. Screenshot images can depict heterogeneous contents and applications, making the a priori definition of adequate taxonomies a cumbersome task, even for humans. Because the sensitive data captured on screens must be kept private, the annotation effort cannot be crowd-sourced, and the costs associated with manual annotation are large. Thus, there is a need to examine the utility of unsupervised and semi-supervised methods for digital screenshot classification. This work examines the implications of applying clustering to large screenshot sets when only a limited number of labels is available. In this paper we develop a framework that combines K-Means clustering with Active Learning to leverage labeled and unlabeled samples efficiently, with the goal of discovering latent classes and describing a large collection of screenshot data. We test whether SVM-embedded or XGBoost-embedded solutions for class probability propagation yield better-formed cluster configurations. Visual and textual vector representations of the screenshot images are derived and combined to assess the relative contribution of multi-modal features to the overall performance.
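The general idea of pairing clustering with a query step can be illustrated with a small sketch. It is not the authors' pipeline: the abstract does not specify how features are fused or how samples are selected, so this example assumes simple concatenation of visual and textual vectors and replaces the SVM or XGBoost class probabilities with a distance margin between the two nearest K-Means centroids. The function names `combine`, `kmeans`, and `query_order`, and the toy data, are all hypothetical.

```python
import math
import random


def combine(visual, textual):
    """Fuse modalities by concatenating each image's two feature vectors (assumed)."""
    return [list(v) + list(t) for v, t in zip(visual, textual)]


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's algorithm; returns (centroids, cluster index per point)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]

    def nearest(p):
        return min(range(k), key=lambda c: math.dist(p, centroids[c]))

    for _ in range(iters):
        assign = [nearest(p) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, [nearest(p) for p in points]


def query_order(points, centroids, labeled):
    """Rank unlabeled samples, most ambiguous first.

    Ambiguity is the gap between the distances to the two nearest
    centroids; this stands in for classifier-based class probabilities.
    """
    def margin(p):
        d = sorted(math.dist(p, c) for c in centroids)
        return d[1] - d[0]

    unlabeled = [i for i in range(len(points)) if i not in labeled]
    return sorted(unlabeled, key=lambda i: margin(points[i]))


# Toy data: two tight groups plus one sample lying between them.
visual = [[0.0], [0.1], [1.0], [0.9], [0.5]]
textual = [[0.0], [0.1], [1.0], [1.1], [0.55]]
points = combine(visual, textual)

centroids, assignment = kmeans(points, k=2)
labeled = {0, 2}  # indices an annotator has already labeled
order = query_order(points, centroids, labeled)
print("next sample to send for labeling:", order[0])
```

On this toy data the in-between sample (index 4) is ranked first, since it is the one whose cluster membership is least certain. In a fuller pipeline the margin would presumably be replaced by probabilities from a classifier trained on the labeled subset, and the loop would repeat after each batch of new labels.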

