Discovering Visual Concept Structure with Sparse and Incomplete Tags

05/30/2017
by   Jingya Wang, et al.
0

Discovering automatically the semantic structure of tagged visual data (e.g. web videos and images) is important for visual data analysis and interpretation, enabling the machine intelligence for effectively processing the fast-growing amount of multi-media data. However, this is non-trivial due to the need for jointly learning underlying correlations between heterogeneous visual and tag data. The task is made more challenging by inherently sparse and incomplete tags. In this work, we develop a method for modelling the inherent visual data concept structures based on a novel Hierarchical-Multi-Label Random Forest model capable of correlating structured visual and tag information so as to more accurately interpret the visual semantics, e.g. disclosing meaningful visual groups with similar high-level concepts, and recovering missing tags for individual visual data samples. Specifically, our model exploits hierarchically structured tags of different semantic abstractness and multiple tag statistical correlations in addition to modelling visual and tag interactions. As a result, our model is able to discover more accurate semantic correlation between textual tags and visual features, and finally providing favourable visual semantics interpretation even with highly sparse and incomplete tags. We demonstrate the advantages of our proposed approach in two fundamental applications, visual data clustering and missing tag completion, on benchmarking video (i.e. TRECVID MED 2011) and image (i.e. NUS-WIDE) datasets.

READ FULL TEXT

page 2

page 5

page 9

page 12

page 14

page 17

research
09/26/2017

Understanding Infographics through Textual and Visual Tag Prediction

We introduce the problem of visual hashtag discovery for infographics: e...
research
08/16/2020

From Lost to Found: Discover Missing UI Design Semantics through Recovering Missing Tags

Design sharing sites provide UI designers with a platform to share their...
research
01/13/2015

Learning from Multiple Sources for Video Summarisation

Many visual surveillance tasks, e.g.video summarisation, is conventional...
research
08/22/2017

Tags2Parts: Discovering Semantic Regions from Shape Tags

We propose a novel method for discovering shape regions that strongly co...
research
01/12/2016

Subspace Clustering Based Tag Sharing for Inductive Tag Matrix Refinement with Complex Errors

Annotating images with tags is useful for indexing and retrieving images...
research
07/30/2013

Hybrid Affinity Propagation

In this paper, we address a problem of managing tagged images with hybri...
research
12/18/2012

A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics

This paper investigates the problem of modeling Internet images and asso...

Please sign up or login with your details

Forgot password? Click here to reset