Weakly supervised collective feature learning from curated media

02/13/2018
by Yusuke Mukuta, et al.

The current state of the art in feature learning relies on supervised learning on large-scale datasets consisting of target content items and their respective category labels. However, constructing such large-scale, fully labeled datasets generally requires painstaking manual effort. One possible solution to this problem is to employ community-contributed text tags as weak labels; however, the concept underlying a single text tag depends strongly on the user who assigned it. We instead present a new paradigm for learning discriminative features by making full use of the human curation process on social networking services (SNSs). During content curation, SNS users manually collect content items from various sources and group them by context, all for their own benefit. Given the nature of this process, we can assume that (1) content items in the same group share the same semantic concept and (2) groups sharing the same images might have related semantic concepts. Based on these insights, we can treat human-curated groups as weak labels from which our proposed framework learns discriminative features as a representation in the space of semantic concepts that the users intended when creating the groups. We show that this feature learning can be formulated as a link prediction problem on a bipartite graph whose nodes correspond to content items and human-curated groups, and we propose a novel method for feature learning based on sparse coding or network fine-tuning.
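To make the bipartite link prediction formulation concrete, the following is a minimal sketch and not the method proposed in the paper. It swaps in a simple logistic bilinear model in place of sparse coding or network fine-tuning. The toy descriptors `X`, the four curated groups, and the function names `build_bipartite_adjacency`, `learn_link_features`, and `link_probabilities` are all assumptions made for illustration.

```python
import numpy as np


def build_bipartite_adjacency(groups, n_items):
    """Return an (n_items, n_groups) 0/1 matrix with A[i, g] = 1 if item i is in group g."""
    A = np.zeros((n_items, len(groups)))
    for g, members in enumerate(groups):
        A[list(members), g] = 1.0
    return A


def learn_link_features(X, A, dim=2, lr=0.5, epochs=500, seed=0):
    """Fit an item projection W and group embeddings V by gradient descent so that
    sigmoid((X @ W) @ V.T) approximates the observed item-group links A."""
    rng = np.random.default_rng(seed)
    W = 0.1 * rng.standard_normal((X.shape[1], dim))
    V = 0.1 * rng.standard_normal((A.shape[1], dim))
    for _ in range(epochs):
        Z = X @ W                                  # learned item features
        P = 1.0 / (1.0 + np.exp(-(Z @ V.T)))       # predicted link probabilities
        G = (P - A) / A.size                       # gradient of mean logistic loss w.r.t. scores
        grad_W = X.T @ (G @ V)
        grad_V = G.T @ Z
        W -= lr * grad_W
        V -= lr * grad_V
    return W, V


def link_probabilities(X, W, V):
    """Predicted probability that each item belongs to each group."""
    return 1.0 / (1.0 + np.exp(-((X @ W) @ V.T)))


# Hypothetical toy data: six items with 2-D raw descriptors forming two visual clusters.
X = np.array([[1.0, 0.0], [1.0, 0.1], [0.9, 0.0],
              [0.0, 1.0], [0.1, 1.0], [0.0, 0.9]])
# Four hypothetical curated groups; groups 0 and 1 share item 1, groups 2 and 3 share item 4.
groups = [{0, 1}, {1, 2}, {3, 4}, {4, 5}]

A = build_bipartite_adjacency(groups, n_items=6)
W, V = learn_link_features(X, A)
Z = X @ W                      # features learned from the weak group labels
P = link_probabilities(X, W, V)

# Item 2 was never placed in group 0, but it resembles that group's members,
# so its predicted link should score higher than that of an unrelated item.
print("P(item 2 in group 0):", P[2, 0])
print("P(item 5 in group 0):", P[5, 0])
```

In this sketch the rows of `Z` play the role of the learned features: they are trained only from group membership, with no category labels. A linear projection is used purely to keep the example short; any differentiable feature extractor could take its place under the same objective.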


