Deep Neural Networks In Fully Connected CRF For Image Labeling With Social Network Metadata

01/27/2018
by Chengjiang Long, et al.

We propose a novel method for predicting image labels by fusing image content descriptors with the social media context of each image. An image uploaded to a social media site such as Flickr often comes with meaningful associated information, such as comments and other images the user has uploaded, that is complementary to pixel content and helpful in predicting labels. Prediction challenges such as ImageNet [imagenet_cvpr09] and MSCOCO [LinMBHPRDZ:ECCV14] use only pixels, while other methods make predictions purely from social media context [McAuleyECCV12]. Our method is based on a novel fully connected Conditional Random Field (CRF) framework in which each node is an image. The framework consists of two deep Convolutional Neural Networks (CNNs) and one Recurrent Neural Network (RNN) that model the textual and visual information of each node (image). The edge weights of the CRF graph represent textual similarity and link-based metadata such as user sets and image groups. We model the CRF as an RNN for both learning and inference, and incorporate a weighted ranking loss and a cross-entropy loss into the CRF parameter optimization to handle imbalance in the training data. We evaluate the proposed approach on the MIR-9K dataset, where it outperforms current state-of-the-art approaches.
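To make the idea of "modeling the CRF as an RNN" more concrete, the following is a minimal sketch of mean-field inference on a fully connected graph of images, unrolled for a fixed number of iterations. It is not the authors' implementation: the single binary label, the Potts-style pairwise term, and the names `mean_field`, `unary_logits`, and `edge_weights` are all assumptions made for illustration. In the paper's setting the unary scores would come from the CNN/RNN branches and the edge weights from textual similarity and link-based metadata; here both are hand-picked numbers.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def mean_field(unary_logits, edge_weights, num_iters=5):
    """Approximate per-image marginals P(label = 1) for one binary label.

    unary_logits: one score per image (hypothetical stand-in for the
        output of the visual/textual networks).
    edge_weights: symmetric n x n matrix of non-negative affinities
        (hypothetical stand-in for metadata-based similarity).
    num_iters: number of unrolled updates; each update plays the role
        of one recurrent step.
    """
    n = len(unary_logits)
    # Start from the marginals implied by the unary scores alone.
    q = [sigmoid(u) for u in unary_logits]
    for _ in range(num_iters):
        new_q = []
        for i in range(n):
            # Each neighbor j pushes node i toward its own current belief:
            # (2*q[j] - 1) is positive when j leans toward label 1.
            message = sum(
                edge_weights[i][j] * (2.0 * q[j] - 1.0)
                for j in range(n)
                if j != i
            )
            new_q.append(sigmoid(unary_logits[i] + message))
        q = new_q  # parallel update of all nodes
    return q


# Three images: two confident positives and one weakly negative image
# that is strongly connected to both of them.
unary = [2.0, 1.5, -0.2]
weights = [
    [0.0, 1.0, 1.0],
    [1.0, 0.0, 1.0],
    [1.0, 1.0, 0.0],
]
marginals = mean_field(unary, weights)
print([round(m, 3) for m in marginals])
```

In this toy example the third image's marginal ends up above 0.5 even though its own score is slightly negative, which is the intended effect of the pairwise term: context from connected images can override a weak per-image prediction. Because every step is a differentiable function of the unary scores and edge weights, the same loop could in principle be trained end to end with a loss such as cross-entropy, which is the sense in which the inference procedure resembles an RNN.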

