A Hybrid Supervised-unsupervised Method on Image Topic Visualization with Convolutional Neural Network and LDA

03/15/2017
by   Kai Zhen, et al.
0

Given the progress in image recognition with recent data driven paradigms, it's still expensive to manually label a large training data to fit a convolutional neural network (CNN) model. This paper proposes a hybrid supervised-unsupervised method combining a pre-trained AlexNet with Latent Dirichlet Allocation (LDA) to extract image topics from both an unlabeled life-logging dataset and the COCO dataset. We generate the bag-of-words representations of an egocentric dataset from the softmax layer of AlexNet and use LDA to visualize the subject's living genre with duplicated images. We use a subset of COCO on 4 categories as ground truth, and define consistent rate to quantitatively analyze the performance of the method, it achieves 84 consistent rate on average comparing to 18.75 is capable of detecting false labels and multi-labels from COCO dataset. For scalability test, parallelization experiments are conducted with Harp-LDA on a Intel Knights Landing cluster: to extract 1,000 topic assignments for 241,035 COCO images, it takes 10 minutes with 60 threads.

READ FULL TEXT

page 1

page 2

page 5

page 6

page 7

page 8

research
01/22/2014

Parsimonious Topic Models with Salient Word Discovery

We propose a parsimonious topic model for text corpora. In related model...
research
03/06/2018

Categorical Mixture Models on VGGNet activations

In this project, I use unsupervised learning techniques in order to clus...
research
04/02/2019

Short Text Classification Improved by Feature Space Extension

With the explosive development of mobile Internet, short text has been a...
research
11/23/2020

LaHAR: Latent Human Activity Recognition using LDA

Processing sequential multi-sensor data becomes important in many tasks ...
research
04/23/2018

Discovering Style Trends through Deep Visually Aware Latent Item Embeddings

In this paper, we explore Latent Dirichlet Allocation (LDA) and Polyling...
research
04/26/2016

Entities as topic labels: Improving topic interpretability and evaluability combining Entity Linking and Labeled LDA

In order to create a corpus exploration method providing topics that are...
research
09/27/2018

Semantic Topic Analysis of Traffic Camera Images

Traffic cameras are commonly deployed monitoring components in road infr...

Please sign up or login with your details

Forgot password? Click here to reset