Unsupervised Joint Mining of Deep Features and Image Labels for Large-scale Radiology Image Categorization and Scene Recognition

01/23/2017
by   Xiaosong Wang, et al.
0

The recent rapid and tremendous success of deep convolutional neural networks (CNN) on many challenging computer vision tasks largely derives from the accessibility of the well-annotated ImageNet and PASCAL VOC datasets. Nevertheless, unsupervised image categorization (i.e., without the ground-truth labeling) is much less investigated, yet critically important and difficult when annotations are extremely hard to obtain in the conventional way of "Google Search" and crowd sourcing. We address this problem by presenting a looped deep pseudo-task optimization (LDPO) framework for joint mining of deep CNN features and image labels. Our method is conceptually simple and rests upon the hypothesized "convergence" of better labels leading to better trained CNN models which in turn feed more discriminative image representations to facilitate more meaningful clusters/labels. Our proposed method is validated in tackling two important applications: 1) Large-scale medical image annotation has always been a prohibitively expensive and easily-biased task even for well-trained radiologists. Significantly better image categorization results are achieved via our proposed approach compared to the previous state-of-the-art method. 2) Unsupervised scene recognition on representative and publicly available datasets with our proposed technique is examined. The LDPO achieves excellent quantitative scene classification results. On the MIT indoor scene dataset, it attains a clustering accuracy of 75.3 the state-of-the-art supervised classification accuracy of 81.0 based on the VGG-VD model).

READ FULL TEXT
research
03/25/2016

Unsupervised Category Discovery via Looped Deep Pseudo-Task Optimization Using a Large Scale Radiology Image Database

Obtaining semantic labels on a large scale radiology image database (215...
research
09/30/2021

DeepMCAT: Large-Scale Deep Clustering for Medical Image Categorization

In recent years, the research landscape of machine learning in medical i...
research
12/25/2017

Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression

Despite recent progress, computational visual aesthetic is still challen...
research
10/11/2019

Shooting Labels: 3D Semantic Labeling by Virtual Reality

Availability of a few, large-size, annotated datasets, like ImageNet, Pa...
research
03/18/2019

An End-to-End Joint Unsupervised Learning of Deep Model and Pseudo-Classes for Remote Sensing Scene Representation

This work develops a novel end-to-end deep unsupervised learning method ...
research
03/29/2019

Local Aggregation for Unsupervised Learning of Visual Embeddings

Unsupervised approaches to learning in neural networks are of substantia...

Please sign up or login with your details

Forgot password? Click here to reset