Natural Scene Recognition Based on Superpixels and Deep Boltzmann Machines

06/24/2015
by Jinfu Yang, et al.

The Deep Boltzmann Machine (DBM) is a state-of-the-art unsupervised learning model that has been successfully applied to handwritten digit recognition and object recognition. However, the DBM is of limited use in scene recognition because natural scene images are usually very large. In this paper, an efficient scene recognition approach is proposed based on superpixels and DBMs. First, the simple linear iterative clustering (SLIC) algorithm is employed to generate superpixels of the input images, where each superpixel is regarded as an input to the learning model. Then, a two-layer DBM is constructed by stacking two restricted Boltzmann machines (RBMs), and a greedy layer-wise algorithm is applied to train it. Finally, softmax regression is used to categorize the scene images. The proposed technique can effectively reduce the computational complexity and enhance the performance of large natural image recognition. The approach is evaluated by extensive experiments on the fifteen-scene categories dataset, the UIUC eight-sports dataset, and the SIFT flow dataset. The experimental results show that the proposed approach outperforms other state-of-the-art methods in terms of recognition rate.
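The pipeline described in the abstract (superpixel inputs, two stacked RBMs trained greedily, then softmax regression) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes each superpixel has already been reduced to a fixed-length vector with values in [0, 1], and it uses plain one-step contrastive divergence (CD-1) on a stack of RBMs as a simplified stand-in for DBM training. All names (`RBM`, `pretrain_stack`, `train_softmax`, `predict`), layer sizes, and hyperparameters are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Binary RBM trained with one-step contrastive divergence (CD-1)."""

    def __init__(self, n_visible, n_hidden, rng):
        self.W = 0.01 * rng.standard_normal((n_visible, n_hidden))
        self.b = np.zeros(n_visible)  # visible bias
        self.c = np.zeros(n_hidden)   # hidden bias
        self.rng = rng

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.c)

    def visible_probs(self, h):
        return sigmoid(h @ self.W.T + self.b)

    def cd1_step(self, v0, lr=0.1):
        # Positive phase: hidden activations driven by the data.
        h0 = self.hidden_probs(v0)
        h_sample = (self.rng.random(h0.shape) < h0).astype(float)
        # Negative phase: one reconstruction step.
        v1 = self.visible_probs(h_sample)
        h1 = self.hidden_probs(v1)
        n = v0.shape[0]
        self.W += lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b += lr * (v0 - v1).mean(axis=0)
        self.c += lr * (h0 - h1).mean(axis=0)

def pretrain_stack(X, layer_sizes, epochs=20, seed=0):
    """Greedy layer-wise training: each RBM learns from the layer below."""
    rng = np.random.default_rng(seed)
    rbms, data = [], X
    for n_hidden in layer_sizes:
        rbm = RBM(data.shape[1], n_hidden, rng)
        for _ in range(epochs):
            rbm.cd1_step(data)
        rbms.append(rbm)
        data = rbm.hidden_probs(data)  # features fed to the next layer
    return rbms, data

def train_softmax(F, y, n_classes, epochs=200, lr=0.5):
    """Softmax regression on the top-layer features, by gradient descent."""
    W = np.zeros((F.shape[1], n_classes))
    b = np.zeros(n_classes)
    Y = np.eye(n_classes)[y]  # one-hot labels
    for _ in range(epochs):
        logits = F @ W + b
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        P = np.exp(logits)
        P /= P.sum(axis=1, keepdims=True)
        W -= lr * F.T @ (P - Y) / len(y)
        b -= lr * (P - Y).mean(axis=0)
    return W, b

def predict(F, W, b):
    return np.argmax(F @ W + b, axis=1)
```

With a feature matrix `X` (one row per superpixel) and integer labels `y`, a call such as `rbms, F = pretrain_stack(X, [64, 32])` followed by `W, b = train_softmax(F, y, n_classes)` runs the three stages in order. How per-superpixel outputs are combined into an image-level label is not stated in the abstract and is left out here.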


research
06/24/2015

A Novel Feature Extraction Method for Scene Recognition Based on Centered Convolutional Restricted Boltzmann Machines

Scene recognition is an important research topic in computer vision, whi...
research
07/22/2018

Deep Discriminative Model for Video Classification

This paper presents a new deep learning approach for video-based scene c...
research
10/16/2017

What is (missing or wrong) in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

Scene models allow robots to reason about what is in the scene, what els...
research
11/09/2022

Portmanteauing Features for Scene Text Recognition

Scene text images have different shapes and are subjected to various dis...
research
08/01/2021

BORM: Bayesian Object Relation Model for Indoor Scene Recognition

Scene recognition is a fundamental task in robotic perception. For human...
research
12/31/2017

Restricted Boltzmann Machines for Robust and Fast Latent Truth Discovery

We address the problem of latent truth discovery, LTD for short, where t...
research
02/06/2016

A Deep Learning Approach to Unsupervised Ensemble Learning

We show how deep learning methods can be applied in the context of crowd...
