Scene Recognition by Combining Local and Global Image Descriptors

02/21/2017
by Jobin Wilson, et al.

Object recognition is an important problem in computer vision with diverse applications. In this work, we construct an end-to-end scene recognition pipeline consisting of feature extraction, encoding, pooling and classification. Our approach uses global and local feature descriptors together to form a hybrid feature descriptor for each image. We use DAISY features associated with key points within images as our local feature descriptor and a histogram of oriented gradients (HOG) computed over the entire image as our global descriptor. We use a bag-of-visual-words encoding and apply the Mini-Batch K-Means algorithm to reduce the complexity of our feature encoding scheme. A 2-level pooling procedure combines the DAISY and HOG features corresponding to each image. Finally, we experiment with a multi-class SVM classifier with several kernels, in a cross-validation setting, and tabulate our results on the fifteen scene categories dataset. The average accuracy of our model was 76.4 training and testing datasets respectively. The primary objective of this work is to clearly outline the practical implementation of a basic scene-recognition pipeline with reasonable accuracy, in Python, using open-source libraries. A full implementation of the proposed model is available in our GitHub repository.
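To make the encoding step concrete, the following is a minimal sketch of a bag-of-visual-words histogram combined with a global descriptor. It is not the authors' implementation. The function names `bovw_histogram` and `hybrid_descriptor`, the toy codebook, and the toy descriptor values are all hypothetical. In the pipeline described above, the codebook would come from Mini-Batch K-Means run over DAISY descriptors and the global vector would be a HOG descriptor. Plain concatenation stands in for the 2-level pooling procedure, whose details are not given here.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def bovw_histogram(descriptors, codebook):
    """Hard-assign each local descriptor to its nearest visual word and
    return an L1-normalized histogram of word counts."""
    counts = [0] * len(codebook)
    for d in descriptors:
        # Index of the closest codebook centre (visual word).
        nearest = min(range(len(codebook)), key=lambda k: dist(d, codebook[k]))
        counts[nearest] += 1
    total = sum(counts)
    if total == 0:
        # No local descriptors found: return an all-zero histogram.
        return [0.0] * len(codebook)
    return [c / total for c in counts]


def hybrid_descriptor(local_descriptors, global_descriptor, codebook):
    """Concatenate the encoded local features with the global descriptor.
    Assumption: simple concatenation, not the paper's exact pooling."""
    return bovw_histogram(local_descriptors, codebook) + list(global_descriptor)


# Toy data standing in for a learned codebook, DAISY descriptors and HOG.
codebook = [(0.0, 0.0), (1.0, 1.0), (4.0, 4.0)]
local = [(0.1, 0.0), (0.9, 1.2), (1.1, 0.8), (3.9, 4.2)]
global_desc = [0.5, 0.25]

feature = hybrid_descriptor(local, global_desc, codebook)
print(feature)
```

The resulting fixed-length vector is what a multi-class classifier such as an SVM would take as input, regardless of how many key points each image produced.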

