Scene Classification in Indoor Environments for Robots using Context Based Word Embeddings

08/18/2019
by   Bao Xin Chen, et al.
4

Scene Classification has been addressed with numerous techniques in computer vision literature. However, with the increasing number of scene classes in datasets in the field, it has become difficult to achieve high accuracy in the context of robotics. In this paper, we implement an approach which combines traditional deep learning techniques with natural language processing methods to generate a word embedding based Scene Classification algorithm. We use the key idea that context (objects in the scene) of an image should be representative of the scene label meaning a group of objects could assist to predict the scene class. Objects present in the scene are represented by vectors and the images are re-classified based on the objects present in the scene to refine the initial classification by a Convolutional Neural Network (CNN). In our approach we address indoor Scene Classification task using a model trained with a reduced pre-processed version of the Places365 dataset and an empirical analysis is done on a real-world dataset that we built by capturing image sequences using a GoPro camera. We also report results obtained on a subset of the Places365 dataset using our approach and additionally show a deployment of our approach on a robot operating in a real-world environment.

READ FULL TEXT

page 2

page 6

research
11/24/2015

Searching for Objects using Structure in Indoor Scenes

To identify the location of objects of a particular class, a passive com...
research
03/07/2021

A Comparison of Word2Vec, HMM2Vec, and PCA2Vec for Malware Classification

Word embeddings are often used in natural language processing as a means...
research
09/25/2021

An embarrassingly simple comparison of machine learning algorithms for indoor scene classification

With the emergence of autonomous indoor robots, the computer vision task...
research
09/20/2020

Deriving Visual Semantics from Spatial Context: An Adaptation of LSA and Word2Vec to generate Object and Scene Embeddings from Images

Embeddings are an important tool for the representation of word meaning....
research
01/16/2019

UAN: Unified Attention Network for Convolutional Neural Networks

We propose a new architecture that learns to attend to different Convolu...
research
10/06/2015

Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification

Recent work on scene classification still makes use of generic CNN featu...
research
05/17/2021

Fast and Accurate Camera Scene Detection on Smartphones

AI-powered automatic camera scene detection mode is nowadays available i...

Please sign up or login with your details

Forgot password? Click here to reset