Object Recognition from Short Videos for Robotic Perception

09/04/2015
by   Ivan Bogun, et al.
0

Deep neural networks have become the primary learning technique for object recognition. Videos, unlike still images, are temporally coherent which makes the application of deep networks non-trivial. Here, we investigate how motion can aid object recognition in short videos. Our approach is based on Long Short-Term Memory (LSTM) deep networks. Unlike previous applications of LSTMs, we implement each gate as a convolution. We show that convolutional-based LSTM models are capable of learning motion dependencies and are able to improve the recognition accuracy when more frames in a sequence are available. We evaluate our approach on the Washington RGBD Object dataset and on the Washington RGBD Scenes dataset. Our approach outperforms deep nets applied to still images and sets a new state-of-the-art in this domain.

READ FULL TEXT

page 5

page 6

research
06/10/2023

Bayesian and Neural Inference on LSTM-based Object Recognition from Tactile and Kinesthetic Information

Recent advances in the field of intelligent robotic manipulation pursue ...
research
12/19/2016

Few-Shot Object Recognition from Machine-Labeled Web Images

With the tremendous advances of Convolutional Neural Networks (ConvNets)...
research
03/31/2015

Beyond Short Snippets: Deep Networks for Video Classification

Convolutional neural networks (CNNs) have been extensively applied for i...
research
03/04/2019

TKD: Temporal Knowledge Distillation for Active Perception

Deep neural networks based methods have been proved to achieve outstandi...
research
04/14/2018

Video2Shop: Exactly Matching Clothes in Videos to Online Shopping Images

In recent years, both online retail and video hosting service are expone...
research
05/02/2020

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

In recent years, both online retail and video hosting service have been ...
research
03/18/2019

Direct Object Recognition Without Line-of-Sight Using Optical Coherence

Visual object recognition under situations in which the direct line-of-s...

Please sign up or login with your details

Forgot password? Click here to reset