A Foreground Inference Network for Video Surveillance Using Multi-View Receptive Field

01/19/2018
by   Thangarajah Akilan, et al.
0

Foreground (FG) pixel labelling plays a vital role in video surveillance. Recent engineering solutions have attempted to exploit the efficacy of deep learning (DL) models initially targeted for image classification to deal with FG pixel labelling. One major drawback of such strategy is the lacking delineation of visual objects when training samples are limited. To grapple with this issue, we introduce a multi-view receptive field fully convolutional neural network (MV-FCN) that harness recent seminal ideas, such as, fully convolutional structure, inception modules, and residual networking. Therefrom, we implement a system in an encoder-decoder fashion that subsumes a core and two complementary feature flow paths. The model exploits inception modules at early and late stages with three different sizes of receptive fields to capture invariance at various scales. The features learned in the encoding phase are fused with appropriate feature maps in the decoding phase through residual connections for achieving enhanced spatial representation. These multi-view receptive fields and residual feature connections are expected to yield highly generalized features for an accurate pixel-wise FG region identification. It is, then, trained with database specific exemplary segmentations to predict desired FG objects. The comparative experimental results on eleven benchmark datasets validate that the proposed model achieves very competitive performance with the prior- and state-of-the-art algorithms. We also report that how well a transfer learning approach can be useful to enhance the performance of our proposed MV-FCN.

READ FULL TEXT

page 7

page 8

page 9

page 10

page 11

research
03/10/2017

Fast LIDAR-based Road Detection Using Fully Convolutional Neural Networks

In this work, a deep learning approach has been developed to carry out r...
research
12/02/2022

DWRSeg: Dilation-wise Residual Network for Real-time Semantic Segmentation

Real-time semantic segmentation has played an important role in intellig...
research
03/08/2023

FCN+: Global Receptive Convolution Makes FCN Great Again

Fully convolutional network (FCN) is a seminal work for semantic segment...
research
08/04/2020

Hyperspectral Image Classification with Spatial Consistence Using Fully Convolutional Spatial Propagation Network

In recent years, deep convolutional neural networks (CNNs) have shown im...
research
04/07/2019

A Dilated Inception Network for Visual Saliency Prediction

Recently, with the advent of deep convolutional neural networks (DCNN), ...
research
04/06/2023

Improving automatic endoscopic stone recognition using a multi-view fusion approach enhanced with two-step transfer learning

This contribution presents a deep-learning method for extracting and fus...
research
02/02/2014

Collaborative Receptive Field Learning

The challenge of object categorization in images is largely due to arbit...

Please sign up or login with your details

Forgot password? Click here to reset