OpenCL-accelerated object classification in video streams using Spatial Pooler of Hierarchical Temporal Memory

08/05/2016
by   Maciej Wielgosz, et al.
0

We present a method to classify objects in video streams using a brain-inspired Hierarchical Temporal Memory (HTM) algorithm. Object classification is a challenging task where humans still significantly outperform machine learning algorithms due to their unique capabilities. We have implemented a system which achieves very promising performance in terms of recognition accuracy. Unfortunately, conducting more advanced experiments is very computationally demanding; some of the trials run on a standard CPU may take as long as several days for 960x540 video streams frames. Therefore we have decided to accelerate selected parts of the system using OpenCL. In particular, we seek to determine to what extent porting selected and computationally demanding parts of a core may speed up calculations. The classification accuracy of the system was examined through a series of experiments and the performance was given in terms of F1 score as a function of the number of columns, synapses, min_overlap and winners_set_size. The system achieves the highest F1 score of 0.95 and 0.91 for min_overlap=4 and 256 synapses, respectively. We have also conduced a series of experiments with different hardware setups and measured CPU/GPU acceleration. The best kernel speed-up of 632x and 207x was reached for 256 synapses and 1024 columns. However, overall acceleration including transfer time was significantly lower and amounted to 6.5x and 3.2x for the same setup.

READ FULL TEXT

page 6

page 9

page 10

page 13

research
09/10/2016

Using Spatial Pooler of Hierarchical Temporal Memory to classify noisy videos with predefined complexity

This paper examines the performance of a Spatial Pooler (SP) of a Hierar...
research
07/08/2020

Quality Classification of Defective Parts from Injection Moulding

This report examines machine learning algorithms for detecting short for...
research
08/19/2020

SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras

Activity detection from first-person videos (FPV) captured using a weara...
research
08/17/2018

Neuromorphic Architecture for the Hierarchical Temporal Memory

A biomimetic machine intelligence algorithm, that holds promise in creat...
research
11/26/2013

Digitize Your Body and Action in 3-D at Over 10 FPS: Real Time Dense Voxel Reconstruction and Marker-less Motion Tracking via GPU Acceleration

In this paper, we present an approach to reconstruct 3-D human motion fr...
research
07/24/2019

QRMODA and BRMODA: Novel Models for Face Recognition Accuracy in Computer Vision Systems with Adapted Video Streams

A major challenge facing Computer Vision systems is providing the abilit...

Please sign up or login with your details

Forgot password? Click here to reset