Deep Learning Method for Object Tracking, Velocity Estimation and Projection of Sensor Data over Time

06/08/2023
by   Marco Braun, et al.
0

Current Deep Learning methods for environment segmentation and velocity estimation rely on Convolutional Recurrent Neural Networks to exploit spatio-temporal relationships within obtained sensor data. These approaches derive scene dynamics implicitly by correlating novel input and memorized data utilizing ConvNets. We show how ConvNets suffer from architectural restrictions for this task. Based on these findings, we then provide solutions to various issues on exploiting spatio-temporal correlations in a sequence of sensor recordings by presenting a novel Recurrent Neural Network unit utilizing Transformer mechanisms. Within this unit, object encodings are tracked across consecutive frames by correlating key-query pairs derived from sensor inputs and memory states, respectively. We then use resulting tracking patterns to obtain scene dynamics and regress velocities. In a last step, the memory state of the Recurrent Neural Network is projected based on extracted velocity estimates to resolve aforementioned spatio-temporal misalignment.

READ FULL TEXT

page 1

page 6

research
02/02/2016

Deep Tracking: Seeing Beyond Seeing Using Recurrent Neural Networks

This paper presents to the best of our knowledge the first end-to-end ob...
research
11/17/2015

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Deep Recurrent Neural Network architectures, though remarkably capable a...
research
06/06/2016

Predictive Coding for Dynamic Vision : Development of Functional Hierarchy in a Multiple Spatio-Temporal Scales RNN Model

The current paper presents a novel recurrent neural network model, the p...
research
10/31/2016

Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks

We propose a convolutional recurrent neural network, with Winner-Take-Al...
research
04/11/2023

PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors

Conventional image sensors digitize high-resolution images at fast frame...
research
10/12/2021

Generalized Time Domain Velocity Vector

We introduce and analyze Generalized Time Domain Velocity Vector (GTVV),...
research
04/11/2023

Online Spatio-Temporal Learning with Target Projection

Recurrent neural networks trained with the backpropagation through time ...

Please sign up or login with your details

Forgot password? Click here to reset