Incremental Learning of 3D-DCT Compact Representations for Robust Visual Tracking

07/14/2012
by   Xi Li, et al.
0

Visual tracking usually requires an object appearance model that is robust to changing illumination, pose and other factors encountered in video. In this paper, we construct an appearance model using the 3D discrete cosine transform (3D-DCT). The 3D-DCT is based on a set of cosine basis functions, which are determined by the dimensions of the 3D signal and thus independent of the input video data. In addition, the 3D-DCT can generate a compact energy spectrum whose high-frequency coefficients are sparse if the appearance samples are similar. By discarding these high-frequency coefficients, we simultaneously obtain a compact 3D-DCT based object representation and a signal reconstruction-based similarity measure (reflecting the information loss from signal reconstruction). To efficiently update the object representation, we propose an incremental 3D-DCT algorithm, which decomposes the 3D-DCT into successive operations of the 2D discrete cosine transform (2D-DCT) and 1D discrete cosine transform (1D-DCT) on the input video data.

READ FULL TEXT

page 7

page 8

page 10

page 11

page 12

page 13

page 14

page 15

research
04/18/2018

Reversible Video Data Hiding Using Zero QDCT Coefficient-Pairs

There exist many zero quantized discrete cosine transform (QDCT) coeffic...
research
11/09/2018

The discrete cosine transform on triangles

The discrete cosine transform is a valuable tool in analysis of data on ...
research
09/01/2023

Adaptive function approximation based on the Discrete Cosine Transform (DCT)

This paper studies the cosine as basis function for the approximation of...
research
05/10/2022

Problems with Cosine as a Measure of Embedding Similarity for High Frequency Words

Cosine similarity of contextual embeddings is used in many NLP tasks (e....
research
12/18/2014

Data Representation using the Weyl Transform

The Weyl transform is introduced as a rich framework for data representa...
research
07/26/2018

Fast cosine transform for FCC lattices

Voxel representation and processing is an important issue in a broad spe...
research
06/08/2019

Frequency-Dependent Perceptual Quantisation for Visually Lossless Compression Applications

The default quantisation algorithms in the state-of-the-art High Efficie...

Please sign up or login with your details

Forgot password? Click here to reset