Keypoint Encoding for Improved Feature Extraction from Compressed Video at Low Bitrates

06/27/2015
by   Jianshu Chao, et al.
0

In many mobile visual analysis applications, compressed video is transmitted over a communication network and analyzed by a server. Typical processing steps performed at the server include keypoint detection, descriptor calculation, and feature matching. Video compression has been shown to have an adverse effect on feature-matching performance. The negative impact of compression can be reduced by using the keypoints extracted from the uncompressed video to calculate descriptors from the compressed video. Based on this observation, we propose to provide these keypoints to the server as side information and to extract only the descriptors from the compressed video. First, we introduce four different frame types for keypoint encoding to address different types of changes in video content. These frame types represent a new scene, the same scene, a slowly changing scene, or a rapidly moving scene and are determined by comparing features between successive video frames. Then, we propose Intra, Skip and Inter modes of encoding the keypoints for different frame types. For example, keypoints for new scenes are encoded using the Intra mode, and keypoints for unchanged scenes are skipped. As a result, the bitrate of the side information related to keypoint encoding is significantly reduced. Finally, we present pairwise matching and image retrieval experiments conducted to evaluate the performance of the proposed approach using the Stanford mobile augmented reality dataset and 720p format videos. The results show that the proposed approach offers significantly improved feature matching and image retrieval performance at a given bitrate.

READ FULL TEXT

page 4

page 5

page 12

page 13

research
03/24/2015

Fast keypoint detection in video sequences

A number of computer vision tasks exploit a succinct representation of t...
research
06/08/2015

Circulant temporal encoding for video retrieval and temporal alignment

We address the problem of specific video event retrieval. Given a query ...
research
12/19/2016

Large-Scale Image Retrieval with Attentive Deep Local Features

We propose an attentive local feature descriptor suitable for large-scal...
research
05/27/2020

D2D: Keypoint Extraction with Describe to Detect Approach

In this paper, we present a novel approach that exploits the information...
research
05/03/2023

Learning-based Relational Object Matching Across Views

Intelligent robots require object-level scene understanding to reason ab...
research
02/26/2015

Coding local and global binary visual features extracted from video sequences

Binary local features represent an effective alternative to real-valued ...
research
05/09/2023

Unsupervised Writer Retrieval using NetRVLAD and Graph Similarity Reranking

This paper presents an unsupervised approach for writer retrieval based ...

Please sign up or login with your details

Forgot password? Click here to reset