Visual Recognition Using Directional Distribution Distance

04/19/2015
by Jianxin Wu, et al.

In computer vision, an entity such as an image or video is often represented as a set of instance vectors, which can be SIFT, motion, or deep learning feature vectors extracted from different parts of that entity. It is therefore essential to design efficient and effective methods for comparing two sets of instance vectors. Existing methods such as the Fisher Vector (FV), VLAD, and Super Vectors have achieved excellent results. However, this paper shows that these methods are designed from a generative perspective, and that a discriminative method can be more effective in categorizing images or videos. The proposed D3 (discriminative distribution distance) method treats the two sets as two distributions and introduces a directional total variation distance (DTVD) to measure how well separated they are. Furthermore, a classifier-based method is proposed to estimate DTVD robustly. The D3 method is evaluated on action and image recognition tasks, where it achieves excellent accuracy and speed. D3 is also complementary to FV: the combination of D3 and FV has advantages over D3, FV, and VLAD used individually.
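To make the idea of comparing two sets along a direction concrete, here is a minimal sketch in Python. It is not the paper's classifier-based estimator: it assumes the projection direction is simply the difference of the two set means, and it estimates the total variation distance of the projected values with a shared histogram. The function name `projected_tv_distance` and the `bins` parameter are hypothetical, introduced only for illustration.

```python
import numpy as np

def projected_tv_distance(X, Y, direction=None, bins=32):
    """Histogram estimate of the total variation distance between two sets
    of instance vectors after projecting them onto a single direction.

    Assumption (not from the paper): when no direction is given, the
    difference of the two set means is used as the projection direction.
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    if direction is None:
        direction = X.mean(axis=0) - Y.mean(axis=0)
    direction = np.asarray(direction, dtype=float)
    norm = np.linalg.norm(direction)
    if norm == 0.0:
        # No mean shift to project onto; this sketch simply reports 0.
        return 0.0
    w = direction / norm

    # Project every instance vector onto the unit direction.
    px, py = X @ w, Y @ w
    lo = min(px.min(), py.min())
    hi = max(px.max(), py.max())
    if lo == hi:
        return 0.0

    # Shared bin edges so both histograms are directly comparable.
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(px, bins=edges)
    q, _ = np.histogram(py, bins=edges)
    p = p / p.sum()
    q = q / q.sum()

    # Total variation distance between two discrete distributions.
    return 0.5 * np.abs(p - q).sum()
```

The result lies in [0, 1]: values near 0 mean the projected sets overlap heavily, and values near 1 mean they are almost perfectly separated along the chosen direction. A histogram estimate like this is sensitive to the bin count and to small sets, which is one reason a more robust estimator is desirable.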


research
01/31/2018

Weighted Nonlocal Total Variation in Image Processing

In this paper, a novel weighted nonlocal total variation (WNTV) method i...
research
06/29/2011

Image denoising assessment using anisotropic stack filtering

In this paper we propose a measure of anisotropy as a quality parameter ...
research
06/13/2017

von Mises-Fisher Mixture Model-based Deep learning: Application to Face Verification

A number of pattern recognition tasks, e.g., face verification, can be b...
research
05/23/2022

Discriminative Feature Learning through Feature Distance Loss

Convolutional neural networks have shown remarkable ability to learn dis...
research
10/31/2016

A New Distance Measure for Non-Identical Data with Application to Image Classification

Distance measures are part and parcel of many computer vision algorithms...
research
12/23/2015

Mid-level Representation for Visual Recognition

Visual Recognition is one of the fundamental challenges in AI, where the...
research
08/28/2020

Nonlocal Adaptive Direction-Guided Structure Tensor Total Variation For Image Recovery

A common strategy in variational image recovery is utilizing the nonloca...
