Improved Bilinear Pooling with CNNs

07/21/2017
by   Tsung-Yu Lin, et al.

Bilinear pooling of Convolutional Neural Network (CNN) features [22, 23], and its compact variants [10], have been shown to be effective at fine-grained recognition, scene categorization, texture recognition, and visual question-answering tasks, among others. The resulting representation captures second-order statistics of convolutional features in a translationally invariant manner. In this paper we investigate various ways of normalizing these statistics to improve their representation power. In particular, we find that matrix square-root normalization offers significant improvements and outperforms alternative schemes such as matrix logarithm normalization when combined with elementwise square-root and l2 normalization. This improves the accuracy by 2-3%, leading to a new state of the art. We also investigate how the accuracy of matrix function computations affects network training and evaluation. In particular, we compare against a technique for estimating matrix square-root gradients via solving a Lyapunov equation that is more numerically accurate than computing gradients via a Singular Value Decomposition (SVD). We find that while SVD gradients are numerically inaccurate, the overall effect on the final accuracy is negligible once boundary cases are handled carefully. We present an alternative scheme for computing gradients that is faster and still offers improvements over the baseline model. Finally, we show that the matrix square root computed approximately using a few Newton iterations is just as accurate for the classification task but allows an order-of-magnitude faster GPU implementation compared to SVD.
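
As a rough illustration of the pipeline described above, the following NumPy sketch pools local features into a second-order matrix, approximates its square root with a few Newton-type iterations, and then applies elementwise signed square-root and l2 normalization. Several choices here are assumptions and are not taken from the abstract: the coupled Newton-Schulz variant of the iteration, scaling by the Frobenius norm before iterating, the default iteration count, and the function names `bilinear_pool`, `newton_schulz_sqrt`, and `normalize_bilinear`.

```python
import numpy as np


def bilinear_pool(features):
    """Average outer product of local descriptors: (n, d) -> (d, d)."""
    n = features.shape[0]
    return features.T @ features / n


def newton_schulz_sqrt(A, num_iters=20):
    """Approximate the square root of a symmetric positive definite matrix.

    Assumption: coupled Newton-Schulz iteration. A is divided by its
    Frobenius norm first so that its eigenvalues lie in (0, 1], which
    is what the iteration needs in order to converge.
    """
    d = A.shape[0]
    norm = np.linalg.norm(A)      # Frobenius norm
    Y = A / norm                  # converges to sqrt(A / norm)
    Z = np.eye(d)                 # converges to the inverse square root
    I = np.eye(d)
    for _ in range(num_iters):
        T = 0.5 * (3.0 * I - Z @ Y)
        Y = Y @ T
        Z = T @ Z
    return Y * np.sqrt(norm)      # undo the initial scaling


def normalize_bilinear(A, num_iters=20):
    """Matrix sqrt, then elementwise signed sqrt, then l2 normalization."""
    S = newton_schulz_sqrt(A, num_iters)
    v = S.reshape(-1)
    v = np.sign(v) * np.sqrt(np.abs(v))
    return v / (np.linalg.norm(v) + 1e-12)


# Toy usage: 200 local descriptors of dimension 4 (random stand-ins
# for convolutional features at 200 spatial locations).
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 4))
A = bilinear_pool(X)
descriptor = normalize_bilinear(A)
```

The iteration uses only matrix products, which is consistent with the abstract's point about GPU-friendly computation. For a nearly singular pooled matrix, the iteration converges slowly on the small eigenvalues, so adding a small multiple of the identity to `A` beforehand may help.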


research
05/06/2021

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?

Global covariance pooling (GCP) aims at exploiting the second-order stat...
research
12/04/2017

Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization

Global covariance pooling in Convolutional neural networks has achieved i...
research
06/05/2019

Compact Approximation for Polynomial of Covariance Feature

Covariance pooling is a feature pooling method with good classification ...
research
01/21/2022

Fast Differentiable Matrix Square Root

Computing the matrix square root or its inverse in a differentiable mann...
research
08/22/2018

Second-order Democratic Aggregation

Aggregated second-order features extracted from deep convolutional netwo...
research
03/30/2020

Multi-Objective Matrix Normalization for Fine-grained Visual Recognition

Bilinear pooling achieves great success in fine-grained visual recogniti...
research
04/29/2015

Bilinear CNNs for Fine-grained Visual Recognition

We present a simple and effective architecture for fine-grained visual r...
