Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task

08/09/2016
by Ashkan Mokarian, et al.

We present Mean Box Pooling, a novel visual representation that pools over CNN representations of a large number of highly overlapping object proposals. We show that this representation, together with nCCA, a successful multimodal embedding technique, achieves state-of-the-art performance on the Visual Madlibs task. Moreover, inspired by nCCA's objective function, we extend the classical CNN+LSTM approach to train the network by directly maximizing the similarity between the internal representation of the deep learning architecture and the candidate answers. This approach, in turn, achieves a significant improvement over prior work on Visual Madlibs that also uses a CNN+LSTM approach.
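The pooling step and the similarity-based answer selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`mean_box_pooling`, `cosine_similarity`, `pick_answer`) are hypothetical, the feature vectors are toy values standing in for CNN activations of object proposals, and the sketch assumes the image and candidate-answer vectors already live in a shared embedding space (in the paper, that mapping is provided by nCCA or learned by the network).

```python
import math
from typing import List, Sequence

Vector = List[float]


def mean_box_pooling(box_features: Sequence[Sequence[float]]) -> Vector:
    """Average per-proposal feature vectors into one image descriptor."""
    if not box_features:
        raise ValueError("need at least one proposal feature vector")
    dim = len(box_features[0])
    pooled = [0.0] * dim
    for feat in box_features:
        if len(feat) != dim:
            raise ValueError("all proposal features must share one dimension")
        for i, value in enumerate(feat):
            pooled[i] += value
    return [total / len(box_features) for total in pooled]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def pick_answer(image_vec: Sequence[float],
                candidate_vecs: Sequence[Sequence[float]]) -> int:
    """Return the index of the candidate most similar to the image vector."""
    scores = [cosine_similarity(image_vec, cand) for cand in candidate_vecs]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy stand-ins for CNN features of three overlapping proposals (assumed).
box_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
pooled = mean_box_pooling(box_features)

# Toy stand-ins for embedded candidate answers (assumed shared space).
candidates = [[1.0, 1.0], [1.0, -1.0]]
best = pick_answer(pooled, candidates)
```

Averaging keeps the descriptor at a fixed size regardless of how many proposals are extracted, which is what allows a large number of overlapping boxes to be used without changing the downstream embedding.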


