
-
Probabilistic Embeddings for Cross-Modal Retrieval
Cross-modal retrieval methods build a common representation space for sa...
-
Concept Generalization in Visual Representation Learning
Measuring concept generalization, i.e., the extent to which models train...
-
StacMR: Scene-Text Aware Cross-Modal Retrieval
Recent models for cross-modal retrieval have benefited from an increasin...
-
Continual Adaptation of Visual Representations via Domain Randomization and Meta-learning
Most standard learning approaches lead to fragile models which are prone...
-
Hard Negative Mixing for Contrastive Learning
Contrastive learning has become a key component of self-supervised learn...
-
Learning Visual Representations with Caption Annotations
Pretraining general-purpose visual features has become a crucial part of...
-
Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings
We address the problem of cross-modal fine-grained action retrieval betw...
-
Semi-convolutional Operators for Instance Segmentation
Object detection and instance segmentation are dominated by region-based...
-
Self-supervised Learning of Geometrically Stable Features Through Probabilistic Introspection
Self-supervision can dramatically cut back the amount of manually-labell...
-
Re-ID done right: towards good practices for person re-identification
Training a deep architecture using a ranking loss has become standard fo...
-
Learning 3D Object Categories by Looking Around Them
Traditional approaches for learning 3D object categories use either synt...
-
End-to-end Learning of Deep Visual Representations for Image Retrieval
While deep learning has become a key ingredient in the top performing me...
-
Learning the semantic structure of objects from Web supervision
While recent research in image understanding has often focused on recogn...
-
Deep Image Retrieval: Learning global representations for image search
We propose a novel approach for instance-level image retrieval. It produ...
-
What is the right way to represent document images?
In this article we study the problem of document image representation ba...
-
Understanding the Fisher Vector: a multimodal part model
Fisher Vectors and related orderless visual statistics have demonstrated...
-
What makes an Image Iconic? A Fine-Grained Case Study
A natural approach to teaching a visual concept, e.g. a bird species, is...
-
Incorporating Near-Infrared Information into Semantic Image Segmentation
Recent progress in computational photography has shown that we can acqui...