Hyperbolic Image-Text Representations

04/18/2023
by Karan Desai, et al.

Visual and linguistic concepts naturally organize themselves in a hierarchy, where a textual concept “dog” entails all images that contain dogs. Although this hierarchy is intuitive, current large-scale vision and language models such as CLIP do not explicitly capture it. We propose MERU, a contrastive model that yields hyperbolic representations of images and text. Hyperbolic spaces have suitable geometric properties to embed tree-like data, so MERU can better capture the underlying hierarchy in image-text data. Our results show that MERU learns a highly interpretable representation space while being competitive with CLIP's performance on multi-modal tasks like image classification and image-text retrieval.
