Learning Generalized Zero-Shot Learners for Open-Domain Image Geolocalization

02/01/2023
by   Lukas Haas, et al.
0

Image geolocalization is the challenging task of predicting the geographic coordinates of origin for a given photo. It is an unsolved problem relying on the ability to combine visual clues with general knowledge about the world to make accurate predictions across geographies. We present $\href{https://huggingface.co/geolocal/StreetCLIP}{\text{StreetCLIP}}$, a robust, publicly available foundation model not only achieving state-of-the-art performance on multiple open-domain image geolocalization benchmarks but also doing so in a zero-shot setting, outperforming supervised models trained on more than 4 million images. Our method introduces a meta-learning approach for generalized zero-shot learning by pretraining CLIP from synthetic captions, grounding CLIP in a domain of choice. We show that our method effectively transfers CLIP's generalized zero-shot capabilities to the domain of image geolocalization, improving in-domain generalized zero-shot performance without finetuning StreetCLIP on a fixed set of classes.

READ FULL TEXT
research
07/03/2017

Zero-Shot Learning - A Comprehensive Evaluation of the Good, the Bad and the Ugly

Due to the importance of zero-shot learning, i.e. classifying images whe...
research
11/26/2021

Using Fictitious Class Representations to Boost Discriminative Zero-Shot Learners

Focusing on discriminative zero-shot learning, in this work we introduce...
research
06/13/2023

GeneCIS: A Benchmark for General Conditional Image Similarity

We argue that there are many notions of 'similarity' and that models, li...
research
08/08/2018

Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance

Individual neurons in convolutional neural networks supervised for image...
research
09/21/2023

Exploiting CLIP-based Multi-modal Approach for Artwork Classification and Retrieval

Given the recent advances in multimodal image pretraining where visual m...
research
05/01/2022

Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning

Medical coding (MC) is an essential pre-requisite for reliable data retr...
research
10/25/2022

OpenStance: Real-world Zero-shot Stance Detection

Prior studies of zero-shot stance detection identify the attitude of tex...

Please sign up or login with your details

Forgot password? Click here to reset