Evaluation of Output Embeddings for Fine-Grained Image Classification

09/30/2014
by   Zeynep Akata, et al.
0

Image classification has advanced significantly in recent years with the availability of large-scale image sets. However, fine-grained classification remains a major challenge due to the annotation cost of large numbers of fine-grained categories. This project shows that compelling classification performance can be achieved on such categories even without labeled training data. Given image and class embeddings, we learn a compatibility function such that matching embeddings are assigned a higher score than mismatching ones; zero-shot classification of an image proceeds by finding the label yielding the highest joint compatibility score. We use state-of-the-art image features and focus on different supervised attributes and unsupervised output embeddings either derived from hierarchies or learned from unlabeled text corpora. We establish a substantially improved state-of-the-art on the Animals with Attributes and Caltech-UCSD Birds datasets. Most encouragingly, we demonstrate that purely unsupervised output embeddings (learned from Wikipedia and improved with fine-grained text) achieve compelling results, even outperforming the previous supervised state-of-the-art. By combining different output embeddings, we further improve results.

READ FULL TEXT
research
07/21/2023

Generating Image-Specific Text Improves Fine-grained Image Classification

Recent vision-language models outperform vision-only models on many imag...
research
03/29/2016

Latent Embeddings for Zero-shot Classification

We present a novel latent embedding model for learning a compatibility f...
research
07/20/2017

Revisiting Selectional Preferences for Coreference Resolution

Selectional preferences have long been claimed to be essential for coref...
research
07/05/2023

LOAF-M2L: Joint Learning of Wording and Formatting for Singable Melody-to-Lyric Generation

Despite previous efforts in melody-to-lyric generation research, there i...
research
11/28/2016

Gaze Embeddings for Zero-Shot Image Classification

Zero-shot image classification using auxiliary information, such as attr...
research
06/12/2019

Presence-Only Geographical Priors for Fine-Grained Image Classification

Appearance information alone is often not sufficient to accurately diffe...
research
08/25/2017

Nationality Classification Using Name Embeddings

Nationality identification unlocks important demographic information, wi...

Please sign up or login with your details

Forgot password? Click here to reset