Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach

09/07/2017
by Timnit Gebru, et al.

While fine-grained object recognition is an important problem in computer vision, current models are unlikely to accurately classify objects in the wild. These fully supervised models need additional annotated images to classify objects in every new scenario, a requirement that is infeasible to meet. However, sources such as e-commerce websites and field guides provide annotated images for many classes. In this work, we study fine-grained domain adaptation as a step towards overcoming the dataset shift between easily acquired annotated images and the real world. Adaptation has not been studied in the fine-grained setting, where annotations such as attributes could be used to increase performance. Our work uses an attribute-based multi-task adaptation loss to increase accuracy from a baseline of 4.1 [...]. Domain adaptation works have been benchmarked on small datasets such as [46], with a total of 795 images for some domains, or simplistic datasets such as [41], consisting of digits. We perform experiments on a subset of a new challenging fine-grained dataset consisting of 1,095,021 images of 2,657 car categories drawn from e-commerce websites and Google Street View.


research
07/31/2018

Improving the Annotation of DeepFashion Images for Fine-grained Attribute Recognition

DeepFashion is a widely used clothing dataset with 50 categories and mor...
research
10/12/2016

Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes

Recognising detailed clothing characteristics (fine-grained attributes) ...
research
10/22/2021

Domain Adaptation and Active Learning for Fine-Grained Recognition in the Field of Biodiversity

Deep-learning methods offer unsurpassed recognition performance in a wid...
research
08/31/2017

Identifying Products in Online Cybercrime Marketplaces: A Dataset for Fine-grained Domain Adaptation

One weakness of machine-learned NLP models is that they typically perfor...
research
11/27/2018

Part-level Car Parsing and Reconstruction from Single Street View

In this paper, we make the first attempt to build a framework to simulta...
research
09/07/2017

Fine-Grained Car Detection for Visual Census Estimation

Targeted socioeconomic policies require an accurate understanding of a c...
research
10/13/2022

M2D2: A Massively Multi-domain Language Modeling Dataset

We present M2D2, a fine-grained, massively multi-domain corpus for study...
