5th Place Solution to Kaggle Google Universal Image Embedding Competition

10/18/2022
by   Noriaki Ota, et al.
0

In this paper, we present our solution, which placed 5th in the kaggle Google Universal Image Embedding Competition in 2022. We use the ViT-H visual encoder of CLIP from the openclip repository as a backbone and train a head model composed of BatchNormalization and Linear layers using ArcFace. The dataset used was a subset of products10K, GLDv2, GPR1200, and Food101. And applying TTA for part of images also improves the score. With this method, we achieve a score of 0.684 on the public and 0.688 on the private leaderboard. Our code is available. https://github.com/riron1206/kaggle-Google-Universal-Image-Embedding-Competition-5th-Place-Solution

READ FULL TEXT

page 1

page 2

page 3

research
10/14/2022

3rd Place Solution for Google Universal Image Embedding

This paper presents the 3rd place solution to the Google Universal Image...
research
10/17/2022

2nd Place Solution to Google Universal Image Embedding

Image representations are a critical building block of computer vision a...
research
10/16/2022

1st Place Solution in Google Universal Images Embedding

This paper presents the 1st place solution for the Google Universal Imag...
research
10/17/2022

6th Place Solution to Google Universal Image Embedding

This paper presents the 6th place solution to the Google Universal Image...
research
10/08/2021

2nd Place Solution to Google Landmark Retrieval 2021

This paper presents the 2nd place solution to the Google Landmark Retrie...
research
10/18/2022

Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation

If one sees the place name Houston Mercer Dog Run in New York, how does ...
research
05/28/2021

2nd Place Solution for IJCAI-PRICAI 2020 3D AI Challenge: 3D Object Reconstruction from A Single Image

In this paper, we present our solution for the IJCAI–PRICAI–20 3D AI Cha...

Please sign up or login with your details

Forgot password? Click here to reset