The 2021 Image Similarity Dataset and Challenge

06/17/2021
by   Matthijs Douze, et al.

This paper introduces a new benchmark for large-scale image similarity detection. The benchmark is used for the Image Similarity Challenge at NeurIPS'21 (ISC2021). The goal is to determine whether a query image is a modified copy of any image in a reference corpus of 1 million images. The benchmark features a variety of image transformations, including automated transformations, hand-crafted image edits, and machine-learning-based manipulations. This mimics real-life cases that appear on social media, for example integrity-related problems involving misinformation and objectionable content. The strength of the image manipulations, and therefore the difficulty of the benchmark, is calibrated according to the performance of a set of baseline approaches. Both the query and reference sets contain a majority of “distractor” images that do not match, which corresponds to a real-life needle-in-a-haystack setting, and the evaluation metric reflects that. We expect the DISC21 benchmark to promote image copy detection as an important and challenging computer vision task and to refresh the state of the art.
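The copy detection task described in the abstract can be illustrated with a minimal sketch. It assumes each image has already been reduced to a descriptor vector, and it uses cosine similarity with a fixed threshold to decide whether a query matches any reference or should be treated as a distractor. The function names, the toy 3-dimensional descriptors, and the 0.9 threshold are all hypothetical choices for illustration; they are not the benchmark's baselines or its evaluation metric.

```python
import math


def normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def best_match(query, references):
    """Return (index, cosine similarity) of the reference closest to the query."""
    q = normalize(query)
    best_idx, best_sim = -1, -1.0
    for idx, ref in enumerate(references):
        r = normalize(ref)
        sim = sum(a * b for a, b in zip(q, r))
        if sim > best_sim:
            best_idx, best_sim = idx, sim
    return best_idx, best_sim


def detect_copy(query, references, threshold=0.9):
    """Return the matching reference index, or None for a likely distractor.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    idx, sim = best_match(query, references)
    return idx if sim >= threshold else None


# Toy descriptors standing in for real image features (hypothetical data).
references = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
edited_copy = [0.1, 0.98, 0.05]  # slightly perturbed version of reference 1
distractor = [1.0, 1.0, 1.0]     # not close to any single reference

print(detect_copy(edited_copy, references))  # prints 1
print(detect_copy(distractor, references))   # prints None
```

Because most queries in a needle-in-a-haystack setting are distractors, the "no match" outcome is the common case, so the choice of threshold governs how many false positives a system produces. A corpus of 1 million references would also call for an approximate nearest-neighbor index rather than the linear scan shown here.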


research
11/13/2021

D^2LV: A Data-Driven and Local-Verification Approach for Image Copy Detection

Image copy detection is of great importance in real-life social media. I...
research
06/15/2023

The 2023 Video Similarity Dataset and Challenge

This work introduces a dataset, benchmark, and challenge for the problem...
research
05/24/2022

A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection

Image copy detection (ICD) aims to determine whether a query image is an...
research
01/19/2018

Image Provenance Analysis at Scale

Prior art has shown it is possible to estimate, through image processing...
research
12/06/2021

Producing augmentation-invariant embeddings from real-life imagery

This article presents an efficient way to produce feature-rich, high-dim...
research
12/16/2019

PDQ TMK + PDQF – A Test Drive of Facebook's Perceptual Hashing Algorithms

Efficient and reliable automated detection of modified image and multime...
research
02/08/2022

Results and findings of the 2021 Image Similarity Challenge

The 2021 Image Similarity Challenge introduced a dataset to serve as a n...
