Benchmarking Scientific Image Forgery Detectors

05/26/2021
by   João P. Cardenuto, et al.
29

The scientific image integrity area presents a challenging research bottleneck, the lack of available datasets to design and evaluate forensic techniques. Its data sensitivity creates a legal hurdle that prevents one to rely on real tampered cases to build any sort of accessible forensic benchmark. To mitigate this bottleneck, we present an extendable open-source library that reproduces the most common image forgery operations reported by the research integrity community: duplication, retouching, and cleaning. Using this library and realistic scientific images, we create a large scientific forgery image benchmark (39,423 images) with an enriched ground-truth. In addition, concerned about the high number of retracted papers due to image duplication, this work evaluates the state-of-the-art copy-move detection methods in the proposed dataset, using a new metric that asserts consistent match detection between the source and the copied region. The dataset and source-code will be freely available upon acceptance of the paper.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 8

page 9

page 10

research
04/18/2017

LibOPT: An Open-Source Platform for Fast Prototyping Soft Optimization Techniques

Optimization techniques play an important role in several scientific and...
research
10/28/2020

Forgery Blind Inspection for Detecting Manipulations of Gel Electrophoresis Images

Recently, falsified images have been found in papers involved in researc...
research
02/03/2021

Learning to identify image manipulations in scientific publications

Adherence to scientific community standards ensures objectivity, clarity...
research
09/28/2022

Automatic Analysis of Available Source Code of Top Artificial Intelligence Conference Papers

Source code is essential for researchers to reproduce the methods and re...
research
08/30/2021

BioFors: A Large Biomedical Image Forensics Dataset

Research in media forensics has gained traction to combat the spread of ...
research
07/12/2023

CLAIMED – the open source framework for building coarse-grained operators for accelerated discovery in science

In modern data-driven science, reproducibility and reusability are key c...
research
11/09/2017

DLPaper2Code: Auto-generation of Code from Deep Learning Research Papers

With an abundance of research papers in deep learning, reproducibility o...

Please sign up or login with your details

Forgot password? Click here to reset