Cheap-fake Detection with LLM using Prompt Engineering

06/05/2023
by   Guangyang Wu, et al.
0

The misuse of real photographs with conflicting image captions in news items is an example of the out-of-context (OOC) misuse of media. In order to detect OOC media, individuals must determine the accuracy of the statement and evaluate whether the triplet ( i.e., the image and two captions) relates to the same event. This paper presents a novel learnable approach for detecting OOC media in ICME'23 Grand Challenge on Detecting Cheapfakes. The proposed method is based on the COSMOS structure, which assesses the coherence between an image and captions, as well as between two captions. We enhance the baseline algorithm by incorporating a Large Language Model (LLM), GPT3.5, as a feature extractor. Specifically, we propose an innovative approach to feature extraction utilizing prompt engineering to develop a robust and reliable feature extractor with GPT3.5 model. The proposed method captures the correlation between two captions and effectively integrates this module into the COSMOS baseline model, which allows for a deeper understanding of the relationship between captions. By incorporating this module, we demonstrate the potential for significant improvements in cheap-fakes detection performance. The proposed methodology holds promising implications for various applications such as natural language processing, image captioning, and text-to-image synthesis. Docker for submission is available at https://hub.docker.com/repository/docker/mulns/ acmmmcheapfakes.

READ FULL TEXT
research
04/03/2023

Grand Challenge On Detecting Cheapfakes

Cheapfake is a recently coined term that encompasses non-AI ("cheap") ma...
research
07/29/2022

ACM Multimedia Grand Challenge on Detecting Cheapfakes

Cheapfake is a recently coined term that encompasses non-AI (“cheap”) ma...
research
11/15/2022

PromptCap: Prompt-Guided Task-Aware Image Captioning

Image captioning aims to describe an image with a natural language sente...
research
10/12/2016

Generating captions without looking beyond objects

This paper explores new evaluation perspectives for image captioning and...
research
07/06/2018

Face-Cap: Image Captioning using Facial Expression Analysis

Image captioning is the process of generating a natural language descrip...

Please sign up or login with your details

Forgot password? Click here to reset