AnomalyGPT: Detecting Industrial Anomalies using Large Vision-Language Models

08/29/2023
by Zhaopeng Gu, et al.

Large Vision-Language Models (LVLMs) such as MiniGPT-4 and LLaVA have demonstrated the capability of understanding images and achieved remarkable performance in various visual tasks. Although extensive training datasets make them strong at recognizing common objects, they lack specific domain knowledge and have a weaker understanding of localized details within objects, which hinders their effectiveness in the Industrial Anomaly Detection (IAD) task. On the other hand, most existing IAD methods only provide anomaly scores and require thresholds to be set manually to distinguish between normal and abnormal samples, which restricts their practical implementation. In this paper, we explore the use of LVLMs to address the IAD problem and propose AnomalyGPT, a novel IAD approach based on LVLMs. We generate training data by simulating anomalous images and producing corresponding textual descriptions for each image. We also employ an image decoder to provide fine-grained semantics and design a prompt learner to fine-tune the LVLM using prompt embeddings. AnomalyGPT eliminates the need for manual threshold adjustments and thus directly assesses the presence and locations of anomalies. Additionally, AnomalyGPT supports multi-turn dialogues and exhibits impressive few-shot in-context learning capabilities. With only one normal shot, AnomalyGPT achieves state-of-the-art performance with an accuracy of 86.1% on the MVTec-AD dataset. Code is available at https://github.com/CASIA-IVA-Lab/AnomalyGPT.
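To make the thresholding limitation mentioned above concrete, here is a minimal sketch of how a score-only IAD method might be turned into a normal/abnormal decision. It is not taken from AnomalyGPT or any specific method: the function name `classify_by_threshold`, the example scores, and both threshold values are hypothetical and chosen only for illustration.

```python
from typing import List


def classify_by_threshold(scores: List[float], threshold: float) -> List[bool]:
    """Flag a sample as anomalous when its score exceeds the threshold."""
    return [score > threshold for score in scores]


# Hypothetical anomaly scores for four images (higher = more anomalous).
scores = [0.12, 0.35, 0.48, 0.81]

# The same scores yield different decisions depending on the chosen threshold,
# which is why a score-only method needs manual tuning per deployment.
low_threshold = classify_by_threshold(scores, threshold=0.3)
high_threshold = classify_by_threshold(scores, threshold=0.5)

print(low_threshold)   # [False, True, True, True]
print(high_threshold)  # [False, False, False, True]
```

The two middle samples flip between normal and abnormal as the threshold moves, which illustrates the kind of manual adjustment the abstract says AnomalyGPT avoids by stating directly whether an anomaly is present.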


research
05/30/2023

AnoOnly: Semi-Supervised Anomaly Detection without Loss on Normal Data

Semi-supervised anomaly detection (SSAD) methods have demonstrated their...
research
10/09/2021

Focus Your Distribution: Coarse-to-Fine Non-Contrastive Learning for Anomaly Detection and Localization

The essence of unsupervised anomaly detection is to learn the compact di...
research
06/08/2022

Dual-Distribution Discrepancy for Anomaly Detection in Chest X-Rays

Chest X-ray (CXR) is the most typical radiological exam for diagnosis of...
research
11/25/2022

MAEDAY: MAE for few and zero shot AnomalY-Detection

The goal of Anomaly-Detection (AD) is to identify outliers, or outlying ...
research
08/22/2023

Random Word Data Augmentation with CLIP for Zero-Shot Anomaly Detection

This paper presents a novel method that leverages a visual-language mode...
research
05/30/2021

Defending Pre-trained Language Models from Adversarial Word Substitutions Without Performance Sacrifice

Pre-trained contextualized language models (PrLMs) have led to strong pe...
research
05/16/2021

How is BERT surprised? Layerwise detection of linguistic anomalies

Transformer language models have shown remarkable ability in detecting w...
