Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection

06/15/2023
by   Yunkang Cao, et al.
0

This technical report introduces the winning solution of the team \textit{Segment Any Anomaly} for the CVPR2023 Visual Anomaly and Novelty Detection (VAND) challenge. Going beyond uni-modal prompt, \textit{e.g.}, language prompt, we present a novel framework, \textit{i.e.}, Segment Any Anomaly + (SAA$+$), for zero-shot anomaly segmentation with multi-modal prompts for the regularization of cascaded modern foundation models. Inspired by the great zero-shot generalization ability of foundation models like Segment Anything, we first explore their assembly (SAA) to leverage diverse multi-modal prior knowledge for anomaly localization. Subsequently, we further introduce multimodal prompts (SAA$+$) derived from domain expert knowledge and target image context to enable the non-parameter adaptation of foundation models to anomaly segmentation. The proposed SAA$+$ model achieves state-of-the-art performance on several anomaly segmentation benchmarks, including VisA and MVTec-AD, in the zero-shot setting. We will release the code of our winning solution for the CVPR2023 VAND challenge at \href{Segment-Any-Anomaly}{https://github.com/caoyunkang/Segment-Any-Anomaly} \footnote{The extended-version paper with more details is available at ~\cite{cao2023segment}.}

READ FULL TEXT

page 1

page 3

page 5

research
05/18/2023

Segment Any Anomaly without Training via Hybrid Prompt Regularization

We present a novel framework, i.e., Segment Any Anomaly + (SAA+), for ze...
research
02/15/2023

Zero-Shot Anomaly Detection without Foundation Models

Anomaly detection (AD) tries to identify data instances that deviate fro...
research
04/26/2023

Learnable Ophthalmology SAM

Segmentation is vital for ophthalmology image analysis. But its various ...
research
04/13/2023

High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis

We propose a novel method for Zero-Shot Anomaly Localization that levera...
research
06/30/2023

Topological Data Analysis Guided Segment Anything Model Prompt Optimization for Zero-Shot Segmentation in Biological Imaging

Emerging foundation models in machine learning are models trained on vas...
research
04/12/2023

Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation

With the continuous improvement of computing power and deep learning alg...

Please sign up or login with your details

Forgot password? Click here to reset