TR2024-040

Long-Tailed Anomaly Detection with Learnable Class Names

- Ho, C.-H., Peng, K.-C., Vasconcelos, N., "Long-Tailed Anomaly Detection with Learnable Class Names", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2024.
  BibTeX TR2024-040 PDF Presentation
  - @inproceedings{Ho2024jun,
  - author = {Ho, Chih-Hui and Peng, Kuan-Chuan and Vasconcelos, Nuno},
  - title = {Long-Tailed Anomaly Detection with Learnable Class Names},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  - year = 2024,
  - month = jun,
  - url = {https://www.merl.com/publications/TR2024-040}
  - }
MERL Contact:
- Kuan-Chuan
  Peng
Research Areas:

Artificial Intelligence, Computer Vision, Machine Learning

Abstract:

Anomaly detection (AD) aims to identify defective images and localize their defects (if any). Ideally, AD models should be able to detect defects over many image classes; without relying on hard-coded class names that can be uninformative or inconsistent across datasets; learn without anomaly supervision; and be robust to the long-tailed distributions of real-world applications. To address these challenges, we formulate the problem of long-tailed AD by introducing several datasets with different levels of class imbalance and metrics for performance evaluation. We then propose a novel method, LTAD, to detect defects from multiple and long-tailed classes, without relying on dataset class names. LTAD combines AD by reconstruction and semantic AD modules. AD by reconstruction is implemented with a transformer-based reconstruction module. Semantic AD is implemented with a binary classifier, which relies on learned pseudo class names and a pretrained foundation model. These modules are learned over two phases. Phase 1 learns the pseudo-class names and a variational autoencoder (VAE) for feature synthesis that augments the training data to combat long-tails. Phase 2 then learns the parameters of the reconstruction and classification modules of LTAD. Extensive experiments using the proposed long-tailed datasets show that LTAD substantially outperforms the state-of-the-art methods for most forms of dataset imbalance. The long-tailed dataset split is available at https://zenodo.org/records/10854201.

Related Publication

Ho, C.-H., Peng, K.-C., Vasconcelos, N., "Long-Tailed Anomaly Detection with Learnable Class Names", arXiv, March 2024.

BibTeX arXiv

@article{Ho2024mar,
author = {Ho, Chih-Hui and Peng, Kuan-Chuan and Vasconcelos, Nuno},
title = {Long-Tailed Anomaly Detection with Learnable Class Names},
journal = {arXiv},
year = 2024,
month = mar,
url = {https://arxiv.org/abs/2403.20236}
}

MERL Contact:

Kuan-ChuanPeng

Research Areas:

Abstract:

Kuan-Chuan
Peng