Thomas Defard, Aleksandr Setkov, Angelique Loesch, Romaric Audigier
We present a new framework for Patch Distribution Modeling, PaDiM, to concurrently detect and localize anomalies in images in a one-class learning setting. PaDiM makes use of a pretrained convolutional neural network (CNN) for patch embedding, and of multivariate Gaussian distributions to get a probabilistic representation of the normal class. It also exploits correlations between the different semantic levels of CNN to better localize anomalies. PaDiM outperforms current state-of-the-art approaches for both anomaly detection and localization on the MVTec AD and STC datasets. To match real-world visual industrial inspection, we extend the evaluation protocol to assess performance of anomaly localization algorithms on non-aligned dataset. The state-of-the-art performance and low complexity of PaDiM make it a good candidate for many industrial applications.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Anomaly Detection | Hyper-Kvasir Dataset | AUC | 0.923 | PaDiM |
| Anomaly Detection | LAG | AUC | 0.688 | PaDiM |
| Anomaly Detection | MVTec AD | Detection AUROC | 97.9 | PaDiM |
| Anomaly Detection | MVTec AD | Detection AUROC | 95.3 | PaDiM-WR50-Rd550 |
| Anomaly Detection | MVTec AD | FPS | 4.4 | PaDiM-WR50-Rd550 |
| Anomaly Detection | MVTec AD | Segmentation AUROC | 97.5 | PaDiM-WR50-Rd550 |
| Anomaly Detection | MVTec AD | Segmentation AUROC | 96.7 | PaDiM-R18-Rd100 |
| Anomaly Detection | VisA | Segmentation AUPRO (until 30% FPR) | 85.9 | PaDiM |