TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning JPEG Compression Artifacts for Image Manipulation...

Learning JPEG Compression Artifacts for Image Manipulation Detection and Localization

Myung-Joon Kwon, Seung-Hun Nam, In-Jae Yu, Heung-Kyu Lee, Changick Kim

2021-08-30Image Manipulation LocalizationImage ManipulationImage Manipulation Detection
PaperPDFCode(official)

Abstract

Detecting and localizing image manipulation are necessary to counter malicious use of image editing techniques. Accordingly, it is essential to distinguish between authentic and tampered regions by analyzing intrinsic statistics in an image. We focus on JPEG compression artifacts left during image acquisition and editing. We propose a convolutional neural network (CNN) that uses discrete cosine transform (DCT) coefficients, where compression artifacts remain, to localize image manipulation. Standard CNNs cannot learn the distribution of DCT coefficients because the convolution throws away the spatial coordinates, which are essential for DCT coefficients. We illustrate how to design and train a neural network that can learn the distribution of DCT coefficients. Furthermore, we introduce Compression Artifact Tracing Network (CAT-Net) that jointly uses image acquisition artifacts and compression artifacts. It significantly outperforms traditional and deep neural network-based methods in detecting and localizing tampered regions.

Results

TaskDatasetMetricValueModel
Image Manipulation DetectionCOVERAGEAUC0.68CAT-Net v2
Image Manipulation DetectionCOVERAGEBalanced Accuracy0.635CAT-Net v2
Image Manipulation DetectionColumbiaAUC0.977CAT-Net v2
Image Manipulation DetectionColumbiaBalanced Accuracy0.803CAT-Net v2
Image Manipulation DetectionCocoGlideAUC0.667CAT-Net v2
Image Manipulation DetectionCocoGlideBalanced Accuracy0.58CAT-Net v2
Image Manipulation DetectionDSO-1AUC0.747CAT-Net v2
Image Manipulation DetectionDSO-1Balanced Accuracy0.525CAT-Net v2
Image Manipulation DetectionCasia V1+AUC0.942CAT-Net v2
Image Manipulation DetectionCasia V1+Balanced Accuracy0.838CAT-Net v2
VideoCOVERAGEAUC0.68CAT-Net v2
VideoCOVERAGEBalanced Accuracy0.635CAT-Net v2
VideoColumbiaAUC0.977CAT-Net v2
VideoColumbiaBalanced Accuracy0.803CAT-Net v2
VideoCocoGlideAUC0.667CAT-Net v2
VideoCocoGlideBalanced Accuracy0.58CAT-Net v2
VideoDSO-1AUC0.747CAT-Net v2
VideoDSO-1Balanced Accuracy0.525CAT-Net v2
VideoCasia V1+AUC0.942CAT-Net v2
VideoCasia V1+Balanced Accuracy0.838CAT-Net v2
Temporal Action LocalizationCOVERAGEAUC0.68CAT-Net v2
Temporal Action LocalizationCOVERAGEBalanced Accuracy0.635CAT-Net v2
Temporal Action LocalizationColumbiaAUC0.977CAT-Net v2
Temporal Action LocalizationColumbiaBalanced Accuracy0.803CAT-Net v2
Temporal Action LocalizationCocoGlideAUC0.667CAT-Net v2
Temporal Action LocalizationCocoGlideBalanced Accuracy0.58CAT-Net v2
Temporal Action LocalizationDSO-1AUC0.747CAT-Net v2
Temporal Action LocalizationDSO-1Balanced Accuracy0.525CAT-Net v2
Temporal Action LocalizationCasia V1+AUC0.942CAT-Net v2
Temporal Action LocalizationCasia V1+Balanced Accuracy0.838CAT-Net v2
Anomaly DetectionCOVERAGEAUC0.68CAT-Net v2
Anomaly DetectionCOVERAGEBalanced Accuracy0.635CAT-Net v2
Anomaly DetectionColumbiaAUC0.977CAT-Net v2
Anomaly DetectionColumbiaBalanced Accuracy0.803CAT-Net v2
Anomaly DetectionCocoGlideAUC0.667CAT-Net v2
Anomaly DetectionCocoGlideBalanced Accuracy0.58CAT-Net v2
Anomaly DetectionDSO-1AUC0.747CAT-Net v2
Anomaly DetectionDSO-1Balanced Accuracy0.525CAT-Net v2
Anomaly DetectionCasia V1+AUC0.942CAT-Net v2
Anomaly DetectionCasia V1+Balanced Accuracy0.838CAT-Net v2
Zero-Shot LearningCOVERAGEAUC0.68CAT-Net v2
Zero-Shot LearningCOVERAGEBalanced Accuracy0.635CAT-Net v2
Zero-Shot LearningColumbiaAUC0.977CAT-Net v2
Zero-Shot LearningColumbiaBalanced Accuracy0.803CAT-Net v2
Zero-Shot LearningCocoGlideAUC0.667CAT-Net v2
Zero-Shot LearningCocoGlideBalanced Accuracy0.58CAT-Net v2
Zero-Shot LearningDSO-1AUC0.747CAT-Net v2
Zero-Shot LearningDSO-1Balanced Accuracy0.525CAT-Net v2
Zero-Shot LearningCasia V1+AUC0.942CAT-Net v2
Zero-Shot LearningCasia V1+Balanced Accuracy0.838CAT-Net v2
Activity RecognitionCOVERAGEAUC0.68CAT-Net v2
Activity RecognitionCOVERAGEBalanced Accuracy0.635CAT-Net v2
Activity RecognitionColumbiaAUC0.977CAT-Net v2
Activity RecognitionColumbiaBalanced Accuracy0.803CAT-Net v2
Activity RecognitionCocoGlideAUC0.667CAT-Net v2
Activity RecognitionCocoGlideBalanced Accuracy0.58CAT-Net v2
Activity RecognitionDSO-1AUC0.747CAT-Net v2
Activity RecognitionDSO-1Balanced Accuracy0.525CAT-Net v2
Activity RecognitionCasia V1+AUC0.942CAT-Net v2
Activity RecognitionCasia V1+Balanced Accuracy0.838CAT-Net v2
Action LocalizationCOVERAGEAUC0.68CAT-Net v2
Action LocalizationCOVERAGEBalanced Accuracy0.635CAT-Net v2
Action LocalizationColumbiaAUC0.977CAT-Net v2
Action LocalizationColumbiaBalanced Accuracy0.803CAT-Net v2
Action LocalizationCocoGlideAUC0.667CAT-Net v2
Action LocalizationCocoGlideBalanced Accuracy0.58CAT-Net v2
Action LocalizationDSO-1AUC0.747CAT-Net v2
Action LocalizationDSO-1Balanced Accuracy0.525CAT-Net v2
Action LocalizationCasia V1+AUC0.942CAT-Net v2
Action LocalizationCasia V1+Balanced Accuracy0.838CAT-Net v2
3D Action RecognitionCOVERAGEAUC0.68CAT-Net v2
3D Action RecognitionCOVERAGEBalanced Accuracy0.635CAT-Net v2
3D Action RecognitionColumbiaAUC0.977CAT-Net v2
3D Action RecognitionColumbiaBalanced Accuracy0.803CAT-Net v2
3D Action RecognitionCocoGlideAUC0.667CAT-Net v2
3D Action RecognitionCocoGlideBalanced Accuracy0.58CAT-Net v2
3D Action RecognitionDSO-1AUC0.747CAT-Net v2
3D Action RecognitionDSO-1Balanced Accuracy0.525CAT-Net v2
3D Action RecognitionCasia V1+AUC0.942CAT-Net v2
3D Action RecognitionCasia V1+Balanced Accuracy0.838CAT-Net v2
Action RecognitionCOVERAGEAUC0.68CAT-Net v2
Action RecognitionCOVERAGEBalanced Accuracy0.635CAT-Net v2
Action RecognitionColumbiaAUC0.977CAT-Net v2
Action RecognitionColumbiaBalanced Accuracy0.803CAT-Net v2
Action RecognitionCocoGlideAUC0.667CAT-Net v2
Action RecognitionCocoGlideBalanced Accuracy0.58CAT-Net v2
Action RecognitionDSO-1AUC0.747CAT-Net v2
Action RecognitionDSO-1Balanced Accuracy0.525CAT-Net v2
Action RecognitionCasia V1+AUC0.942CAT-Net v2
Action RecognitionCasia V1+Balanced Accuracy0.838CAT-Net v2
Image Manipulation LocalizationColumbiaAverage Pixel F1(Fixed threshold)0.859CAT-Net v2
Image Manipulation LocalizationColumbia(Protocol-CAT)Pixel Binary F10.915CAT-Net
Image Manipulation LocalizationNIST16(Protocol-CAT)Pixel Binary F10.252CAT-Net
Image Manipulation LocalizationCASIAv1(Protoclo-CAT)Pixel Binary F10.808CAT-Net
Image Manipulation LocalizationCOVERAGEAverage Pixel F1(Fixed threshold)0.381CAT-Net v2
Image Manipulation LocalizationCOVERAGE(Protocol-CAT)Pixel Binary F10.427CAT-Net
Image Manipulation LocalizationCasia V1+Average Pixel F1(Fixed threshold)0.752CAT-Net v2
Image Manipulation LocalizationCocoGlideAverage Pixel F1(Fixed threshold)0.434CAT-Net v2
Image Manipulation LocalizationDSO-1Average Pixel F1(Fixed threshold)0.584CAT-Net v2

Related Papers

Beyond Fully Supervised Pixel Annotations: Scribble-Driven Weakly-Supervised Framework for Image Manipulation Localization2025-07-17Towards Reliable Identification of Diffusion-based Image Manipulations2025-06-05UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation2025-06-03Weakly-supervised Localization of Manipulated Image Regions Using Multi-resolution Learned Features2025-05-29RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs2025-05-22My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping2025-05-21Visual Agentic Reinforcement Fine-Tuning2025-05-20Emerging Properties in Unified Multimodal Pretraining2025-05-20