Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

MMFusion: Combining Image Forensic Filters for Visual Manipulation Detection and Localization

Kostas Triaridis, Konstantinos Tsigos, Vasileios Mezaris

2023-12-04
Detecting Image Manipulation, Image Forgery Detection, Image Forensics, Image Manipulation Localization, Image Manipulation, Image Manipulation Detection

Abstract

Recent image manipulation localization and detection techniques typically leverage forensic artifacts and traces that are produced by a noise-sensitive filter, such as SRM or Bayar convolution. In this paper, we showcase that different filters commonly used in such approaches excel at unveiling different types of manipulations and provide complementary forensic traces. Thus, we explore ways of combining the outputs of such filters to leverage the complementary nature of the produced artifacts for performing image manipulation localization and detection (IMLD). We assess two distinct combination methods: one that produces independent features from each forensic filter and then fuses them (this is referred to as late fusion) and one that performs early mixing of different modal outputs and produces combined features (this is referred to as early fusion). We use the latter as a feature encoding mechanism, accompanied by a new decoding mechanism that encompasses feature re-weighting, for formulating the proposed MMFusion architecture. We demonstrate that MMFusion achieves competitive performance for both image manipulation localization and detection, outperforming state-of-the-art models across several image and video datasets. We also investigate further the contribution of each forensic filter within MMFusion for addressing different types of manipulations, building on recent AI explainability measures.
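
The difference between the two combination strategies described in the abstract can be illustrated with a toy sketch. This is not the MMFusion architecture: the `encode` function (element-wise scaling followed by ReLU), the averaging step, and all weights are hypothetical stand-ins for the learned neural encoders and fusion layers. It only shows the order of operations: late fusion encodes each filter output independently and then fuses the features, while early fusion mixes the filter outputs first and encodes the mixture once.

```python
from typing import List

Vector = List[float]


def encode(x: Vector, weights: Vector) -> Vector:
    """Toy encoder (assumption): element-wise scaling followed by ReLU."""
    return [max(0.0, w * v) for w, v in zip(weights, x)]


def late_fusion(filter_outputs: List[Vector], encoder_weights: List[Vector]) -> Vector:
    """Encode each filter output with its own encoder, then average the features."""
    feats = [encode(x, w) for x, w in zip(filter_outputs, encoder_weights)]
    n = len(feats)
    return [sum(col) / n for col in zip(*feats)]


def early_fusion(filter_outputs: List[Vector], mix_weights: Vector,
                 encoder_weights: Vector) -> Vector:
    """Mix the filter outputs position by position, then encode the mixture once."""
    mixed = [sum(m * v for m, v in zip(mix_weights, col))
             for col in zip(*filter_outputs)]
    return encode(mixed, encoder_weights)


# Two hypothetical filter responses (e.g. one per forensic filter) at two positions.
outputs = [[1.0, -2.0], [-3.0, 4.0]]

late = late_fusion(outputs, encoder_weights=[[1.0, 1.0], [1.0, 1.0]])
early = early_fusion(outputs, mix_weights=[0.5, 0.5], encoder_weights=[1.0, 1.0])
print(late, early)
```

Because the encoder is non-linear, the two orderings generally produce different features even with identical weights, which is why the choice between them is an empirical question.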

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Manipulation Detection | COVERAGE | AUC | 0.839 | Early Fusion |
| Image Manipulation Detection | COVERAGE | Balanced Accuracy | 0.77 | Early Fusion |
| Image Manipulation Detection | COVERAGE | AUC | 0.792 | Late Fusion |
| Image Manipulation Detection | COVERAGE | Balanced Accuracy | 0.72 | Late Fusion |
| Image Manipulation Detection | Columbia | AUC | 0.996 | Early Fusion |
| Image Manipulation Detection | Columbia | Balanced Accuracy | 0.962 | Early Fusion |
| Image Manipulation Detection | Columbia | AUC | 0.977 | Late Fusion |
| Image Manipulation Detection | Columbia | Balanced Accuracy | 0.822 | Late Fusion |
| Image Manipulation Detection | CocoGlide | AUC | 0.76 | Late Fusion |
| Image Manipulation Detection | CocoGlide | Balanced Accuracy | 0.677 | Late Fusion |
| Image Manipulation Detection | CocoGlide | AUC | 0.755 | Early Fusion |
| Image Manipulation Detection | CocoGlide | Balanced Accuracy | 0.66 | Early Fusion |
| Image Manipulation Detection | DSO-1 | AUC | 0.966 | Early Fusion |
| Image Manipulation Detection | DSO-1 | Balanced Accuracy | 0.935 | Early Fusion |
| Image Manipulation Detection | DSO-1 | AUC | 0.958 | Late Fusion |
| Image Manipulation Detection | DSO-1 | Balanced Accuracy | 0.83 | Late Fusion |
| Image Manipulation Detection | Casia V1+ | AUC | 0.93 | Late Fusion |
| Image Manipulation Detection | Casia V1+ | Balanced Accuracy | 0.86 | Late Fusion |
| Image Manipulation Detection | Casia V1+ | AUC | 0.929 | Early Fusion |
| Image Manipulation Detection | Casia V1+ | Balanced Accuracy | 0.845 | Early Fusion |

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Manipulation Localization | Columbia | Average Pixel F1 (Fixed threshold) | 0.888 | Early Fusion |
| Image Manipulation Localization | Columbia | Average Pixel F1 (Fixed threshold) | 0.864 | Late Fusion |
| Image Manipulation Localization | COVERAGE | Average Pixel F1 (Fixed threshold) | 0.663 | Early Fusion |
| Image Manipulation Localization | COVERAGE | Average Pixel F1 (Fixed threshold) | 0.641 | Late Fusion |
| Image Manipulation Localization | Casia V1+ | Average Pixel F1 (Fixed threshold) | 0.784 | Early Fusion |
| Image Manipulation Localization | Casia V1+ | Average Pixel F1 (Fixed threshold) | 0.775 | Late Fusion |
| Image Manipulation Localization | CocoGlide | Average Pixel F1 (Fixed threshold) | 0.574 | Late Fusion |
| Image Manipulation Localization | CocoGlide | Average Pixel F1 (Fixed threshold) | 0.553 | Early Fusion |
| Image Manipulation Localization | DSO-1 | Average Pixel F1 (Fixed threshold) | 0.899 | Late Fusion |
| Image Manipulation Localization | DSO-1 | Average Pixel F1 (Fixed threshold) | 0.869 | Early Fusion |
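
For readers unfamiliar with the metrics reported above, the following is a minimal sketch of how AUC, balanced accuracy, and average pixel F1 at a fixed threshold are commonly computed. It is not the authors' evaluation code. The threshold value of 0.5, the function names, and the convention of returning 0.0 when an image has no positive pixels in either mask are assumptions made for illustration; the page does not state them.

```python
from typing import List, Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of the true positive rate and the true negative rate."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    n_pos = sum(1 for t in y_true if t == 1)
    n_neg = len(y_true) - n_pos
    return 0.5 * (tp / n_pos + tn / n_neg)


def pixel_f1(mask: List[List[int]], prob_map: List[List[float]],
             threshold: float = 0.5) -> float:
    """Pixel-level F1 for one image after binarizing at a fixed threshold (0.5 assumed)."""
    tp = fp = fn = 0
    for row_t, row_p in zip(mask, prob_map):
        for t, p in zip(row_t, row_p):
            pred = 1 if p >= threshold else 0
            tp += int(t == 1 and pred == 1)
            fp += int(t == 0 and pred == 1)
            fn += int(t == 1 and pred == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0  # empty-mask convention is an assumption


def average_pixel_f1(masks, prob_maps, threshold: float = 0.5) -> float:
    """Mean of the per-image pixel F1 scores."""
    scores = [pixel_f1(m, p, threshold) for m, p in zip(masks, prob_maps)]
    return sum(scores) / len(scores)


# Toy image-level example: two manipulated images (1) and two authentic ones (0).
labels = [1, 1, 0, 0]
det_scores = [0.9, 0.4, 0.6, 0.1]
auc = roc_auc(labels, det_scores)
bal_acc = balanced_accuracy(labels, [1 if s >= 0.5 else 0 for s in det_scores])

# Toy pixel-level example: one 2x2 ground-truth mask and predicted probability map.
f1 = pixel_f1([[1, 0], [1, 1]], [[0.9, 0.8], [0.2, 0.7]])
print(auc, bal_acc, f1)
```

AUC is threshold-free, whereas balanced accuracy and fixed-threshold pixel F1 depend on the chosen cut-off, so comparisons across models are only meaningful when the same threshold is used.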

Related Papers

- Beyond Fully Supervised Pixel Annotations: Scribble-Driven Weakly-Supervised Framework for Image Manipulation Localization (2025-07-17)
- SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection (2025-06-25)
- Towards Reliable Identification of Diffusion-based Image Manipulations (2025-06-05)
- UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation (2025-06-03)
- Weakly-supervised Localization of Manipulated Image Regions Using Multi-resolution Learned Features (2025-05-29)
- So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection (2025-05-24)
- Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces (2025-05-22)
- Can ChatGPT Perform Image Splicing Detection? A Preliminary Study (2025-05-22)