TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Image Manipulation Detection by Multi-View Multi-Scale Sup...

Image Manipulation Detection by Multi-View Multi-Scale Supervision

Xinru Chen, Chengbo Dong, Jiaqi Ji, Juan Cao, Xirong Li

2021-04-14ICCV 2021 10Semantic SegmentationImage Manipulation LocalizationSpecificityImage ManipulationImage Manipulation Detection
PaperPDFCode(official)Code

Abstract

The key challenge of image manipulation detection is how to learn generalizable features that are sensitive to manipulations in novel data, whilst specific to prevent false alarms on authentic images. Current research emphasizes the sensitivity, with the specificity overlooked. In this paper we address both aspects by multi-view feature learning and multi-scale supervision. By exploiting noise distribution and boundary artifact surrounding tampered regions, the former aims to learn semantic-agnostic and thus more generalizable features. The latter allows us to learn from authentic images which are nontrivial to be taken into account by current semantic segmentation network based methods. Our thoughts are realized by a new network which we term MVSS-Net. Extensive experiments on five benchmark sets justify the viability of MVSS-Net for both pixel-level and image-level manipulation detection.

Results

TaskDatasetMetricValueModel
Image Manipulation DetectionCOVERAGEAUC0.733MVSS-Net
Image Manipulation DetectionCOVERAGEBalanced Accuracy0.514MVSS-Net
Image Manipulation DetectionColumbiaAUC0.984MVSS-Net
Image Manipulation DetectionColumbiaBalanced Accuracy0.729MVSS-Net
Image Manipulation DetectionCocoGlideAUC0.654MVSS-Net
Image Manipulation DetectionCocoGlideBalanced Accuracy0.117MVSS-Net
Image Manipulation DetectionDSO-1AUC0.552MVSS-Net
Image Manipulation DetectionDSO-1Balanced Accuracy0.358MVSS-Net
Image Manipulation DetectionCasia V1+AUC0.932MVSS-Net
Image Manipulation DetectionCasia V1+Balanced Accuracy0.528MVSS-Net
VideoCOVERAGEAUC0.733MVSS-Net
VideoCOVERAGEBalanced Accuracy0.514MVSS-Net
VideoColumbiaAUC0.984MVSS-Net
VideoColumbiaBalanced Accuracy0.729MVSS-Net
VideoCocoGlideAUC0.654MVSS-Net
VideoCocoGlideBalanced Accuracy0.117MVSS-Net
VideoDSO-1AUC0.552MVSS-Net
VideoDSO-1Balanced Accuracy0.358MVSS-Net
VideoCasia V1+AUC0.932MVSS-Net
VideoCasia V1+Balanced Accuracy0.528MVSS-Net
Temporal Action LocalizationCOVERAGEAUC0.733MVSS-Net
Temporal Action LocalizationCOVERAGEBalanced Accuracy0.514MVSS-Net
Temporal Action LocalizationColumbiaAUC0.984MVSS-Net
Temporal Action LocalizationColumbiaBalanced Accuracy0.729MVSS-Net
Temporal Action LocalizationCocoGlideAUC0.654MVSS-Net
Temporal Action LocalizationCocoGlideBalanced Accuracy0.117MVSS-Net
Temporal Action LocalizationDSO-1AUC0.552MVSS-Net
Temporal Action LocalizationDSO-1Balanced Accuracy0.358MVSS-Net
Temporal Action LocalizationCasia V1+AUC0.932MVSS-Net
Temporal Action LocalizationCasia V1+Balanced Accuracy0.528MVSS-Net
Anomaly DetectionCOVERAGEAUC0.733MVSS-Net
Anomaly DetectionCOVERAGEBalanced Accuracy0.514MVSS-Net
Anomaly DetectionColumbiaAUC0.984MVSS-Net
Anomaly DetectionColumbiaBalanced Accuracy0.729MVSS-Net
Anomaly DetectionCocoGlideAUC0.654MVSS-Net
Anomaly DetectionCocoGlideBalanced Accuracy0.117MVSS-Net
Anomaly DetectionDSO-1AUC0.552MVSS-Net
Anomaly DetectionDSO-1Balanced Accuracy0.358MVSS-Net
Anomaly DetectionCasia V1+AUC0.932MVSS-Net
Anomaly DetectionCasia V1+Balanced Accuracy0.528MVSS-Net
Zero-Shot LearningCOVERAGEAUC0.733MVSS-Net
Zero-Shot LearningCOVERAGEBalanced Accuracy0.514MVSS-Net
Zero-Shot LearningColumbiaAUC0.984MVSS-Net
Zero-Shot LearningColumbiaBalanced Accuracy0.729MVSS-Net
Zero-Shot LearningCocoGlideAUC0.654MVSS-Net
Zero-Shot LearningCocoGlideBalanced Accuracy0.117MVSS-Net
Zero-Shot LearningDSO-1AUC0.552MVSS-Net
Zero-Shot LearningDSO-1Balanced Accuracy0.358MVSS-Net
Zero-Shot LearningCasia V1+AUC0.932MVSS-Net
Zero-Shot LearningCasia V1+Balanced Accuracy0.528MVSS-Net
Activity RecognitionCOVERAGEAUC0.733MVSS-Net
Activity RecognitionCOVERAGEBalanced Accuracy0.514MVSS-Net
Activity RecognitionColumbiaAUC0.984MVSS-Net
Activity RecognitionColumbiaBalanced Accuracy0.729MVSS-Net
Activity RecognitionCocoGlideAUC0.654MVSS-Net
Activity RecognitionCocoGlideBalanced Accuracy0.117MVSS-Net
Activity RecognitionDSO-1AUC0.552MVSS-Net
Activity RecognitionDSO-1Balanced Accuracy0.358MVSS-Net
Activity RecognitionCasia V1+AUC0.932MVSS-Net
Activity RecognitionCasia V1+Balanced Accuracy0.528MVSS-Net
Action LocalizationCOVERAGEAUC0.733MVSS-Net
Action LocalizationCOVERAGEBalanced Accuracy0.514MVSS-Net
Action LocalizationColumbiaAUC0.984MVSS-Net
Action LocalizationColumbiaBalanced Accuracy0.729MVSS-Net
Action LocalizationCocoGlideAUC0.654MVSS-Net
Action LocalizationCocoGlideBalanced Accuracy0.117MVSS-Net
Action LocalizationDSO-1AUC0.552MVSS-Net
Action LocalizationDSO-1Balanced Accuracy0.358MVSS-Net
Action LocalizationCasia V1+AUC0.932MVSS-Net
Action LocalizationCasia V1+Balanced Accuracy0.528MVSS-Net
3D Action RecognitionCOVERAGEAUC0.733MVSS-Net
3D Action RecognitionCOVERAGEBalanced Accuracy0.514MVSS-Net
3D Action RecognitionColumbiaAUC0.984MVSS-Net
3D Action RecognitionColumbiaBalanced Accuracy0.729MVSS-Net
3D Action RecognitionCocoGlideAUC0.654MVSS-Net
3D Action RecognitionCocoGlideBalanced Accuracy0.117MVSS-Net
3D Action RecognitionDSO-1AUC0.552MVSS-Net
3D Action RecognitionDSO-1Balanced Accuracy0.358MVSS-Net
3D Action RecognitionCasia V1+AUC0.932MVSS-Net
3D Action RecognitionCasia V1+Balanced Accuracy0.528MVSS-Net
Action RecognitionCOVERAGEAUC0.733MVSS-Net
Action RecognitionCOVERAGEBalanced Accuracy0.514MVSS-Net
Action RecognitionColumbiaAUC0.984MVSS-Net
Action RecognitionColumbiaBalanced Accuracy0.729MVSS-Net
Action RecognitionCocoGlideAUC0.654MVSS-Net
Action RecognitionCocoGlideBalanced Accuracy0.117MVSS-Net
Action RecognitionDSO-1AUC0.552MVSS-Net
Action RecognitionDSO-1Balanced Accuracy0.358MVSS-Net
Action RecognitionCasia V1+AUC0.932MVSS-Net
Action RecognitionCasia V1+Balanced Accuracy0.528MVSS-Net
Image Manipulation LocalizationColumbiaAverage Pixel F1(Fixed threshold)0.729MVSS-Net
Image Manipulation LocalizationCOVERAGEAverage Pixel F1(Fixed threshold)0.514MVSS-Net
Image Manipulation LocalizationCasia V1+Average Pixel F1(Fixed threshold)0.528MVSS-Net
Image Manipulation LocalizationCocoGlideAverage Pixel F1(Fixed threshold)0.486MVSS-Net
Image Manipulation LocalizationDSO-1Average Pixel F1(Fixed threshold)0.358MVSS-Net

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Beyond Fully Supervised Pixel Annotations: Scribble-Driven Weakly-Supervised Framework for Image Manipulation Localization2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15