TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Video Face Manipulation Detection Through Ensemble of CNNs

Video Face Manipulation Detection Through Ensemble of CNNs

Nicolò Bonettini, Edoardo Daniele Cannas, Sara Mandelli, Luca Bondi, Paolo Bestagini, Stefano Tubaro

2020-04-16Detecting Image ManipulationDeepFake DetectionLocalization In Video ForgeryImage Manipulation DetectionVideo ForensicsFake Image Detection
PaperPDFCode(official)CodeCode

Abstract

In the last few years, several techniques for facial manipulation in videos have been successfully developed and made available to the masses (i.e., FaceSwap, deepfake, etc.). These methods enable anyone to easily edit faces in video sequences with incredibly realistic results and a very little effort. Despite the usefulness of these tools in many fields, if used maliciously, they can have a significantly bad impact on society (e.g., fake news spreading, cyber bullying through fake revenge porn). The ability of objectively detecting whether a face has been manipulated in a video sequence is then a task of utmost importance. In this paper, we tackle the problem of face manipulation detection in video sequences targeting modern facial manipulation techniques. In particular, we study the ensembling of different trained Convolutional Neural Network (CNN) models. In the proposed solution, different models are obtained starting from a base network (i.e., EfficientNetB4) making use of two different concepts: (i) attention layers; (ii) siamese training. We show that combining these networks leads to promising face manipulation detection results on two publicly available datasets with more than 119000 videos.

Results

TaskDatasetMetricValueModel
3D ReconstructionDFDCLogLoss0.464EfficientNetB4 + EfficientNetB4ST + B4Att
3D ReconstructionFaceForensics++AUC0.9444EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
3D ReconstructionFaceForensics++LogLoss0.3269EfficientNetB4 + EfficientNetB4ST + B4AttST
3DDFDCLogLoss0.464EfficientNetB4 + EfficientNetB4ST + B4Att
3DFaceForensics++AUC0.9444EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
3DFaceForensics++LogLoss0.3269EfficientNetB4 + EfficientNetB4ST + B4AttST
DeepFake DetectionDFDCLogLoss0.464EfficientNetB4 + EfficientNetB4ST + B4Att
DeepFake DetectionFaceForensics++AUC0.9444EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
DeepFake DetectionFaceForensics++LogLoss0.3269EfficientNetB4 + EfficientNetB4ST + B4AttST
3D Shape Reconstruction from VideosDFDCLogLoss0.464EfficientNetB4 + EfficientNetB4ST + B4Att
3D Shape Reconstruction from VideosFaceForensics++AUC0.9444EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
3D Shape Reconstruction from VideosFaceForensics++LogLoss0.3269EfficientNetB4 + EfficientNetB4ST + B4AttST

Related Papers

SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks2025-07-17DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios2025-06-29Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks2025-06-25IndieFake Dataset: A Benchmark Dataset for Audio Deepfake Detection2025-06-23SELFI: Selective Fusion of Identity for Generalizable Deepfake Detection2025-06-21FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes2025-06-13From Sharpness to Better Generalization for Speech Deepfake Detection2025-06-13LLMs Are Not Yet Ready for Deepfake Image Detection2025-06-12