TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Leveraging Real Talking Faces via Self-Supervision for Rob...

Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection

Alexandros Haliassos, Rodrigo Mira, Stavros Petridis, Maja Pantic

2022-01-18CVPR 2022 1DeepFake Detection
PaperPDFCode(official)

Abstract

One of the most pressing challenges for the detection of face-manipulated videos is generalising to forgery methods not seen during training while remaining effective under common corruptions such as compression. In this paper, we examine whether we can tackle this issue by harnessing videos of real talking faces, which contain rich information on natural facial appearance and behaviour and are readily available in large quantities online. Our method, termed RealForensics, consists of two stages. First, we exploit the natural correspondence between the visual and auditory modalities in real videos to learn, in a self-supervised cross-modal manner, temporally dense video representations that capture factors such as facial movements, expression, and identity. Second, we use these learned representations as targets to be predicted by our forgery detector along with the usual binary forgery classification task; this encourages it to base its real/fake decision on said factors. We show that our method achieves state-of-the-art performance on cross-manipulation generalisation and robustness experiments, and examine the factors that contribute to its performance. Our results suggest that leveraging natural and unlabelled videos is a promising direction for the development of more robust face forgery detectors.

Results

TaskDatasetMetricValueModel
3D ReconstructionFakeAVCelebAP95.3RealForensics
3D ReconstructionFakeAVCelebROC AUC97.1RealForensics
3D ReconstructionFakeAVCelebAP73.9AVBYOL
3D ReconstructionFakeAVCelebROC AUC59.2AVBYOL
3DFakeAVCelebAP95.3RealForensics
3DFakeAVCelebROC AUC97.1RealForensics
3DFakeAVCelebAP73.9AVBYOL
3DFakeAVCelebROC AUC59.2AVBYOL
DeepFake DetectionFakeAVCelebAP95.3RealForensics
DeepFake DetectionFakeAVCelebROC AUC97.1RealForensics
DeepFake DetectionFakeAVCelebAP73.9AVBYOL
DeepFake DetectionFakeAVCelebROC AUC59.2AVBYOL
3D Shape Reconstruction from VideosFakeAVCelebAP95.3RealForensics
3D Shape Reconstruction from VideosFakeAVCelebROC AUC97.1RealForensics
3D Shape Reconstruction from VideosFakeAVCelebAP73.9AVBYOL
3D Shape Reconstruction from VideosFakeAVCelebROC AUC59.2AVBYOL

Related Papers

SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks2025-07-17DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios2025-06-29Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks2025-06-25IndieFake Dataset: A Benchmark Dataset for Audio Deepfake Detection2025-06-23SELFI: Selective Fusion of Identity for Generalizable Deepfake Detection2025-06-21FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes2025-06-13From Sharpness to Better Generalization for Speech Deepfake Detection2025-06-13LLMs Are Not Yet Ready for Deepfake Image Detection2025-06-12