TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Exploring Temporal Coherence for More General Video Face F...

Exploring Temporal Coherence for More General Video Face Forgery Detection

Yinglin Zheng, Jianmin Bao, Dong Chen, Ming Zeng, Fang Wen

2021-08-15ICCV 2021 10DeepFake Detection
PaperPDFCode

Abstract

Although current face manipulation techniques achieve impressive performance regarding quality and controllability, they are struggling to generate temporal coherent face videos. In this work, we explore to take full advantage of the temporal coherence for video face forgery detection. To achieve this, we propose a novel end-to-end framework, which consists of two major stages. The first stage is a fully temporal convolution network (FTCN). The key insight of FTCN is to reduce the spatial convolution kernel size to 1, while maintaining the temporal convolution kernel size unchanged. We surprisingly find this special design can benefit the model for extracting the temporal features as well as improve the generalization capability. The second stage is a Temporal Transformer network, which aims to explore the long-term temporal coherence. The proposed framework is general and flexible, which can be directly trained from scratch without any pre-training models or external datasets. Extensive experiments show that our framework outperforms existing methods and remains effective when applied to detect new sorts of face forgery videos.

Results

TaskDatasetMetricValueModel
3D ReconstructionFakeAVCelebAP92.3FTCN
3D ReconstructionFakeAVCelebROC AUC93.1FTCN
3DFakeAVCelebAP92.3FTCN
3DFakeAVCelebROC AUC93.1FTCN
DeepFake DetectionFakeAVCelebAP92.3FTCN
DeepFake DetectionFakeAVCelebROC AUC93.1FTCN
3D Shape Reconstruction from VideosFakeAVCelebAP92.3FTCN
3D Shape Reconstruction from VideosFakeAVCelebROC AUC93.1FTCN

Related Papers

SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks2025-07-17DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios2025-06-29Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks2025-06-25IndieFake Dataset: A Benchmark Dataset for Audio Deepfake Detection2025-06-23SELFI: Selective Fusion of Identity for Generalizable Deepfake Detection2025-06-21FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes2025-06-13From Sharpness to Better Generalization for Speech Deepfake Detection2025-06-13LLMs Are Not Yet Ready for Deepfake Image Detection2025-06-12