Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection

Alexandros Haliassos, Rodrigo Mira, Stavros Petridis, Maja Pantic

2022-01-18 · CVPR 2022 · DeepFake Detection

Abstract

One of the most pressing challenges for the detection of face-manipulated videos is generalising to forgery methods not seen during training while remaining effective under common corruptions such as compression. In this paper, we examine whether we can tackle this issue by harnessing videos of real talking faces, which contain rich information on natural facial appearance and behaviour and are readily available in large quantities online. Our method, termed RealForensics, consists of two stages. First, we exploit the natural correspondence between the visual and auditory modalities in real videos to learn, in a self-supervised cross-modal manner, temporally dense video representations that capture factors such as facial movements, expression, and identity. Second, we use these learned representations as targets to be predicted by our forgery detector along with the usual binary forgery classification task; this encourages it to base its real/fake decision on said factors. We show that our method achieves state-of-the-art performance on cross-manipulation generalisation and robustness experiments, and examine the factors that contribute to its performance. Our results suggest that leveraging natural and unlabelled videos is a promising direction for the development of more robust face forgery detectors.
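The second stage described above trains the detector on two objectives at once: the usual real/fake classification and the prediction of the representations learned in the first stage. A minimal sketch of such a combined objective follows. The abstract does not specify the loss functions, the weighting, or which samples receive the auxiliary loss, so the cosine-distance form, the `aux_weight` parameter, and all function names here are illustrative assumptions rather than the authors' implementation.

```python
import math


def binary_cross_entropy(logit, label):
    """Numerically stable binary cross-entropy for a single logit.

    label is 1.0 for fake and 0.0 for real (an assumed convention).
    """
    return max(logit, 0.0) - logit * label + math.log1p(math.exp(-abs(logit)))


def cosine_distance(pred, target):
    """1 - cosine similarity between two non-zero vectors."""
    dot = sum(p * t for p, t in zip(pred, target))
    norm_pred = math.sqrt(sum(p * p for p in pred))
    norm_target = math.sqrt(sum(t * t for t in target))
    return 1.0 - dot / (norm_pred * norm_target)


def combined_loss(logit, label, pred_frames, target_frames, aux_weight=1.0):
    """Classification loss plus a per-frame representation-prediction loss.

    pred_frames:   the detector's predicted representation for each frame.
    target_frames: stage-one representations for the same frames, treated
                   as fixed targets.
    aux_weight:    hypothetical weight balancing the two terms.
    """
    classification = binary_cross_entropy(logit, label)
    # Average the distance over frames, since the targets are temporally dense.
    auxiliary = sum(
        cosine_distance(p, t) for p, t in zip(pred_frames, target_frames)
    ) / len(target_frames)
    return classification + aux_weight * auxiliary


if __name__ == "__main__":
    # Toy clip with two frames and 2-D representations.
    preds = [[1.0, 0.0], [0.0, 1.0]]
    targets = [[1.0, 0.0], [1.0, 0.0]]
    print(combined_loss(0.0, 1.0, preds, targets, aux_weight=0.5))
```

In this sketch the auxiliary term is zero when every predicted frame representation points in the same direction as its target, so the detector is pushed to encode the same factors (movement, expression, identity) that the stage-one representations capture while still solving the binary task.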

Results

Task	Dataset	Metric	Value	Model
DeepFake Detection	FakeAVCeleb	AP	95.3	RealForensics
DeepFake Detection	FakeAVCeleb	ROC AUC	97.1	RealForensics
DeepFake Detection	FakeAVCeleb	AP	73.9	AVBYOL
DeepFake Detection	FakeAVCeleb	ROC AUC	59.2	AVBYOL
