Video Face Manipulation Detection Through Ensemble of CNNs

Nicolò Bonettini, Edoardo Daniele Cannas, Sara Mandelli, Luca Bondi, Paolo Bestagini, Stefano Tubaro

2020-04-16Detecting Image Manipulation DeepFake Detection Localization In Video Forgery Image Manipulation Detection Video Forensics Fake Image Detection

Paper PDF Code(official)Code Code

Abstract

In the last few years, several techniques for facial manipulation in videos have been successfully developed and made available to the masses (i.e., FaceSwap, deepfake, etc.). These methods enable anyone to easily edit faces in video sequences with incredibly realistic results and a very little effort. Despite the usefulness of these tools in many fields, if used maliciously, they can have a significantly bad impact on society (e.g., fake news spreading, cyber bullying through fake revenge porn). The ability of objectively detecting whether a face has been manipulated in a video sequence is then a task of utmost importance. In this paper, we tackle the problem of face manipulation detection in video sequences targeting modern facial manipulation techniques. In particular, we study the ensembling of different trained Convolutional Neural Network (CNN) models. In the proposed solution, different models are obtained starting from a base network (i.e., EfficientNetB4) making use of two different concepts: (i) attention layers; (ii) siamese training. We show that combining these networks leads to promising face manipulation detection results on two publicly available datasets with more than 119000 videos.

Results

Task	Dataset	Metric	Value	Model
3D Reconstruction	DFDC	LogLoss	0.464	EfficientNetB4 + EfficientNetB4ST + B4Att
3D Reconstruction	FaceForensics++	AUC	0.9444	EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
3D Reconstruction	FaceForensics++	LogLoss	0.3269	EfficientNetB4 + EfficientNetB4ST + B4AttST
3D	DFDC	LogLoss	0.464	EfficientNetB4 + EfficientNetB4ST + B4Att
3D	FaceForensics++	AUC	0.9444	EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
3D	FaceForensics++	LogLoss	0.3269	EfficientNetB4 + EfficientNetB4ST + B4AttST
DeepFake Detection	DFDC	LogLoss	0.464	EfficientNetB4 + EfficientNetB4ST + B4Att
DeepFake Detection	FaceForensics++	AUC	0.9444	EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
DeepFake Detection	FaceForensics++	LogLoss	0.3269	EfficientNetB4 + EfficientNetB4ST + B4AttST
3D Shape Reconstruction from Videos	DFDC	LogLoss	0.464	EfficientNetB4 + EfficientNetB4ST + B4Att
3D Shape Reconstruction from Videos	FaceForensics++	AUC	0.9444	EfficientNetB4 + EfficientNetB4ST + B4Att + B4AttST
3D Shape Reconstruction from Videos	FaceForensics++	LogLoss	0.3269	EfficientNetB4 + EfficientNetB4ST + B4AttST

Video Face Manipulation Detection Through Ensemble of CNNs

Abstract

Results

Related Papers

Video Face Manipulation Detection Through Ensemble of CNNs

Abstract

Results

Related Papers