Fabrizio Guillaro, Davide Cozzolino, Avneesh Sud, Nicholas Dufour, Luisa Verdoliva
In this paper we present TruFor, a forensic framework that can be applied to a large variety of image manipulation methods, from classic cheapfakes to more recent manipulations based on deep learning. We rely on the extraction of both high-level and low-level traces through a transformer-based fusion architecture that combines the RGB image and a learned noise-sensitive fingerprint. The latter learns to embed the artifacts related to the camera internal and external processing by training only on real data in a self-supervised manner. Forgeries are detected as deviations from the expected regular pattern that characterizes each pristine image. Looking for anomalies makes the approach able to robustly detect a variety of local manipulations, ensuring generalization. In addition to a pixel-level localization map and a whole-image integrity score, our approach outputs a reliability map that highlights areas where localization predictions may be error-prone. This is particularly important in forensic applications in order to reduce false alarms and allow for a large scale analysis. Extensive experiments on several datasets show that our method is able to reliably detect and localize both cheapfakes and deepfakes manipulations outperforming state-of-the-art works. Code is publicly available at https://grip-unina.github.io/TruFor/
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Manipulation Detection | COVERAGE | AUC | 0.77 | TruFor |
| Image Manipulation Detection | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Image Manipulation Detection | Columbia | AUC | 0.996 | TruFor |
| Image Manipulation Detection | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Image Manipulation Detection | CocoGlide | AUC | 0.752 | TruFor |
| Image Manipulation Detection | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Image Manipulation Detection | DSO-1 | AUC | 0.984 | TruFor |
| Image Manipulation Detection | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Image Manipulation Detection | Casia V1+ | AUC | 0.916 | TruFor |
| Image Manipulation Detection | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Video | COVERAGE | AUC | 0.77 | TruFor |
| Video | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Video | Columbia | AUC | 0.996 | TruFor |
| Video | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Video | CocoGlide | AUC | 0.752 | TruFor |
| Video | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Video | DSO-1 | AUC | 0.984 | TruFor |
| Video | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Video | Casia V1+ | AUC | 0.916 | TruFor |
| Video | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Temporal Action Localization | COVERAGE | AUC | 0.77 | TruFor |
| Temporal Action Localization | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Temporal Action Localization | Columbia | AUC | 0.996 | TruFor |
| Temporal Action Localization | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Temporal Action Localization | CocoGlide | AUC | 0.752 | TruFor |
| Temporal Action Localization | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Temporal Action Localization | DSO-1 | AUC | 0.984 | TruFor |
| Temporal Action Localization | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Temporal Action Localization | Casia V1+ | AUC | 0.916 | TruFor |
| Temporal Action Localization | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Anomaly Detection | COVERAGE | AUC | 0.77 | TruFor |
| Anomaly Detection | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Anomaly Detection | Columbia | AUC | 0.996 | TruFor |
| Anomaly Detection | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Anomaly Detection | CocoGlide | AUC | 0.752 | TruFor |
| Anomaly Detection | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Anomaly Detection | DSO-1 | AUC | 0.984 | TruFor |
| Anomaly Detection | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Anomaly Detection | Casia V1+ | AUC | 0.916 | TruFor |
| Anomaly Detection | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Zero-Shot Learning | COVERAGE | AUC | 0.77 | TruFor |
| Zero-Shot Learning | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Zero-Shot Learning | Columbia | AUC | 0.996 | TruFor |
| Zero-Shot Learning | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Zero-Shot Learning | CocoGlide | AUC | 0.752 | TruFor |
| Zero-Shot Learning | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Zero-Shot Learning | DSO-1 | AUC | 0.984 | TruFor |
| Zero-Shot Learning | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Zero-Shot Learning | Casia V1+ | AUC | 0.916 | TruFor |
| Zero-Shot Learning | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Activity Recognition | COVERAGE | AUC | 0.77 | TruFor |
| Activity Recognition | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Activity Recognition | Columbia | AUC | 0.996 | TruFor |
| Activity Recognition | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Activity Recognition | CocoGlide | AUC | 0.752 | TruFor |
| Activity Recognition | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Activity Recognition | DSO-1 | AUC | 0.984 | TruFor |
| Activity Recognition | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Activity Recognition | Casia V1+ | AUC | 0.916 | TruFor |
| Activity Recognition | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Action Localization | COVERAGE | AUC | 0.77 | TruFor |
| Action Localization | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Action Localization | Columbia | AUC | 0.996 | TruFor |
| Action Localization | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Action Localization | CocoGlide | AUC | 0.752 | TruFor |
| Action Localization | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Action Localization | DSO-1 | AUC | 0.984 | TruFor |
| Action Localization | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Action Localization | Casia V1+ | AUC | 0.916 | TruFor |
| Action Localization | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| 3D Action Recognition | COVERAGE | AUC | 0.77 | TruFor |
| 3D Action Recognition | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| 3D Action Recognition | Columbia | AUC | 0.996 | TruFor |
| 3D Action Recognition | Columbia | Balanced Accuracy | 0.984 | TruFor |
| 3D Action Recognition | CocoGlide | AUC | 0.752 | TruFor |
| 3D Action Recognition | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| 3D Action Recognition | DSO-1 | AUC | 0.984 | TruFor |
| 3D Action Recognition | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| 3D Action Recognition | Casia V1+ | AUC | 0.916 | TruFor |
| 3D Action Recognition | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Action Recognition | COVERAGE | AUC | 0.77 | TruFor |
| Action Recognition | COVERAGE | Balanced Accuracy | 0.68 | TruFor |
| Action Recognition | Columbia | AUC | 0.996 | TruFor |
| Action Recognition | Columbia | Balanced Accuracy | 0.984 | TruFor |
| Action Recognition | CocoGlide | AUC | 0.752 | TruFor |
| Action Recognition | CocoGlide | Balanced Accuracy | 0.639 | TruFor |
| Action Recognition | DSO-1 | AUC | 0.984 | TruFor |
| Action Recognition | DSO-1 | Balanced Accuracy | 0.93 | TruFor |
| Action Recognition | Casia V1+ | AUC | 0.916 | TruFor |
| Action Recognition | Casia V1+ | Balanced Accuracy | 0.813 | TruFor |
| Image Manipulation Localization | Columbia | Average Pixel F1(Fixed threshold) | 0.859 | TruFor |
| Image Manipulation Localization | Columbia(Protocol-CAT) | Pixel Binary F1 | 0.885 | Trufor |
| Image Manipulation Localization | NIST16(Protocol-CAT) | Pixel Binary F1 | 0.348 | Trufor |
| Image Manipulation Localization | CASIAv1(Protoclo-CAT) | Pixel Binary F1 | 0.818 | Trufor |
| Image Manipulation Localization | COVERAGE | Average Pixel F1(Fixed threshold) | 0.6 | TruFor |
| Image Manipulation Localization | COVERAGE(Protocol-CAT) | Pixel Binary F1 | 0.457 | Trufor |
| Image Manipulation Localization | Casia V1+ | Average Pixel F1(Fixed threshold) | 0.737 | TruFor |
| Image Manipulation Localization | CocoGlide | Average Pixel F1(Fixed threshold) | 0.523 | TruFor |
| Image Manipulation Localization | DSO-1 | Average Pixel F1(Fixed threshold) | 0.93 | TruFor |