On the Generalization of Deep Learning Models in Video Deepfake Detection

Cited by: 6
Authors
Coccomini, Davide Alessandro [1]
Caldelli, Roberto [2, 3]
Falchi, Fabrizio [1]
Gennaro, Claudio [1]
Affiliations
[1] Ist Sci & Tecnol Informaz, I-56124 Pisa, Italy
[2] Natl Interuniv Consortium Telecommun CNIT, I-50134 Florence, Italy
[3] Univ Mercatorum, Fac Econ, I-00186 Rome, Italy
Keywords
deepfake detection; deep learning; computer vision; generalization; IMAGE;
DOI
10.3390/jimaging9050089
Chinese Library Classification number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
The increasing use of deep learning techniques to manipulate images and videos, commonly referred to as "deepfakes", is making it more challenging to differentiate between real and fake content. While various deepfake detection systems have been developed, they often struggle to detect deepfakes in real-world situations. In particular, these methods are often unable to effectively distinguish images or videos that have been modified using novel techniques not represented in the training set. In this study, we analyse different deep learning architectures in an attempt to understand which is most capable of generalizing the concept of deepfake. According to our results, Convolutional Neural Networks (CNNs) appear to be more capable of storing specific anomalies and thus excel on datasets with a limited number of elements and manipulation methodologies. The Vision Transformer, conversely, is more effective when trained on more varied datasets, achieving better generalization capabilities than the other methods analysed. Finally, the Swin Transformer appears to be a good alternative for using an attention-based method in a more limited data regime and performs very well in cross-dataset scenarios. All the analysed architectures seem to look at deepfakes in different ways, but since generalization capability is essential in a real-world environment, the experiments carried out suggest that the attention-based architectures provide superior performance.
Pages: 12
Related papers
50 in total
  • [31] Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models
    Lam Pham
    Phat Lam
    Truong Nguyen
    Huyen Nguyen
    Schindler, Alexander
    2024 IEEE 5TH INTERNATIONAL SYMPOSIUM ON THE INTERNET OF SOUNDS, IS2 2024, 2024 : 170 - 174
  • [32] Comprehensive Evaluation of Deepfake Detection Models: Accuracy, Generalization, and Resilience to Adversarial Attacks
    Abbasi, Maryam
    Vaz, Paulo
    Silva, Jose
    Martins, Pedro
    APPLIED SCIENCES-BASEL, 2025, 15 (03)
  • [33] Using Graph Neural Networks to Improve Generalization Capability of the Models for Deepfake Detection
    She, Huimin
    Hu, Yongjian
    Liu, Beibei
    Li, Jicheng
    Li, Chang-Tsun
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 8414 - 8427
  • [34] Improving Generalization of Deepfake Detection With Data Farming and Few-Shot Learning
    Korshunov, Pavel
    Marcel, Sebastien
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2022, 4 (03) : 386 - 397
  • [35] Generalization of Forgery Detection With Meta Deepfake Detection Model
    Tran, Van-Nhan
    Kwon, Seong-Geun
    Lee, Suk-Hwan
    Le, Hoanh-Su
    Kwon, Ki-Ryong
    IEEE ACCESS, 2023, 11 : 535 - 546
  • [36] Real-Time Deepfake Video Detection Using Eye Movement Analysis with a Hybrid Deep Learning Approach
    Javed, Muhammad
    Zhang, Zhaohui
    Dahri, Fida Hussain
    Laghari, Asif Ali
    ELECTRONICS, 2024, 13 (15)
  • [37] Improving Deepfake Video Detection with Comprehensive Self-consistency Learning
    Bao, Heng
    Deng, Lirui
    Guan, Jiazhi
    Zhang, Liang
    Chen, Xunxun
    CYBER SECURITY, CNCERT 2022, 2022, 1699 : 151 - 161
  • [38] Fully Unsupervised Deepfake Video Detection Via Enhanced Contrastive Learning
    Qiao, Tong
    Xie, Shichuang
    Chen, Yanli
    Retraint, Florent
    Luo, Xiangyang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (07) : 4654 - 4668
  • [39] Improving Generalization of Deepfake Detection by Training for Attribution
    Jain, Anubhav
    Korshunov, Pavel
    Marcel, Sebastien
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021
  • [40] CDNET: CLUSTER DECISION FOR DEEPFAKE DETECTION GENERALIZATION
    Hou, Zeming
    Hua, Zhongyun
    Zhang, Kuiyuan
    Zhang, Yushu
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023 : 3010 - 3014