Deepfake Video Detection Based on Spatial, Spectral, and Temporal Inconsistencies Using Multimodal Deep Learning

被引:22
|
作者
Lewis, John K. [1 ]
Toubal, Imad Eddine [2 ]
Chen, Helen [3 ]
Sandesera, Vishal [4 ]
Lomnitz, Michael [4 ]
Hampel-Arias, Zigfried [4 ]
Prasad, Calyam [2 ]
Palaniappan, Kannappan [2 ]
机构
[1] Florida Southern Coll, Lakeland, FL 33801 USA
[2] Univ Missouri, Columbia, MO 65211 USA
[3] Univ Maryland, College Pk, MD USA
[4] IQT Labs, Arlington, VA USA
基金
美国国家科学基金会;
关键词
deepfake detection; deep learning; multi-modal; computer vision;
D O I
10.1109/AIPR50011.2020.9425167
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Authentication of digital media has become an everpressing necessity for modern society. Since the introduction of Generative Adversarial Networks (GANs), synthetic media has become increasingly difficult to identify. Synthetic videos that contain altered faces and/or voices of a person are known as deepfakes and threaten trust and privacy in digital media. Deepfakes can be weaponized for political advantage, slander, and to undermine the reputation of public figures. Despite imperfections of deepfakes, people struggle to distinguish between authentic and manipulated images and videos. Consequently, it is important to have automated systems that accurately and efficiently classify the validity of digital content. Many recent deepfake detection methods use single frames of video and focus on the spatial information in the image to infer the authenticity of the video. Some promising approaches exploit the temporal inconsistencies of manipulated videos; however, research primarily focuses on spatial features. We propose a hybrid deep learning approach that uses spatial, spectral, and temporal content that is coupled in a consistent way to differentiate real and fake videos. We show that the Discrete Cosine transform can improve deepfake detection by capturing spectral features of individual frames. In this work, we build a multimodal network that explores new features to detect deepfake videos, achieving 61.95% accuracy on the Facebook Deepfake Detection Challenge (DFDC) dataset.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] An efficient cybersecurity framework for facial video forensics detection based on multimodal deep learning
    Ahmed Sedik
    Osama S. Faragallah
    Hala S. El-sayed
    Ghada M. El-Banby
    Fathi E. Abd El-Samie
    Ashraf A. M. Khalaf
    Walid El-Shafai
    Neural Computing and Applications, 2022, 34 : 1251 - 1268
  • [32] Deepfake Detection Using Robust Spatial and Temporal Features from Facial Landmarks
    Li, Meng
    Liu, Beibei
    Hu, Yongjian
    Zhang, Liepiao
    Wang, Shiqi
    2021 9TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF 2021), 2021,
  • [33] An efficient cybersecurity framework for facial video forensics detection based on multimodal deep learning
    Sedik, Ahmed
    Faragallah, Osama S.
    El-sayed, Hala S.
    El-Banby, Ghada M.
    El-Samie, Fathi E. Abd
    Khalaf, Ashraf A. M.
    El-Shafai, Walid
    Neural Computing and Applications, 2022, 34 (02) : 1251 - 1268
  • [34] Video Anomaly Detection Using Optimization Based Deep Learning
    Gayal, Baliram Sambhaji
    Patil, Sandip Raosaheb
    UBIQUITOUS INTELLIGENT SYSTEMS, 2022, 302 : 249 - 264
  • [35] A Deep Learning Framework for Audio Deepfake Detection
    Janavi Khochare
    Chaitali Joshi
    Bakul Yenarkar
    Shraddha Suratkar
    Faruk Kazi
    Arabian Journal for Science and Engineering, 2022, 47 : 3447 - 3458
  • [36] Deepfake detection using deep learning methods: A systematic and comprehensive review
    Heidari, Arash
    Navimipour, Nima Jafari
    Dag, Hasan
    Unal, Mehmet
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 14 (02)
  • [37] Toward Sequential Deepfake Detection Using Deep Learning for Privacy Protection
    Zhang, Guisheng
    Li, Qilei
    Gao, Mingliang
    Jeon, Gwanggil
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2025, 14 (02) : 42 - 48
  • [38] A review of deep learning-based approaches for deepfake content detection
    Passos, Leandro A.
    Jodas, Danilo
    Costa, Kelton A. P.
    Souza, Luis A.
    Rodrigues, Douglas
    Del Ser, Javier
    Camacho, David
    Papa, Joao Paulo
    EXPERT SYSTEMS, 2024, 41 (08)
  • [39] Spatio-temporal knowledge distilled video vision transformer (STKD-VViT) for multimodal deepfake detection
    Usmani, Shaheen
    Kumar, Sunil
    Sadhya, Debanjan
    NEUROCOMPUTING, 2025, 620
  • [40] DeepfakeStack: A Deep Ensemble-based Learning Technique for Deepfake Detection
    Rana, Md Shohel
    Sung, Andrew H.
    2020 7TH IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND CLOUD COMPUTING (CSCLOUD 2020)/2020 6TH IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND SCALABLE CLOUD (EDGECOM 2020), 2020, : 70 - 75