Exploiting Complementary Dynamic Incoherence for DeepFake Video Detection

被引:22
|
作者
Wang, Hanyi [1 ]
Liu, Zihan [1 ]
Wang, Shilin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
DeepFake video detection; video forensics;
D O I
10.1109/TCSVT.2023.3238517
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, manipulated videos based on DeepFake technology have spread widely on social media, causing concerns about the authenticity of video content and personal privacy protection. Although existing DeepFake detection methods achieve remarkable progress in some specific scenarios, their detection performance usually drops drastically when detecting unseen manipulation methods. Compared with static information such as human face, dynamic information depicting the movements of facial features is more difficult to forge without leaving visual or statistical traces. Hence, in order to achieve better generalization ability, we focus on dynamic information analysis to disclose such traces and propose a novel Complementary Dynamic Interaction Network (CDIN). Inspired by the DeepFake detection methods based on mouth region analysis, both the global (entire face) and local (mouth region) dynamics are analyzed with properly designed network branches, respectively, and their feature maps at various levels are communicated with each other using a newly proposed Complementary Cross Dynamics Fusion Module (CCDFM). With CCDFM, the global branch will pay more attention to anomalous mouth movements and the local branch will gain more information about the global context. Finally, a multi-task learning scheme is designed to optimize the network with both the global and local information. Extensive experiments have demonstrated that our approach achieves better detection results compared with several SOTA methods, especially in detecting video forgeries manipulated by unseen methods.
引用
收藏
页码:4027 / 4040
页数:14
相关论文
共 50 条
  • [21] Video Transformer for Deepfake Detection with Incremental Learning
    Khan, Sohail Ahmed
    Dai, Hang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1821 - 1828
  • [22] Self-Supervised Video Representation Learning by Video Incoherence Detection
    Cao, Haozhi
    Xu, Yuecong
    Mao, Kezhi
    Xie, Lihua
    Yin, Jianxiong
    See, Simon
    Xu, Qianwen
    Yang, Jianfei
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (06) : 3810 - 3822
  • [23] TCSD: Triple Complementary Streams Detector for Comprehensive Deepfake Detection
    Liu, Xiaolong
    Yu, Yang
    Li, Xiaolong
    Zhao, Yao
    Guo, Guodong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [24] A hierarchical feature selection strategy for deepfake video detection
    Mohiuddin, Sk
    Sheikh, Khalid Hassan
    Malakar, Samir
    Velasquez, Juan D.
    Sarkar, Ram
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (13): : 9363 - 9380
  • [25] Sharp Multiple Instance Learning for DeepFake Video Detection
    Li, Xiaodan
    Lang, Yining
    Chen, Yuefeng
    Mao, Xiaofeng
    He, Yuan
    Wang, Shuhui
    Xue, Hui
    Lu, Quan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1864 - 1872
  • [26] A hierarchical feature selection strategy for deepfake video detection
    Sk Mohiuddin
    Khalid Hassan Sheikh
    Samir Malakar
    Juan D. Velásquez
    Ram Sarkar
    Neural Computing and Applications, 2023, 35 : 9363 - 9380
  • [27] On the Generalization of Deep Learning Models in Video Deepfake Detection
    Coccomini, Davide Alessandro
    Caldelli, Roberto
    Falchi, Fabrizio
    Gennaro, Claudio
    JOURNAL OF IMAGING, 2023, 9 (05)
  • [28] Combining EfficientNet and Vision Transformers for Video Deepfake Detection
    Coccomini, Davide Alessandro
    Messina, Nicola
    Gennaro, Claudio
    Falchi, Fabrizio
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 219 - 229
  • [29] Deepfake Video Detection Using Recurrent Neural Networks
    Guera, David
    Delp, Edward J.
    2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2018, : 127 - 132
  • [30] Deepfake Video Detection via Predictive Representation Learning
    Ge, Shiming
    Lin, Fanzhao
    Li, Chenyu
    Zhang, Daichi
    Wang, Weiping
    Zeng, Dan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)