Local Region Frequency Guided Dynamic Inconsistency Network for Deepfake Video Detection

被引:2
|
作者
Yue, Pengfei [1 ,2 ,3 ]
Chen, Beijing [1 ,2 ,3 ]
Fu, Zhangjie [1 ,2 ,3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Centerof Atmospher Envi, Nanjing 210044, Peoples R China
来源
BIG DATA MINING AND ANALYTICS | 2024年 / 7卷 / 03期
基金
中国国家自然科学基金;
关键词
Deepfakes; Privacy; Frequency-domain analysis; Mouth; Lighting; Stability analysis; Spatiotemporal phenomena; deepfake video detection; dynamic inconsistency; local region; local region frequency;
D O I
10.26599/BDMA.2024.9020030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, with the rapid development of deepfake technology, a large number of deepfake videos have emerged on the Internet, which poses a huge threat to national politics, social stability, and personal privacy. Although many existing deepfake detection methods exhibit excellent performance for known manipulations, their detection capabilities are not strong when faced with unknown manipulations. Therefore, in order to obtain better generalization ability, this paper analyzes global and local inter-frame dynamic inconsistencies from the perspective of spatial and frequency domains, and proposes a Local region Frequency Guided Dynamic Inconsistency Network (LFGDIN). The network includes two parts: Global SpatioTemporal Network (GSTN) and Local Region Frequency Guided Module (LRFGM). The GSTN is responsible for capturing the dynamic information of the entire face, while the LRFGM focuses on extracting the frequency dynamic information of the eyes and mouth. The LRFGM guides the GTSN to concentrate on dynamic inconsistency in some significant local regions through local region alignment, so as to improve the model's detection performance. Experiments on the three public datasets (FF++, DFDC, and Celeb-DF) show that compared with many recent advanced methods, the proposed method achieves better detection results when detecting deepfake videos of unknown manipulation types.
引用
收藏
页码:889 / 904
页数:16
相关论文
共 50 条
  • [41] Attention -guided dual spatial -temporal non -local network for video super -resolution
    Sun, Wei
    Zhang, Yanning
    NEUROCOMPUTING, 2020, 406 : 24 - 33
  • [42] A local spatiotemporal optimization framework for video saliency detection using region covariance
    Tian C.
    Jiang Q.
    Wu Z.
    Liu T.
    Hu L.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2016, 38 (07): : 1586 - 1593
  • [43] Multi-attribute self-attention guided vehicle local region detection based on convolutional neural network architecture
    Chen, Jingbo
    Chen, Shengyong
    Bian, Linjie
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (04)
  • [44] Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection
    Yang, Zhiwei
    Wu, Peng
    Liu, Jing
    Liu, Xiaotao
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 404 - 421
  • [45] Local-Global Dynamic Filtering Network for Video Super-Resolution
    Zhang, Chaopeng
    Wang, Xingtao
    Xiong, Ruiqin
    Fan, Xiaopeng
    Zhao, Debin
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2023, 9 : 963 - 976
  • [46] Efficient video frame insertion and deletion detection based on inconsistency of correlations between local binary pattern coded frames
    Zhang, Zhenzhen
    Hou, Jianjun
    Ma, Qinglong
    Li, Zhaohong
    SECURITY AND COMMUNICATION NETWORKS, 2015, 8 (02) : 311 - 320
  • [47] LW-DeepFakeNet: a lightweight time distributed CNN-LSTM network for real-time DeepFake video detection
    Umar Masud
    Mohd. Sadiq
    Sarfaraz Masood
    Musheer Ahmad
    Ahmed A. Abd El-Latif
    Signal, Image and Video Processing, 2023, 17 : 4029 - 4037
  • [48] LW-DeepFakeNet: a lightweight time distributed CNN-LSTM network for real-time DeepFake video detection
    Masud, Umar
    Sadiq, Mohd
    Masood, Sarfaraz
    Ahmad, Musheer
    Abd El-Latif, Ahmed A.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4029 - 4037
  • [49] Saliency-Guided Region Proposal Network for CNN Based Object Detection
    Fattal, Ann-Katrin
    Karg, Michelle
    Scharfenberger, Christian
    Adamy, Juergen
    2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [50] DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network
    Thakare, Kamalakar Vijay
    Raghuwanshi, Yash
    Dogra, Debi Prosad
    Choi, Heeseung
    Kim, Ig-Jae
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5530 - 5539