Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake Detection

被引:0
|
作者
Xu, Yuting [1 ,4 ]
Liang, Jian [2 ,3 ,5 ]
Sheng, Lijun [2 ,3 ,6 ]
Zhang, Xiao-Yu [1 ,4 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, CRIPAC, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Automat, MAIS, Beijing, Peoples R China
[4] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[5] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[6] Univ Sci & Technol China, Dept Automat, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Forgery detection; Thumbnail; Spatiotemporal inconsistency; Graph reasoning; Vision transformer; RECOGNITION;
D O I
10.1007/s11263-024-02054-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The deepfake threats to society and cybersecurity have provoked significant public apprehension, driving intensified efforts within the realm of deepfake video detection. Current video-level methods are mostly based on 3D CNNs resulting in high computational demands, although have achieved good performance. This paper introduces an elegantly simple yet effective strategy named Thumbnail Layout (TALL), which transforms a video clip into a pre-defined layout to realize the preservation of spatial and temporal dependencies. This transformation process involves sequentially masking frames at the same positions within each frame. These frames are then resized into sub-frames and reorganized into the predetermined layout, forming thumbnails. TALL is model-agnostic and has remarkable simplicity, necessitating only minimal code modifications. Furthermore, we introduce a graph reasoning block (GRB) and semantic consistency (SC) loss to strengthen TALL, culminating in TALL++. GRB enhances interactions between different semantic regions to capture semantic-level inconsistency clues. The semantic consistency loss imposes consistency constraints on semantic features to improve model generalization ability. Extensive experiments on intra-dataset, cross-dataset, diffusion-generated image detection, and deepfake generation method recognition show that TALL++ achieves results surpassing or comparable to the state-of-the-art methods, demonstrating the effectiveness of our approaches for various deepfake detection problems. The code is available at https://github.com/rainy-xu/TALL4Deepfake.
引用
收藏
页码:5663 / 5680
页数:18
相关论文
共 50 条
  • [1] Spatiotemporal Inconsistency Learning for DeepFake Video Detection
    Gu, Zhihao
    Chen, Yang
    Yao, Taiping
    Ding, Shouhong
    Li, Jilin
    Huang, Feiyue
    Ma, Lizhuang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3473 - 3481
  • [2] DeepFake Videos Detection via Spatiotemporal Inconsistency Learning and Interactive Fusion
    Ding, Xiangling
    Zhu, Wenjie
    Zhang, Dengyong
    2022 19TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2022, : 425 - 433
  • [3] TALL: Thumbnail Layout for Deepfake Video Detection
    Xu, Yuting
    Liang, Jian
    Jia, Gengyun
    Yang, Ziming
    Zhang, Yanhao
    He, Ran
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22601 - 22611
  • [4] GazeForensics: DeepFake detection via gaze-guided spatial inconsistency learning
    He, Qinlin
    Peng, Chunlei
    Liu, Decheng
    Wang, Nannan
    Gao, Xinbo
    NEURAL NETWORKS, 2024, 180
  • [5] Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection
    Gu, Zhihao
    Yao, Taiping
    Chen, Yang
    Ding, Shouhong
    Ma, Lizhuang
    COMPUTER VISION, ECCV 2022, PT XII, 2022, 13672 : 596 - 613
  • [6] Augmented Multi-Scale Spatiotemporal Inconsistency Magnifier for Generalized DeepFake Detection
    Yu, Yang
    Zhao, Xiaohui
    Ni, Rongrong
    Yang, Siyuan
    Zhao, Yao
    Kot, Alex C.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8487 - 8498
  • [7] Delving into the Local: Dynamic Inconsistency Learning for DeepFake Video Detection
    Gu, Zhihao
    Chen, Yang
    Yao, Taiping
    Ding, Shouhong
    Li, Jilin
    Ma, Lizhuang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 744 - 752
  • [8] DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection
    Nie, Fan
    Ni, Jiangqun
    Zhang, Jian
    Zhang, Bin
    Zhang, Weizhe
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2155 - 2167
  • [9] Capturing the Lighting Inconsistency for Deepfake Detection
    Wu, Wenxuan
    Zhou, Wenbo
    Zhang, Weiming
    Fang, Han
    Yu, Nenghai
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT II, 2022, 13339 : 637 - 647
  • [10] Deepfake detection via inter-frame inconsistency recomposition and enhancement
    Zhu, Chuntao
    Zhang, Bolin
    Yin, Qilin
    Yin, Chengxi
    Lu, Wei
    PATTERN RECOGNITION, 2024, 147