Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake Detection

Cited by: 0
Authors
Xu, Yuting [1 ,4 ]
Liang, Jian [2 ,3 ,5 ]
Sheng, Lijun [2 ,3 ,6 ]
Zhang, Xiao-Yu [1 ,4 ]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, CRIPAC, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Automat, MAIS, Beijing, Peoples R China
[4] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[5] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[6] Univ Sci & Technol China, Dept Automat, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Forgery detection; Thumbnail; Spatiotemporal inconsistency; Graph reasoning; Vision transformer; RECOGNITION;
DOI
10.1007/s11263-024-02054-2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deepfake threats to society and cybersecurity have provoked significant public concern, driving intensified efforts in deepfake video detection. Current video-level methods are mostly based on 3D CNNs, which achieve good performance but incur high computational costs. This paper introduces a simple yet effective strategy named Thumbnail Layout (TALL), which transforms a video clip into a pre-defined layout to preserve spatial and temporal dependencies. The transformation masks the frames in sequence at the same position within each frame. The masked frames are then resized into sub-frames and rearranged into the pre-defined layout, forming a thumbnail. TALL is model-agnostic and simple, requiring only minimal code modifications. Furthermore, we introduce a graph reasoning block (GRB) and a semantic consistency (SC) loss to strengthen TALL, yielding TALL++. GRB enhances interactions between different semantic regions to capture semantic-level inconsistency clues. The SC loss imposes consistency constraints on semantic features to improve generalization. Extensive experiments on intra-dataset evaluation, cross-dataset evaluation, diffusion-generated image detection, and deepfake generation method recognition show that TALL++ achieves results surpassing or comparable to state-of-the-art methods, demonstrating the effectiveness of our approach for various deepfake detection problems. The code is available at https://github.com/rainy-xu/TALL4Deepfake.
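The thumbnail transformation described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (the repository linked above is the authoritative source). The function name `thumbnail_layout`, the 2 x 2 default grid, the zero-valued square mask drawn once per clip, and the block-average resizing are all assumptions made here for illustration only.

```python
import numpy as np


def thumbnail_layout(clip, rows=2, cols=2, mask_size=8, rng=None):
    """Arrange a clip of shape (T, H, W, C) into one (H, W, C) thumbnail.

    Hypothetical helper: grid size, mask shape, and the resizing method
    are illustrative assumptions, not details taken from the paper.
    """
    t, h, w, c = clip.shape
    if t != rows * cols:
        raise ValueError("clip length must equal rows * cols")
    if h % rows or w % cols:
        raise ValueError("frame size must be divisible by the grid size")
    if mask_size > min(h, w):
        raise ValueError("mask_size must fit inside a frame")
    rng = np.random.default_rng() if rng is None else rng

    # Step 1: mask the same square region in every frame of the clip.
    masked = clip.astype(np.float64)  # astype returns a copy
    if mask_size > 0:
        top = int(rng.integers(0, h - mask_size + 1))
        left = int(rng.integers(0, w - mask_size + 1))
        masked[:, top:top + mask_size, left:left + mask_size, :] = 0.0

    # Step 2: resize each frame to a sub-frame by averaging
    # non-overlapping (rows x cols) pixel blocks.
    sub_h, sub_w = h // rows, w // cols
    sub = masked.reshape(t, sub_h, rows, sub_w, cols, c).mean(axis=(2, 4))

    # Step 3: place sub-frames in row-major order on a rows x cols grid.
    grid = sub.reshape(rows, cols, sub_h, sub_w, c)
    return grid.transpose(0, 2, 1, 3, 4).reshape(rows * sub_h, cols * sub_w, c)


# Usage: four 8x8 RGB frames become a single 8x8 RGB thumbnail.
demo_clip = np.ones((4, 8, 8, 3))
demo_thumb = thumbnail_layout(demo_clip, mask_size=4, rng=np.random.default_rng(0))
```

Because the output has the same spatial size as a single input frame, a thumbnail built this way could in principle be passed to an ordinary 2D image backbone, which is consistent with the abstract's description of TALL as model-agnostic.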
Pages: 5663 - 5680
Page count: 18
Related papers
50 in total
  • [41] Masked Relation Learning for DeepFake Detection
    Yang, Ziming
    Liang, Jian
    Xu, Yuting
    Zhang, Xiao-Yu
    He, Ran
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1696 - 1708
  • [42] Generalized Deepfake Detection Algorithm Based on Inconsistency Between Inner and Outer Faces
    Gao, Jie
    Concas, Sara
    Orru, Giulia
    Feng, Xiaoyi
    Marcialis, Gian Luca
    Roli, Fabio
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT I, 2024, 14365 : 343 - 355
  • [43] Deepfake face detection via multi-level discrete wavelet transform and vision transformer
    Uddin, Main
    Fu, Zhangjie
    Zhang, Xiang
    VISUAL COMPUTER, 2025,
  • [44] DeepFake Detection Using Deep Learning
    Mansoor, Nazneen
    Iliev, Alexander Iliev
    INTELLIGENT COMPUTING, VOL 3, 2024, 2024, 1018 : 202 - 213
  • [45] Implicit Identity Driven Deepfake Face Swapping Detection
    Huang, Baojin
    Wang, Zhongyuan
    Yang, Jifan
    Ai, Jiaxin
    Zou, Qin
    Wang, Qian
    Ye, Dengpan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4490 - 4499
  • [46] DeepFake Detection for Human Face Images and Videos: A Survey
    Malik, Asad
    Kuribayashi, Minoru
    Abdullahi, Sani M.
    Khan, Ahmad Neyaz
    IEEE ACCESS, 2022, 10 : 18757 - 18775
  • [47] Common Forgery Artifact Driven Deepfake Face Detection
    Wu, Haotian
    Wang, Xin
    Wang, Ruobing
    Xiang, Ji
    Ren, Liyue
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1585 - 1590
  • [48] Privacy-preserving DeepFake face image detection
    Chen, Beijing
    Liu, Xin
    Xia, Zhihua
    Zhao, Guoying
    DIGITAL SIGNAL PROCESSING, 2023, 143
  • [49] A dual descriptor combined with frequency domain reconstruction learning for face forgery detection in deepfake videos
    Jin, Xin
    Wu, Nan
    Jiang, Qian
    Kou, Yuru
    Duan, Hanxian
    Wang, Puming
    Yao, Shaowen
    FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION, 2024, 49
  • [50] MSVT: Multiple Spatiotemporal Views Transformer for DeepFake Video Detection
    Yu, Yang
    Ni, Rongrong
    Zhao, Yao
    Yang, Siyuan
    Xia, Fen
    Jiang, Ning
    Zhao, Guoqing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4462 - 4471