Deepfake Video Detection Model Based on Consistency of Spatial-Temporal Features

被引:0
|
作者
Zhao L. [1 ,2 ]
Ge W. [1 ,2 ]
Mao Y. [2 ]
Han M. [2 ]
Li W. [2 ]
Li X. [2 ]
机构
[1] Key Lab. of Aerospace Info. Security and Trusted Computing, Ministry of Education, Wuhan Univ., Wuhan
[2] School of Cyber Sci. and Eng., Wuhan Univ., Wuhan
关键词
Deepfake detection; Fake images; Spatial features; Temporal features;
D O I
10.15961/j.jsuese.201900902
中图分类号
学科分类号
摘要
In order to improve the feature utilization rate of the image to be detected, a Deepfake video detection model based on consistency of spatial-temporal features was proposed, inspired by the observation that there are slight inconsistency and discontinuity in the facial expression changes of the characters in Deepfake videos. In the model, the convolutional neural network (CNN) was used to extract the spatial features from the video to be detected, and an optical flow method was used to perform temporal features between consecutive frames of the video. Then another CNN was used to extract the abstract and in-depth features from the optical flow map. After the temporal features and spatial features were transformed from original representation space to a new feature space by neural networks, a fully connected neural network was used to classify the combined spatial and temporal features space to achieve the detection target. The model proposed in the paper was trained on the Faceforensics++, an open source Deepfake dataset. The experimental results indicated that the detection accuracy of the proposed model reaches 98.1%, and the AUC value reaches 0.998 1. By comparing with the existing Deepfake detection models, the proposed model is superior to the existing models in terms of detection accuracy and AUC value, which verifies the effectiveness of the proposed model. © 2020, Editorial Department of Advanced Engineering Sciences. All right reserved.
引用
收藏
页码:243 / 250
页数:7
相关论文
共 22 条
  • [1] Github deepfakes.faceswap
  • [2] Github iperov.Deepfacelab
  • [3] Todd Ditchendorf.Fakeapp
  • [4] Guera D, Delp E J., Deepfake video detection using recurrent neural networks, Proceedings of the 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 1-6, (2018)
  • [5] Li Y, Chang M C, Lyu S., In ictu oculi: Exposing ai created fake videos by detecting eye blinking, Proceedings of the 2018 IEEE International Workshop on Information Forensics and Security (WIFS), pp. 1-7, (2018)
  • [6] Li Y, Lyu S., Exposing deepfake videos by detecting face warping artifacts
  • [7] Yang X, Li Y, Lyu S., Exposing deep fakes using inconsistent head poses, Proceedings of 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), pp. 8261-8265, (2019)
  • [8] Afchar D, Nozick V, Yamagishi J, Et al., Mesonet: A compact facial video forgery detection network, Proceedings of the 2018 IEEE International Workshop on Information Forensics and Security (WIFS), pp. 1-7, (2018)
  • [9] Nguyen H H, Yamagishi J, Echizen I., Capsule-forensics: Using capsule networks to detect forged images and videos, Proceedings of 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), pp. 2307-2311, (2019)
  • [10] Bradski G, Kaebler A., (2009)