TBNet: A Two-Stream Boundary-Aware Network for Generic Image Manipulation Localization

被引:18
|
作者
Gao, Zan [1 ,2 ]
Sun, Chao [1 ]
Cheng, Zhiyong [1 ]
Guan, Weili [3 ]
Liu, Anan [4 ]
Wang, Meng [5 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250316, Shandong, Peoples R China
[2] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[3] Monash Univ, Fac Informat Technol, Clayton Campus, Clayton, Vic 3800, Australia
[4] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[5] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
基金
中国国家自然科学基金;
关键词
Splicing; Location awareness; Streaming media; Frequency-domain analysis; Task analysis; Feature extraction; Image color analysis; Adaptive cross-attention fusion; adaptive frequency selection; boundary artifact localization; generic image manipulation localization; two-stream boundary-aware; SPLICING FORGERY;
D O I
10.1109/TKDE.2022.3187091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding tampered regions in images is a common research topic in machine learning and computer vision. Although many image manipulation location algorithms have been proposed, most of them only focus on RGB images with different color spaces, and the frequency information that contains the potential tampering clues is often ignored. Moreover, among the manipulation operations, splicing and copy-move are two frequently used methods, but as their characteristics are quite different, specific methods have been individually designed for detecting the operations of either splicing or copy-move, and it is very difficult to widely apply these methods in practice. To solve these issues, in this work, a novel end-to-end two-stream boundary-aware network (abbreviated as TBNet) is proposed for generic image manipulation localization where the RGB stream, the frequency stream, and the boundary artifact location are explored in a unified framework. Specifically, we first design an adaptive frequency selection module (AFS) to adaptively select the appropriate frequency to mine inconsistent statistics and eliminate the interference of redundant statistics. Then, an adaptive cross-attention fusion module (ACF) is proposed to adaptively fuse the RGB feature and the frequency feature. Finally, the boundary artifact location network (BAL) is designed to locate the boundary artifacts for which the parameters are jointly updated by the outputs of the ACF, and its results are further fed into the decoder. Thus, the parameters of the RGB stream, the frequency stream, and the boundary artifact location network are jointly optimized, and their latent complementary relationships are fully mined. The results of the extensive experiments performed on six public benchmarks of the image manipulation localization task, namely, CASIA1.0, COVER, Carvalho, In-The-Wild, NIST-16, and IMD-2020, demonstrate that the proposed TBNet can substantially outperform state-of-the-art generic image manipulation localization methods in terms of MCC, F1, and AUC while maintaining robustness with respect to various attacks. Compared with DeepLabV3+ on the CASIA1.0, COVER, Carvalho, In-The-Wild, and NIST-16 datasets, the improvements in MCC/F1 reach 11%/11.1%, 8.2%/10.3%, 10.2%/11.6%, 8.9%/6.2%, and 13.3%/16.0%, respectively. Moreover, when IMD2020 is utilized, its AUC improvement can achieve 14.7%.
引用
收藏
页码:7541 / 7556
页数:16
相关论文
共 50 条
  • [21] Boundary-aware Segmentation Network Using Multi-Task Enhancement for Ultrasound Image
    Yu, Ruiguo
    Hu, Jiachen
    Yu, Mei
    Wei, Xi
    Jiang, Han
    Zhu, Jialin
    Liu, Zhiqiang
    Gao, Jie
    Li, Xuewei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1210 - 1214
  • [22] A two-stream network with complementary feature fusion for pest image classification
    Wang, Chao
    Zhang, Jinrui
    He, Jin
    Luo, Wei
    Yuan, Xiaohui
    Gu, Lichuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [23] A Two-Stream Symmetric Network with Bidirectional Ensemble for Aerial Image Matching
    Park, Jae-Hyun
    Nam, Woo-Jeoung
    Lee, Seong-Whan
    REMOTE SENSING, 2020, 12 (03)
  • [24] Two-stream deep sparse network for accurate and efficient image restoration
    Wang, Shuhui
    Hu, Ling
    Li, Liang
    Zhang, Weigang
    Huang, Qingming
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 200
  • [25] Two-stream encoder-decoder network for localizing image forgeries
    Mazumdar, Aniruddha
    Bora, Prabin Kumar
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 82
  • [26] A fully convolutional two-stream fusion network for interactive image segmentation
    Hu, Yang
    Soltoggio, Andrea
    Lock, Russell
    Carter, Steve
    NEURAL NETWORKS, 2019, 109 : 31 - 42
  • [27] Two-stream spatiotemporal image fusion network based on difference transformation
    Fang, Shuai
    Meng, Siyuan
    Zhang, Jing
    Cao, Yang
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (03)
  • [28] TWO-STREAM SPARSE NETWORK FOR ACCURATE IMAGE SUPER-RESOLUTION
    Hu, Ling
    Wang, Shuhui
    Li, Liang
    Huang, Qingming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 258 - 263
  • [29] Remote Sensing Image Fusion Based on Two-Stream Fusion Network
    Liu, Xiangyu
    Wang, Yunhong
    Liu, Qingjie
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 428 - 439
  • [30] Remote sensing image fusion based on two-stream fusion network
    Liu, Xiangyu
    Liu, Qingjie
    Wang, Yunhong
    INFORMATION FUSION, 2020, 55 : 1 - 15