TBNet: A Two-Stream Boundary-Aware Network for Generic Image Manipulation Localization

被引：18

作者：

Gao, Zan ^{[1
,2
]}

Sun, Chao ^{[1
]}

Cheng, Zhiyong ^{[1
]}

Guan, Weili ^{[3
]}

Liu, Anan ^{[4
]}

Wang, Meng ^{[5
]}

机构：

[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250316, Shandong, Peoples R China

[2] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China

[3] Monash Univ, Fac Informat Technol, Clayton Campus, Clayton, Vic 3800, Australia

[4] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[5] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2023年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Splicing; Location awareness; Streaming media; Frequency-domain analysis; Task analysis; Feature extraction; Image color analysis; Adaptive cross-attention fusion; adaptive frequency selection; boundary artifact localization; generic image manipulation localization; two-stream boundary-aware; SPLICING FORGERY;

D O I：

10.1109/TKDE.2022.3187091

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Finding tampered regions in images is a common research topic in machine learning and computer vision. Although many image manipulation location algorithms have been proposed, most of them only focus on RGB images with different color spaces, and the frequency information that contains the potential tampering clues is often ignored. Moreover, among the manipulation operations, splicing and copy-move are two frequently used methods, but as their characteristics are quite different, specific methods have been individually designed for detecting the operations of either splicing or copy-move, and it is very difficult to widely apply these methods in practice. To solve these issues, in this work, a novel end-to-end two-stream boundary-aware network (abbreviated as TBNet) is proposed for generic image manipulation localization where the RGB stream, the frequency stream, and the boundary artifact location are explored in a unified framework. Specifically, we first design an adaptive frequency selection module (AFS) to adaptively select the appropriate frequency to mine inconsistent statistics and eliminate the interference of redundant statistics. Then, an adaptive cross-attention fusion module (ACF) is proposed to adaptively fuse the RGB feature and the frequency feature. Finally, the boundary artifact location network (BAL) is designed to locate the boundary artifacts for which the parameters are jointly updated by the outputs of the ACF, and its results are further fed into the decoder. Thus, the parameters of the RGB stream, the frequency stream, and the boundary artifact location network are jointly optimized, and their latent complementary relationships are fully mined. The results of the extensive experiments performed on six public benchmarks of the image manipulation localization task, namely, CASIA1.0, COVER, Carvalho, In-The-Wild, NIST-16, and IMD-2020, demonstrate that the proposed TBNet can substantially outperform state-of-the-art generic image manipulation localization methods in terms of MCC, F1, and AUC while maintaining robustness with respect to various attacks. Compared with DeepLabV3+ on the CASIA1.0, COVER, Carvalho, In-The-Wild, and NIST-16 datasets, the improvements in MCC/F1 reach 11%/11.1%, 8.2%/10.3%, 10.2%/11.6%, 8.9%/6.2%, and 13.3%/16.0%, respectively. Moreover, when IMD2020 is utilized, its AUC improvement can achieve 14.7%.

引用

页码：7541 / 7556

页数：16

共 50 条

[21] Boundary-aware Segmentation Network Using Multi-Task Enhancement for Ultrasound Image
Yu, Ruiguo
Hu, Jiachen
Yu, Mei
Wei, Xi
Jiang, Han
Zhu, Jialin
Liu, Zhiqiang
Gao, Jie
Li, Xuewei
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1210 - 1214
[22] A two-stream network with complementary feature fusion for pest image classification
Wang, Chao
Zhang, Jinrui
He, Jin
Luo, Wei
Yuan, Xiaohui
Gu, Lichuan
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
[23] A Two-Stream Symmetric Network with Bidirectional Ensemble for Aerial Image Matching
Park, Jae-Hyun
Nam, Woo-Jeoung
Lee, Seong-Whan
REMOTE SENSING, 2020, 12 (03)
[24] Two-stream deep sparse network for accurate and efficient image restoration
Wang, Shuhui
Hu, Ling
Li, Liang
Zhang, Weigang
Huang, Qingming
COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 200
[25] Two-stream encoder-decoder network for localizing image forgeries
Mazumdar, Aniruddha
Bora, Prabin Kumar
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 82
[26] A fully convolutional two-stream fusion network for interactive image segmentation
Hu, Yang
Soltoggio, Andrea
Lock, Russell
Carter, Steve
NEURAL NETWORKS, 2019, 109 : 31 - 42
[27] Two-stream spatiotemporal image fusion network based on difference transformation
Fang, Shuai
Meng, Siyuan
Zhang, Jing
Cao, Yang
JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (03)
[28] TWO-STREAM SPARSE NETWORK FOR ACCURATE IMAGE SUPER-RESOLUTION
Hu, Ling
Wang, Shuhui
Li, Liang
Huang, Qingming
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 258 - 263
[29] Remote Sensing Image Fusion Based on Two-Stream Fusion Network
Liu, Xiangyu
Wang, Yunhong
Liu, Qingjie
MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 428 - 439
[30] Remote sensing image fusion based on two-stream fusion network
Liu, Xiangyu
Liu, Qingjie
Wang, Yunhong
INFORMATION FUSION, 2020, 55 : 1 - 15

← 1 2 3 4 5 →