CTNet: hybrid architecture based on CNN and transformer for image inpainting detection

Cited by: 6
Authors
Xiao, Fengjun [1 ]
Zhang, Zhuxi [2 ]
Yao, Ye [2 ]
Affiliations
[1] Hangzhou Dianzi Univ, Zhejiang Informatizat Dev Inst, Xiasha Higher Educ Zone, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Cyberspace, Xiasha Higher Educ Zone, Hangzhou 310018, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image inpainting detection; Deep neural network; Hybrid CNN-Transformer encoder; High-pass filter; DIFFUSION; NETWORK;
DOI
10.1007/s00530-023-01184-w
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Digital image inpainting technology has increasingly gained popularity as a result of the development of image processing and machine vision. However, digital image inpainting can be used not only to repair damaged photographs, but also to remove specific people or distort the semantic content of images. To address the issue of image inpainting forgeries, a hybrid CNN-Transformer Network (CTNet), which is composed of the hybrid CNN-Transformer encoder, the feature enhancement module, and the decoder module, is proposed for image inpainting detection and localization. Different from existing inpainting detection methods that rely on hand-crafted attention mechanisms, the hybrid CNN-Transformer encoder employs CNN as a feature extractor to build feature maps tokenized as the input patches of the Transformer encoder. The hybrid structure exploits the innate global self-attention mechanisms of Transformer and can effectively capture the long-term dependency of the image. Since inpainting traces mainly exist in the high-frequency components of digital images, the feature enhancement module performs feature extraction in the frequency domain. The decoder regularizes the upsampling process of the predicted masks with the assistance of high-frequency features. We investigate the generalization capacity of our CTNet on datasets generated by ten commonly used inpainting methods. The experimental results show that the proposed model can detect a variety of unknown inpainting operations after being trained on the datasets generated by a single inpainting method.
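The two ideas the abstract describes, tokenizing CNN feature maps as Transformer input and extracting high-frequency features, can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the CTNet implementation: the 3x3 Laplacian-style kernel, the zero padding, and the function names `high_pass` and `tokenize_feature_map` are hypothetical choices, since the abstract does not specify the filter or the token layout.

```python
import numpy as np

# Hypothetical 3x3 Laplacian-style high-pass kernel (an assumption; the
# abstract does not say which filter CTNet uses).
HIGH_PASS_KERNEL = np.array([[-1, -1, -1],
                             [-1,  8, -1],
                             [-1, -1, -1]], dtype=np.float64)


def high_pass(image):
    """Filter a 2-D grayscale image with zero padding, keeping its shape."""
    h, w = image.shape
    padded = np.pad(image.astype(np.float64), 1, mode="constant")
    out = np.zeros((h, w), dtype=np.float64)
    # Accumulate the nine shifted, weighted copies of the padded image.
    for dy in range(3):
        for dx in range(3):
            out += HIGH_PASS_KERNEL[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def tokenize_feature_map(feature_map):
    """Flatten a (C, H, W) feature map into H*W tokens of dimension C."""
    c, h, w = feature_map.shape
    return feature_map.reshape(c, h * w).T


# Smooth (constant) regions give a zero response away from the border,
# so only abrupt local changes survive the filter.
flat_image = np.ones((5, 5))
residual = high_pass(flat_image)

# Each spatial position of the feature map becomes one token.
features = np.arange(24).reshape(2, 3, 4)
tokens = tokenize_feature_map(features)
```

In this sketch each token corresponds to one spatial location of the feature map, which is the layout a standard Transformer encoder expects as a sequence; a real model would typically add a linear projection and positional embeddings on top.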
Pages: 3819-3832
Page count: 14
Related papers
50 in total
  • [21] Multiscale fire image detection method based on CNN and Transformer
    Shengbao Wu
    Buyun Sheng
    Gaocai Fu
    Daode Zhang
    Yuchao Jian
    Multimedia Tools and Applications, 2024, 83 : 49787 - 49811
  • [22] Multiscale fire image detection method based on CNN and Transformer
    Wu, Shengbao
    Sheng, Buyun
    Fu, Gaocai
    Zhang, Daode
    Jian, Yuchao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 49787 - 49811
  • [23] Enhanced hybrid CNN and transformer network for remote sensing image change detection
    Yang, Junjie
    Wan, Haibo
    Shang, Zhihai
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [24] Medical Image Classification with a Hybrid SSM Model Based on CNN and Transformer
    Hu, Can
    Cao, Ning
    Zhou, Han
    Guo, Bin
    ELECTRONICS, 2024, 13 (15)
  • [25] HTC-Grasp: A Hybrid Transformer-CNN Architecture for Robotic Grasp Detection
    Zhang, Qiang
    Zhu, Jianwei
    Sun, Xueying
    Liu, Mingmin
    ELECTRONICS, 2023, 12 (06)
  • [26] Wild horseshoe crab image denoising based on CNN-transformer architecture
    Han, Lili
    Liu, Xiuping
    Wang, Qingqing
    Xu, Tao
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [27] A Hybrid Parallel Computing Architecture Based on CNN and Transformer for Music Genre Classification
    Chen, Jiyang
    Ma, Xiaohong
    Li, Shikuan
    Ma, Sile
    Zhang, Zhizheng
    Ma, Xiaojing
    ELECTRONICS, 2024, 13 (16)
  • [28] Hybrid Architecture Based on CNN and Transformer for Strip Steel Surface Defect Classification
    Li, Shunfeng
    Wu, Chunxue
    Xiong, Naixue
    ELECTRONICS, 2022, 11 (08)
  • [29] Encoder-decoder-based CNN model for detection of object removal by image inpainting
    Kumar, Nitish
    Meenpal, Toshanlal
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [30] An Edge-Aware Transformer Framework for Image Inpainting Detection
    Hu, Liangpei
    Li, Yuanman
    You, Jiaxiang
    Liang, Rongqin
    Li, Xia
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT II, 2022, 13339 : 648 - 660