ITrans: generative image inpainting with transformers

Cited by: 9
Authors
Miao, Wei [1 ,4 ]
Wang, Lijun [2 ]
Lu, Huchuan [1 ]
Huang, Kaining [3 ]
Shi, Xinchu [3 ]
Liu, Bocong [3 ]
Affiliations
[1] Dalian Univ Technol, Sch Informat & Commun Engn, 2 Linggong Rd, Dalian 116023, Peoples R China
[2] Dalian Univ Technol, Sch Artificial Intelligence, 2 Linggong Rd, Dalian 116023, Liaoning, Peoples R China
[3] Meituan Grp, 4 Wangjing East Rd, Beijing 100102, Peoples R China
[4] Univ Jyvaskyla, Fac Informat Technol, Seminaarinkatu 15, Jyvaskyla 40014, Finland
Keywords
Convolutional neural network; Image inpainting; Global transformer; Local transformer; Object removal
DOI
10.1007/s00530-023-01211-w
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Despite significant improvements, convolutional neural network (CNN) based methods struggle to handle long-range global image dependencies due to their limited receptive fields, leading to unsatisfactory inpainting performance in complicated scenarios. To address this issue, we propose the Inpainting Transformer (ITrans) network, which combines the power of both self-attention and convolution operations. The ITrans network augments the convolutional encoder-decoder structure with two novel designs, i.e., the global and local transformers. The global transformer aggregates high-level image context from the encoder from a global perspective and propagates the encoded global representation to the decoder in a multi-scale manner. Meanwhile, the local transformer is intended to extract low-level image details inside the local neighborhood at a reduced computational overhead. By incorporating the above two transformers, ITrans is capable of both global relationship modeling and local detail encoding, which is essential for hallucinating perceptually realistic images. Extensive experiments demonstrate that the proposed ITrans network performs favorably against state-of-the-art inpainting methods both quantitatively and qualitatively.
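The contrast the abstract draws between global and local attention can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the layers, so the function names (`global_attention`, `local_attention`, `score_count`), the use of non-overlapping square windows as the "local neighborhood", and the omission of learned query/key/value projections are all assumptions made here for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def global_attention(feat):
    """Self-attention over all H*W positions of an (H, W, C) feature map.

    Simplification (assumption): queries, keys and values are the raw
    features; a real transformer layer would use learned projections.
    """
    h, w, c = feat.shape
    tokens = feat.reshape(h * w, c)
    scores = tokens @ tokens.T / np.sqrt(c)        # (HW, HW) pairwise scores
    return (softmax(scores) @ tokens).reshape(h, w, c)

def local_attention(feat, window):
    """Self-attention restricted to non-overlapping window x window patches."""
    h, w, c = feat.shape
    assert h % window == 0 and w % window == 0, "window must divide H and W"
    # Split into patches: (num_windows, window*window, C).
    x = feat.reshape(h // window, window, w // window, window, c)
    x = x.transpose(0, 2, 1, 3, 4).reshape(-1, window * window, c)
    scores = x @ x.transpose(0, 2, 1) / np.sqrt(c)  # per-window scores
    out = softmax(scores) @ x
    # Undo the patch split to recover the (H, W, C) layout.
    out = out.reshape(h // window, w // window, window, window, c)
    return out.transpose(0, 2, 1, 3, 4).reshape(h, w, c)

def score_count(h, w, window=None):
    """Number of pairwise attention scores computed for an H x W map."""
    n = h * w
    return n * n if window is None else n * window * window

rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 8, 4))
print(global_attention(feat).shape, local_attention(feat, window=4).shape)
print(score_count(8, 8), score_count(8, 8, window=4))  # 4096 1024
```

In this sketch, every position attends to every other position in the global variant, so the number of scores grows quadratically with the number of positions. The windowed variant only compares positions within the same patch, which is one common way to obtain the kind of reduced overhead the abstract attributes to the local transformer.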
Pages: 12
Related papers
50 in total
  • [41] JPGNet: Joint Predictive Filtering and Generative Network for Image Inpainting
    Guo, Qing
    Li, Xiaoguang
    Juefei-Xu, Felix
    Yu, Hongkai
    Liu, Yang
    Wang, Song
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 386 - 394
  • [42] Generative Image Inpainting Based on Wavelet Transform Attention Model
    Wang, Chen
    Wang, Jin
    Zhu, Qing
    Yin, Baocai
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020
  • [43] Generative Image Inpainting with Multi-Stage Decoding Network
    Liu W.-R.
    Mi Y.-C.
    Yang F.
    Zhang Y.
    Guo H.-L.
    Liu Z.-M.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (03): 625 - 636
  • [44] Generative image inpainting using edge prediction and appearance flow
    Liu, Qian
    Ji, Hua
    Liu, Gang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 31709 - 31725
  • [45] Medical image captioning via generative pretrained transformers
    Selivanov, Alexander
    Rogov, Oleg Y.
    Chesakov, Daniil
    Shelmanov, Artem
    Fedulova, Irina
    Dylov, Dmitry V.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [46] Medical image captioning via generative pretrained transformers
    Alexander Selivanov
    Oleg Y. Rogov
    Daniil Chesakov
    Artem Shelmanov
    Irina Fedulova
    Dmitry V. Dylov
    Scientific Reports, 13
  • [47] An image inpainting method based on generative adversarial networks inversion and autoencoder
    Wang, Yechen
    Song, Bin
    Zhang, Zhiyong
    IET IMAGE PROCESSING, 2024, 18 (04) : 1042 - 1052
  • [48] Image Multi-Inpainting via Progressive Generative Adversarial Networks
    Cai, Jiayin
    Li, Changlin
    Tao, Xin
    Tai, Yu-Wing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022: 977 - 986
  • [49] Predictive Filtering Integrated Generative Remote Sensing Hyperspectral Image Inpainting
    Wu, Yinhu
    Zhang, Junping
    Liu, Dongyang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [50] Multiview Scene Image Inpainting Based on Conditional Generative Adversarial Networks
    Yuan, Zefeng
    Li, Hengyu
    Liu, Jingyi
    Luo, Jun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2020, 5 (02): 314 - 323