Adaptive Visual Field Multi-scale Generative Adversarial Networks Image Inpainting Base on Coordinate-Attention

Cited by: 2
Authors
Chen, Gang [1, 2]
Kang, Peipei [2]
Wu, Xingcai [2]
Yang, Zhenguo [2]
Liu, Wenyin [2, 3]
Affiliations
[1] Guangdong Open Univ, Sch Artificial Intelligence, Guangzhou, Peoples R China
[2] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
[3] Cyberspace Secur Res Ctr, Peng Cheng Lab, Shenzhen, Peoples R China
Keywords
Image inpainting; Deformable convolutional networks; Coordinate-attention; Multi-Scale GANs; OBJECT REMOVAL; RECONSTRUCTION; ALGORITHM;
DOI
10.1007/s11063-023-11233-0
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Achieving visual consistency and realistic results when inpainting images with large missing blocks is extremely challenging. This paper proposes Adaptive Visual field Multi-scale Generative Adversarial Networks (GANs) Image Inpainting based on Coordinate-attention (denoted as AVMGC). First, an encoder with deformable convolutional networks is designed in the generator of the multi-scale GANs to adaptively expand the local visual field of network sampling, which improves the local visual consistency of the inpainted image. Second, to expand the receptive field of the deep network and the global visual field, AVMGC combines the coordinate-attention mechanism with the convolutional layers to capture direction-aware and position-sensitive information across channels, which helps the model locate and recognize objects of interest more accurately and generate globally consistent geometric contours. In addition, instance normalization is introduced into the multi-scale discriminator to transfer the statistical information of the feature maps and preserve the style of the original images. Extensive experiments on public datasets show that the proposed algorithm achieves good qualitative performance and outperforms the baselines.
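Two of the building blocks named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function names `instance_norm` and `directional_pool` and the toy input are hypothetical, chosen only for illustration. `instance_norm` normalizes each channel of a single sample by its own spatial statistics, which is the standard definition of instance normalization. `directional_pool` shows only the first step of coordinate attention, averaging a feature map along each spatial axis separately; the learned convolutions and sigmoid gating that follow in a full coordinate-attention block are omitted here.

```python
import math


def instance_norm(feature_maps, eps=1e-5):
    """Normalize each channel of ONE sample by its own spatial mean and variance.

    feature_maps: list of channels, each a list of rows (H x W floats).
    Assumption: no learned scale/shift parameters are applied.
    """
    normalized = []
    for channel in feature_maps:
        values = [v for row in channel for v in row]
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        scale = 1.0 / math.sqrt(var + eps)
        normalized.append([[(v - mean) * scale for v in row] for row in channel])
    return normalized


def directional_pool(channel):
    """Average one H x W channel along each spatial axis separately.

    Returns (row_means, col_means): a length-H vector that keeps the vertical
    position and a length-W vector that keeps the horizontal position.
    """
    height, width = len(channel), len(channel[0])
    row_means = [sum(row) / width for row in channel]
    col_means = [sum(channel[i][j] for i in range(height)) / height
                 for j in range(width)]
    return row_means, col_means


# Hypothetical toy input: 2 channels, each 2 x 2.
sample = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[10.0, 10.0], [20.0, 20.0]],
]
normed = instance_norm(sample)
rows, cols = directional_pool(sample[0])
```

Because the statistics are computed per sample and per channel, the two channels above end up on the same scale even though their raw ranges differ, and the two pooled vectors retain position along one axis each instead of collapsing the map to a single value.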
Pages: 9949 - 9967
Page count: 19
Related papers
50 in total
  • [21] Multi-scale self-attention generative adversarial network for pathology image restoration
    Meiyan Liang
    Qiannan Zhang
    Guogang Wang
    Na Xu
    Lin Wang
    Haishun Liu
    Cunlin Zhang
    The Visual Computer, 2023, 39 : 4305 - 4321
  • [22] Multi-scale self-attention generative adversarial network for pathology image restoration
    Liang, Meiyan
    Zhang, Qiannan
    Wang, Guogang
    Xu, Na
    Wang, Lin
    Liu, Haishun
    Zhang, Cunlin
    VISUAL COMPUTER, 2023, 39 (09): : 4305 - 4321
  • [23] Image inpainting via Multi-scale Adaptive Priors
    Wang, Yufeng
    Guo, Dongsheng
    Zhao, Haoru
    Yang, Min
    Zheng, Haiyong
    PATTERN RECOGNITION, 2025, 162
  • [24] Multi-scale Generative Adversarial Networks for Speech Enhancement
    Li, Yihang
    Jiang, Ting
    Qin, Shan
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [25] Multi-scale Generative Adversarial Networks for Crowd Counting
    Yang, Jianxing
    Zhou, Yuan
    Kung, Sun-Yuan
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3244 - 3249
  • [26] Image Inpainting Based Multi-scale Gated Convolution and Attention
    Jiang, Hualiang
    Ma, Xiaohu
    Yang, Dongdong
    Zhao, Jiaxin
    Shen, Yao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 407 - 418
  • [27] MUSICAL: Multi-Scale Image Contextual Attention Learning for Inpainting
    Wang, Ning
    Li, Jingyuan
    Zhang, Lefei
    Du, Bo
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3748 - 3754
  • [28] MSRA-G: Combination of multi-scale residual attention network and generative adversarial networks for hyperspectral image classification
    Zhao, Jinling
    Hu, Lei
    Huang, Linsheng
    Wang, Chuanjian
    Liang, Dong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [29] MSE-Net: generative image inpainting with multi-scale encoder
    Yizhong Yang
    Zhihang Cheng
    Haotian Yu
    Yongqiang Zhang
    Xin Cheng
    Zhang Zhang
    Guangjun Xie
    The Visual Computer, 2022, 38 : 2647 - 2659
  • [30] MSE-Net: generative image inpainting with multi-scale encoder
    Yang, Yizhong
    Cheng, Zhihang
    Yu, Haotian
    Zhang, Yongqiang
    Cheng, Xin
    Zhang, Zhang
    Xie, Guangjun
    VISUAL COMPUTER, 2022, 38 (08): : 2647 - 2659