STRAT: Image style transfer with region-aware transformer

Cited by: 0
Authors
Qi, Na [1]
Li, Yezi [1]
Fu, Rao [1]
Zhu, Qing [1]
Affiliations
[1] Beijing Univ Technol, Beijing 100124, Peoples R China
Keywords
Style transfer; Semantic-aware transformer; Region awareness; Statistic-based; Mask-guided cross attention;
DOI
10.1016/j.neucom.2024.129039
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Style transfer methods render various artistic styles onto a natural image by extracting and transferring textural features. Existing neural style transfer methods often rely on CNNs to extract image features and tend to suffer from feature leakage and content distortion due to limited receptive fields. Transformer-based style transfer methods outperform CNN-based methods by learning the global information of an image through the self-attention mechanism. However, they ignore local features and lose details because the semantic information of images is not taken into account. To address this issue, this paper proposes a novel style transfer framework based on a region-aware transformer (STRAT). We integrate a CNN-based short-range branch with a transformer-based long-range branch to extract both local and non-local features, and achieve region-adaptive texture transfer with two region-aware attention modules. Specifically, we use the SNR metric and masks as guides to propose an SNR-guided attention module and a mask-guided cross attention module, which enable region-varying feature extraction and adaptive texture transfer, respectively. Extensive experimental results demonstrate that the proposed method outperforms state-of-the-art methods in both subjective and objective evaluations.
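The abstract does not give the module's equations, so the following is only a generic sketch of what a mask-guided cross attention step could look like, not the authors' implementation. It assumes content tokens act as queries, style tokens act as keys and values, and a boolean mask (here built from hypothetical region labels) decides which style tokens each content token may attend to. The function name, the region labels, and the absence of learned projections are all assumptions made for illustration.

```python
import numpy as np

def masked_cross_attention(content, style, mask):
    """Generic masked cross attention (illustrative, not the STRAT module).

    content: (Nc, d) array used as queries.
    style:   (Ns, d) array used as keys and values.
    mask:    (Nc, Ns) boolean array, True where attention is allowed.
    Returns the attended features (Nc, d) and the weights (Nc, Ns).
    """
    if not mask.any(axis=-1).all():
        # A fully masked row would make the softmax undefined.
        raise ValueError("each content token needs at least one allowed style token")
    d = content.shape[-1]
    scores = content @ style.T / np.sqrt(d)          # scaled dot-product scores
    scores = np.where(mask, scores, -np.inf)         # block disallowed pairs
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)                          # exp(-inf) evaluates to 0
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ style, weights

# Hypothetical region labels: tokens attend only within the same region.
rng = np.random.default_rng(0)
content = rng.normal(size=(4, 8))
style = rng.normal(size=(3, 8))
content_regions = np.array([0, 0, 1, 1])
style_regions = np.array([0, 1, 1])
mask = content_regions[:, None] == style_regions[None, :]

out, weights = masked_cross_attention(content, style, mask)
```

In this toy setup, content tokens in region 0 can see only the single style token labeled 0, so their output is exactly that style token's feature vector, while region 1 tokens receive a weighted mix of the two style tokens labeled 1.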
Pages: 15