Exploring the Temporal Consistency of Arbitrary Style Transfer: A Channelwise Perspective

被引:17
|
作者
Kong, Xiaoyu [1 ,2 ]
Deng, Yingying [3 ]
Tang, Fan [4 ]
Dong, Weiming [3 ]
Ma, Chongyang [5 ]
Chen, Yongyong
He, Zhenyu [6 ,7 ]
Xu, Changsheng [3 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun 130012, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518073, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[5] Kuaishou Technol, Beijing 100085, Peoples R China
[6] Harbin Inst Technol, Dept Comp Sci, Shenzhen 518073, Peoples R China
[7] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
Correlation; Task analysis; Optical imaging; Integrated optics; Lighting; Optical fiber networks; Image reconstruction; Arbitrary stylization; channel correlation; cross-domain; feature migration;
D O I
10.1109/TNNLS.2022.3230084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Arbitrary image stylization by neural networks has become a popular topic, and video stylization is attracting more attention as an extension of image stylization. However, when image stylization methods are applied to videos, unsatisfactory results that suffer from severe flickering effects appear. In this article, we conducted a detailed and comprehensive analysis of the cause of such flickering effects. Systematic comparisons among typical neural style transfer approaches show that the feature migration modules for state-of-the-art (SOTA) learning systems are ill-conditioned and could lead to a channelwise misalignment between the input content representations and the generated frames. Unlike traditional methods that relieve the misalignment via additional optical flow constraints or regularization modules, we focus on keeping the temporal consistency by aligning each output frame with the input frame. To this end, we propose a simple yet efficient multichannel correlation network (MCCNet), to ensure that output frames are directly aligned with inputs in the hidden feature space while maintaining the desired style patterns. An inner channel similarity loss is adopted to eliminate side effects caused by the absence of nonlinear operations such as softmax for strict alignment. Furthermore, to improve the performance of MCCNet under complex light conditions, we introduce an illumination loss during training. Qualitative and quantitative evaluations demonstrate that MCCNet performs well in arbitrary video and image style transfer tasks.
引用
收藏
页码:8482 / 8496
页数:15
相关论文
共 50 条
  • [21] DETAIL-PRESERVING ARBITRARY STYLE TRANSFER
    Zhu, Ling
    Liu, Shiguang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [22] A SIMPLE WAY OF MULTIMODAL AND ARBITRARY STYLE TRANSFER
    Anh-Duc Nguyen
    Choi, Seonghwa
    Kim, Woojae
    Lee, Sanghoon
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1752 - 1756
  • [23] Arbitrary Style Transfer with Adaptive Channel Network
    Wang, Yuzhuo
    Geng, Yanlin
    MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 481 - 492
  • [24] CLAST: Contrastive Learning for Arbitrary Style Transfer
    Wang, Xinhao
    Wang, Wenjing
    Yang, Shuai
    Liu, Jiaying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6761 - 6772
  • [25] Assessing arbitrary style transfer like an artist
    Chen, Hangwei
    Shao, Feng
    Mu, Baoyang
    Jiang, Qiuping
    DISPLAYS, 2024, 85
  • [26] Arbitrary Style Transfer with Deep Feature Reshuffle
    Gu, Shuyang
    Chen, Congliang
    Liao, Jing
    Yuan, Lu
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8222 - 8231
  • [27] Style-Aware Normalized Loss for Improving Arbitrary Style Transfer
    Cheng, Jiaxin
    Jaiswal, Ayush
    Wu, Yue
    Natarajan, Pradeep
    Natarajan, Prem
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 134 - 143
  • [28] PARAMETER-FREE STYLE PROJECTION FOR ARBITRARY IMAGE STYLE TRANSFER
    Huang, Siyu
    Xiong, Haoyi
    Wang, Tianyang
    Wen, Bihan
    Wang, Qingzhong
    Chen, Zeyu
    Huan, Jun
    Dou, Dejing
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2070 - 2074
  • [29] LIGHT FIELD STYLE TRANSFER WITH LOCAL ANGULAR CONSISTENCY
    Egan, Donal
    Alain, Martin
    Smolic, Aljosa
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2300 - 2304
  • [30] Arbitrary Style Transfer with Parallel Self-Attention
    Zhang, Tiange
    Gao, Ying
    Gao, Feng
    Qi, Lin
    Dong, Junyu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1406 - 1413