Pseudo-global strategy-based visual comfort assessment considering attention mechanism

被引:0
|
作者
Li, Sumei [1 ]
Zhang, Huilin [1 ]
Zhou, Mingyue [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China
关键词
Visual comfort assessment; Stereo image; Data augmentation; Attention mechanism; DISCOMFORT; DISPARITY;
D O I
10.1007/s00530-024-01570-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Assessing the comfort of stereo images contributes significantly to crafting immersive stereo scenes, thereby enriching the viewer's perceptual experience. However, deep learning-based visual comfort assessment (VCA) has encountered challenges due to data deficiency. To address this problem and maximize the potential of deep learning in the VCA task, this paper proposes a pseudo-global strategy-based convolutional neural network (CNN), considering the attention mechanism. Our data augmentation method utilizes random cropping and permutation, coupled with a pseudo-global strategy that fuses multi-region local features as pseudo-global features to substitute global features, effectively expanding databases while aligning input patches and labels during training. We also introduce attention mechanisms to focus on the different impacts of disparities in various regions on the overall comfort of a stereo image. Specifically, dilated spatial attention and channel self-attention are designed in the local and pseudo-global feature extraction stages, respectively, simulating the saliency of human perception. Experimental results show that the proposed method is superior to the state-of-the-art VCA approaches and has excellent generalization ability.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Strategy-based Fault Handling Mechanism for Composite Service
    Jiang, Weihao
    Ma, Dianfu
    Zhao, Yongwang
    2013 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2013, : 1297 - 1301
  • [2] VISUAL COMFORT ASSESSMENT OF STEREOSCOPIC IMAGES USING DEEP VISUAL AND DISPARITY FEATURES BASED ON HUMAN ATTENTION
    Jeong, Hyunwook
    Kim, Hak Gu
    Ro, Yong Man
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 715 - 719
  • [3] Visual comfort enhancement study based on visual attention detection for stereoscopic displays
    Xia, Zhenping
    Cheng, Cheng
    Li, Xiaohua
    JOURNAL OF THE SOCIETY FOR INFORMATION DISPLAY, 2016, 24 (10) : 633 - 640
  • [4] No-Reference Stereoscopic Image Quality Assessment Based On Visual Attention Mechanism
    Li, Sumei
    Zhao, Ping
    Chang, Yongli
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 326 - 329
  • [5] Leveraging visual attention and neural activity for stereoscopic 3D visual comfort assessment
    Jiang, Qiuping
    Shao, Feng
    Jiang, Gangyi
    Yu, Mei
    Peng, Zongju
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (07) : 9405 - 9425
  • [6] Leveraging visual attention and neural activity for stereoscopic 3D visual comfort assessment
    Qiuping Jiang
    Feng Shao
    Gangyi Jiang
    Mei Yu
    Zongju Peng
    Multimedia Tools and Applications, 2017, 76 : 9405 - 9425
  • [7] Active learning strategy-based reliability assessment on the wear of spur gears
    Qian, Hua-Ming
    Huang, Tudi
    Wei, Jing
    Huang, Hong-Zhong
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2023, 37 (12) : 6467 - 6476
  • [8] Active learning strategy-based reliability assessment on the wear of spur gears
    Hua-Ming Qian
    Tudi Huang
    Jing Wei
    Hong-Zhong Huang
    Journal of Mechanical Science and Technology, 2023, 37 : 6467 - 6476
  • [9] MTPA With Pseudo-FOC Strategy-Based BLDC for Minimization of Copper Loss and Torque Ripple
    Singh, R. Raja
    Kalel, Dattatraya Dilip
    Stonier, Albert Alexander
    Peter, Geno
    Stephen, Valantina
    Arun, Vijayakumar
    Ganji, Vivekananda
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2025, 2025 (01)
  • [10] Image segmentation based on visual attention mechanism
    Zhang, Qiaorong
    Gu, Guochang
    Xiao, Huimin
    Journal of Multimedia, 2009, 4 (06): : 363 - 370