Pseudo-global strategy-based visual comfort assessment considering attention mechanism

被引:0
|
作者
Li, Sumei [1 ]
Zhang, Huilin [1 ]
Zhou, Mingyue [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China
关键词
Visual comfort assessment; Stereo image; Data augmentation; Attention mechanism; DISCOMFORT; DISPARITY;
D O I
10.1007/s00530-024-01570-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Assessing the comfort of stereo images contributes significantly to crafting immersive stereo scenes, thereby enriching the viewer's perceptual experience. However, deep learning-based visual comfort assessment (VCA) has encountered challenges due to data deficiency. To address this problem and maximize the potential of deep learning in the VCA task, this paper proposes a pseudo-global strategy-based convolutional neural network (CNN), considering the attention mechanism. Our data augmentation method utilizes random cropping and permutation, coupled with a pseudo-global strategy that fuses multi-region local features as pseudo-global features to substitute global features, effectively expanding databases while aligning input patches and labels during training. We also introduce attention mechanisms to focus on the different impacts of disparities in various regions on the overall comfort of a stereo image. Specifically, dilated spatial attention and channel self-attention are designed in the local and pseudo-global feature extraction stages, respectively, simulating the saliency of human perception. Experimental results show that the proposed method is superior to the state-of-the-art VCA approaches and has excellent generalization ability.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Multi-surrogates and multi-points infill strategy-based global optimization method
    Ye, Pengcheng
    Pan, Guang
    ENGINEERING WITH COMPUTERS, 2023, 39 (02) : 1617 - 1636
  • [32] An antiphishing strategy based on visual similarity assessment
    Liu, WY
    Deng, XT
    Huang, GL
    Fu, AY
    IEEE INTERNET COMPUTING, 2006, 10 (02) : 58 - 65
  • [33] Online Shopping Aging Design Strategy Based on Visual Attention
    Xue, Yanmin
    Chen, Shuting
    Liu, Yang
    HCI INTERNATIONAL 2024 POSTERS, PT II, HCII 2024, 2024, 2115 : 255 - 265
  • [34] An objective stereoscopic image visual comfort assessment metric based on visual important regions
    Jiang, Qiu-Ping
    Shao, Feng
    Jiang, Gang-Yi
    Yu, Mei
    Peng, Zong-Ju
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2014, 36 (04): : 875 - 881
  • [35] Assessment of feature fusion strategies in visual attention mechanism for saliency detection
    Jian, Muwei
    Zhou, Quan
    Cui, Chaoran
    Nie, Xiushan
    Luo, Hanjiang
    Zhao, Jianli
    Yin, Yilong
    PATTERN RECOGNITION LETTERS, 2019, 127 : 37 - 47
  • [36] Deformable Convolution Based No-Reference Stereoscopic Image Quality Assessment Considering Visual Feedback Mechanism
    Zhou, Mingyue
    Li, Sumei
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [37] Model-based shading and lighting controls considering visual comfort and energy use
    Xiong, Jie
    Tzempelikos, Athanasios
    SOLAR ENERGY, 2016, 134 : 416 - 428
  • [38] Optimal transport strategy-based meta-attention network for fault diagnosis of rotating machinery with zero sample
    Wu, Ke
    Yu, Kaiwei
    Chen, Chong
    Wu, Jun
    Liu, Yan
    APPLIED INTELLIGENCE, 2024, 54 (9-10) : 6799 - 6815
  • [39] 3-D Visual Discomfort Assessment Considering Optical and Neural Attention Models
    Yang, Jiachen
    Vanhung Nguyen
    Sim, Kyohoon
    Zhao, Yang
    Lu, Wen
    IEEE TRANSACTIONS ON BROADCASTING, 2020, 66 (02) : 279 - 291
  • [40] Subjective Visual Comfort Assessment Based on Fusion Time for Depth Information
    Yue, Guanghui
    Hou, Chunping
    Lu, Kaining
    Feng, Dandan
    Li, Yao
    2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE), 2016, : 733 - 737