Perception-Oriented U-Shaped Transformer Network for 360-Degree No-Reference Image Quality Assessment

被引:40
|
作者
Zhou, Mingliang [1 ]
Chen, Lei [1 ]
Wei, Xuekai [1 ]
Liao, Xingran [2 ]
Mao, Qin [3 ,4 ]
Wang, Heqiang [1 ]
Pu, Huayan [5 ]
Luo, Jun [5 ]
Xiang, Tao [1 ]
Fang, Bin [1 ]
机构
[1] Chongqing Univ, Sch Comp Sci, Chongqing 400044, Peoples R China
[2] City Univ Hong Kong, Comp Sci Dept, Hong Kong, Peoples R China
[3] Qiannan Normal Coll Nationalities, Coll Comp & Informat, Duyun 558000, Peoples R China
[4] Qiannan Normal Univ Nationalities, Sch Comp & Informat, Key Lab Complex Syst & Intelligent Optimizat Guizh, Duyun 558000, Peoples R China
[5] Chongqing Univ, State Key Lab Mech Transmiss, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Image quality assessment; no-reference image quality assessment; 360-degree image; U-shaped transformer; OMNIDIRECTIONAL IMAGE; SALIENCY; CNN;
D O I
10.1109/TBC.2022.3231101
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Generally, 360-degree images have absolute senses of reality and three-dimensionality, providing a wide range of immersive interactions. Due to the novel rendering and display technology of 360-degree images, they have more complex perceptual characteristics than other images. It is challenging to perform comprehensive image quality assessment (IQA) learning by simply stacking multichannel neural network architectures for pre/postprocessing, compression, and rendering tasks. To thoroughly learn the global and local features in 360-degree images, reduce the complexity of multichannel neural network models and simplify the training process, this paper proposes a joint architecture with user perception and an efficient transformer dedicated to 360-degree no-reference (NR) IQA. The input of the proposed method is a 360-degree cube map projection (CMP) image. Furthermore, the proposed 360-degree NRIQA method includes a saliency map-based non-overlapping self-attention selection module and a U-shaped transformer (U-former)-based feature extraction module to account for perceptual region importance and projection distortion. The transformer-based architecture and the weighted average technique are jointly utilized for predicting local perceptual quality. Experimental results obtained on widely used databases show that the proposed model outperforms other state-of-the-art methods in NR 360-degree image quality evaluation cases. Furthermore, a cross-database evaluation and an ablation study also demonstrate the inherent robustness and generalization ability of the proposed model.
引用
收藏
页码:396 / 405
页数:10
相关论文
共 50 条
  • [1] Adaptive Hypergraph Convolutional Network for No-Reference 360-degree Image Quality Assessment
    Fu, Jun
    Hou, Chen
    Zhou, Wei
    Xu, Jiahua
    Chen, Zhibo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 961 - 969
  • [2] StyleAM: Perception-Oriented Unsupervised Domain Adaption for No-Reference Image Quality Assessment
    Lu, Yiting
    Li, Xin
    Liu, Jianzhao
    Chen, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2043 - 2058
  • [3] Saliency and Depth-Aware Full Reference 360-Degree Image Quality Assessment
    Wei, Xuekai
    Huang, Qunyue
    Fang, Bin
    Ouyang, Lei
    Xian, Weizhi
    Luo, Jun
    Pu, Huayan
    Xu, Xueyong
    Lu, Chang
    Nan, Hao
    Liu, Xu
    Li, Yachao
    Zhou, Mingliang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [4] No-Reference Quality Assessment for 360-Degree Images by Analysis of Multifrequency Information and Local-Global Naturalness
    Zhou, Wei
    Xu, Jiahua
    Jiang, Qiuping
    Chen, Zhibo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 1778 - 1791
  • [5] PERCEPTION-ORIENTED OMNIDIRECTIONAL IMAGE SUPER-RESOLUTION BASED ON TRANSFORMER NETWORK
    An, Hongyu
    Zhang, Xinfeng
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3583 - 3587
  • [6] Dual-attention pyramid transformer network for No-Reference Image Quality Assessment
    Ma, Jiliang
    Chen, Yihua
    Chen, Lv
    Tang, Zhenjun
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 257
  • [7] Collaborative transformer U-shaped network for medical image segmentation
    Gao, Yufei
    Zhang, Shichao
    Shi, Lei
    Zhao, Guohua
    Shi, Yucheng
    APPLIED SOFT COMPUTING, 2025, 173
  • [8] Lightweight transformer and multi-head prediction network for no-reference image quality assessment
    Tang, Zhenjun
    Chen, Yihua
    Chen, Zhiyuan
    Liang, Xiaoping
    Zhang, Xianquan
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (04): : 1947 - 1957
  • [9] Lightweight transformer and multi-head prediction network for no-reference image quality assessment
    Zhenjun Tang
    Yihua Chen
    Zhiyuan Chen
    Xiaoping Liang
    Xianquan Zhang
    Neural Computing and Applications, 2024, 36 : 1931 - 1946
  • [10] HIERARCHICAL FEATURE FUSION TRANSFORMER FOR NO-REFERENCE IMAGE QUALITY ASSESSMENT
    Wang, Zesheng
    Wu, Wei
    Yuan, Liang
    Sun, Wei
    Chen, Ying
    Li, Kai
    Zhai, Guangtao
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2205 - 2209