Transformer-Based Light Field Geometry Learning for No-Reference Light Field Image Quality Assessment

被引:3
|
作者
Lin, Lili [1 ]
Bai, Siyu [1 ]
Qu, Mengjia [1 ]
Wei, Xuehui [2 ]
Wang, Luyao [2 ]
Wu, Feifan [1 ]
Liu, Biao [1 ]
Zhou, Wenhui [2 ]
Kuruoglu, Ercan Engin [3 ]
机构
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[3] Tsinghua Univ, Data Sci & Informat Technol Ctr, Tsinghua Berkeley Shenzhen Inst, Shenzhen 518071, Peoples R China
关键词
Light field; no-reference quality assessment; light field geometry; geometric distortion learning; spatial-shift tokenization; STRUCTURAL SIMILARITY;
D O I
10.1109/TBC.2024.3353579
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Elevating traditional 2-dimensional (2D) plane display to 4-dimensional (4D) light field display can significantly enhance users' immersion and realism, because light field image (LFI) provides various visual cues in terms of multi-view disparity, motion disparity, and selective focus. Therefore, it is crucial to establish a light field image quality assessment (LF-IQA) model that aligns with human visual perception characteristics. However, it has always been a challenge to evaluate the perceptual quality of multiple light field visual cues simultaneously and consistently. To this end, this paper proposes a Transformer-based explicit learning of light field geometry for the no-reference light field image quality assessment. Specifically, to explicitly learn the light field epipolar geometry, we stack up light field sub-aperture images (SAIs) to form four SAI stacks according to four specific light field angular directions, and use a sub-grouping strategy to hierarchically learn the local and global light field geometric features. Then, a Transformer encoder with a spatial-shift tokenization strategy is applied to learn structure-aware light field geometric distortion representation, which is used to regress the final quality score. Evaluation experiments are carried out on three commonly used light field image quality assessment datasets: Win5-LID, NBU-LF1.0, and MPI-LFA. Experimental results demonstrate that our model outperforms state-of-the-art methods and exhibits a high correlation with human perception. The source code is publicly available at https://github.com/windyz77/GeoNRLFIQA.
引用
收藏
页码:597 / 606
页数:10
相关论文
共 50 条
  • [1] No-Reference Light Field Image Quality Assessment Exploiting Saliency
    Lamichhane, Kamal
    Neri, Michael
    Battisti, Federica
    Paudyal, Pradip
    Carli, Marco
    IEEE TRANSACTIONS ON BROADCASTING, 2023, 69 (03) : 790 - 800
  • [2] Tensor Oriented No-Reference Light Field Image Quality Assessment
    Zhou, Wei
    Shi, Likun
    Chen, Zhibo
    Zhang, Jinglin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 4070 - 4084
  • [3] No-Reference Light Field Image Quality Assessment Based on Micro-Lens Image
    Luo, Ziyuan
    Zhou, Wei
    Shi, Likun
    Chen, Zhibo
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
  • [4] Transformer-Based No-Reference Image Quality Assessment via Supervised Contrastive Learning
    Shi, Jinsong
    Gao, Pan
    Qin, Jie
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4829 - 4837
  • [5] No-reference light field image quality assessment based on depth, structural and angular information
    Xiang, Jianjun
    Jiang, Gangyi
    Yu, Mei
    Bai, Yongqiang
    Zhu, Zhongjie
    SIGNAL PROCESSING, 2021, 184
  • [6] No-Reference Light Field Image Quality Assessment Based on Spatial-Angular Measurement
    Shi, Likun
    Zhou, Wei
    Chen, Zhibo
    Zhang, Jinglin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4114 - 4128
  • [7] Spatial-angular features based no-reference light field quality assessment
    Yu, Zerui
    Li, Fei
    Zhou, Zhiheng
    Tao, Xiyuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265
  • [8] No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform
    Xiang, Jianjun
    Jiang, Gangyi
    Yu, Mei
    Jiang, Zhidi
    Ho, Yo-Sung
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 457 - 472
  • [9] No-reference image quality assessment based on improved vision transformer and transfer learning
    Zhang, Bo
    Wang, Luoxi
    Zhang, Cheng
    Zhao, Ran
    Sun, Jinlu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 135
  • [10] NO-REFERENCE QUALITY EVALUATION OF LIGHT FIELD CONTENT BASED ON STRUCTURAL REPRESENTATION OF THE EPIPOLAR PLANE IMAGE
    Ak, Ali
    Ling, Suiyi
    Le Callet, Patrick
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,