Omnidirectional Image Quality Assessment by Distortion Discrimination Assisted Multi-Stream Network

被引:40
|
作者
Zhou, Yu [1 ,2 ]
Sun, Yanjing [1 ,2 ]
Li, Leida [3 ,4 ]
Gu, Ke [5 ,6 ]
Fang, Yuming [7 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] Xuzhou Engn Res Ctr Intelligent Ind Safety & Emer, Xuzhou 221116, Jiangsu, Peoples R China
[3] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510555, Peoples R China
[4] Pazhou Lab, Guangzhou 510330, Peoples R China
[5] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[6] Beijing Univ Technol, Engn Res Ctr Intelligent Percept & Autonomous Con, Beijing Artificial Intelligence Inst,Beijing Lab, Minist Educ,Beijing Key Lab Computat Intelligence, Beijing 100124, Peoples R China
[7] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330013, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Measurement; Distortion; Quality assessment; Sun; Image coding; Visualization; Image quality assessment; virtual reality (VR); omnidirectional image (OI); viewport generation; distortion discrimination; INDEX; DEGRADATION; STATISTICS;
D O I
10.1109/TCSVT.2021.3081162
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Omnidirectional image (OI) quality assessment is crucial to facilitate the development of virtual reality (VR) related technology. In this work, a distortion discrimination assisted multi-stream network is proposed for OI quality assessment. The multi-stream architecture is constructed by generating the viewport images received by the retina at one point to simulate the characteristics of humans perceiving VR contents. Additionally, the strategy of generating several viewport image sets from one OI is proposed for data augmentation. Furthermore, the facts that the human brain has the ability for both quality assessment and distortion type distinguishment, and the process of human brain handling two tasks exists information interaction inspire us to employ an auxiliary distortion discrimination task to facilitate the quality assessment task learning. Extensive experiments conducted on two public OI databases demonstrate the superiority of the proposed method to both traditional 2D quality metrics and existing metrics specific for OIs. Moreover, utilizing the assistant task is proven to be more effective than the single task learning for OI quality evaluation. Better generalization performance is also verified to be another valuable trait of the proposed method.
引用
收藏
页码:1767 / 1777
页数:11
相关论文
共 50 条
  • [1] Multi-Stream Fusion Network for Multi-Distortion Image Super-Resolution
    Wen, Yang
    Xu, Yupeng
    Sheng, Bin
    Li, Ping
    Bi, Lei
    Kim, Jinman
    He, Xiangui
    Xu, Xun
    ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 242 - 251
  • [2] A Multi-Stream Fusion Network for Image Splicing Localization
    Siopi, Maria
    Kordopatis-Zilos, Giorgos
    Charitidis, Polychronis
    Kompatsiaris, Ioannis
    Papadopoulos, Symeon
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 611 - 622
  • [3] A Novel Multi-Stream Fusion Network for Underwater Image Enhancement
    Tang, Guijin
    Duan, Lian
    Zhao, Haitao
    Liu, Feng
    CHINA COMMUNICATIONS, 2024, 21 (02) : 166 - 182
  • [4] A Novel Multi-Stream Fusion Network for Underwater Image Enhancement
    Guijin Tang
    Lian Duan
    Haitao Zhao
    Feng Liu
    China Communications, 2024, 21 (02) : 166 - 182
  • [5] QualityNet: A multi-stream fusion framework with spatial and channel attention for blind image quality assessment
    Aslam, Muhammad Azeem
    Wei, Xu
    Khalid, Hassan
    Ahmed, Nisar
    Zhu, Shuangtong
    Liu, Xin
    Xu, Yimei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [6] A multi-stream network for retrosynthesis prediction
    Qiang Zhang
    Juan Liu
    Wen Zhang
    Feng Yang
    Zhihui Yang
    Xiaolei Zhang
    Frontiers of Computer Science, 2024, 18
  • [7] A multi-stream network for retrosynthesis prediction
    Zhang, Qiang
    Liu, Juan
    Zhang, Wen
    Yang, Feng
    Yang, Zhihui
    Zhang, Xiaolei
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (02)
  • [8] Gaze-Assisted Multi-Stream Deep Neural Network for Action Recognition
    Liu, Yinan
    Wu, Qingbo
    Tang, Liangzhi
    Shi, Hengcan
    IEEE ACCESS, 2017, 5 : 19432 - 19441
  • [9] Multi-Stream Dense View Reconstruction Network for Light Field Image Compression
    Liu, Deyang
    Huang, Yan
    Fang, Yuming
    Zuo, Yifan
    An, Ping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4400 - 4414
  • [10] Image Aesthetics Assessment Based on Multi-stream CNN Architecture and Saliency Features
    Takimoto, Hironori
    Omori, Fumiya
    Kanagawa, Akihiro
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (01) : 25 - 40