Omnidirectional Image Quality Assessment by Distortion Discrimination Assisted Multi-Stream Network

被引:40
|
作者
Zhou, Yu [1 ,2 ]
Sun, Yanjing [1 ,2 ]
Li, Leida [3 ,4 ]
Gu, Ke [5 ,6 ]
Fang, Yuming [7 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] Xuzhou Engn Res Ctr Intelligent Ind Safety & Emer, Xuzhou 221116, Jiangsu, Peoples R China
[3] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510555, Peoples R China
[4] Pazhou Lab, Guangzhou 510330, Peoples R China
[5] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[6] Beijing Univ Technol, Engn Res Ctr Intelligent Percept & Autonomous Con, Beijing Artificial Intelligence Inst,Beijing Lab, Minist Educ,Beijing Key Lab Computat Intelligence, Beijing 100124, Peoples R China
[7] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330013, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Measurement; Distortion; Quality assessment; Sun; Image coding; Visualization; Image quality assessment; virtual reality (VR); omnidirectional image (OI); viewport generation; distortion discrimination; INDEX; DEGRADATION; STATISTICS;
D O I
10.1109/TCSVT.2021.3081162
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Omnidirectional image (OI) quality assessment is crucial to facilitate the development of virtual reality (VR) related technology. In this work, a distortion discrimination assisted multi-stream network is proposed for OI quality assessment. The multi-stream architecture is constructed by generating the viewport images received by the retina at one point to simulate the characteristics of humans perceiving VR contents. Additionally, the strategy of generating several viewport image sets from one OI is proposed for data augmentation. Furthermore, the facts that the human brain has the ability for both quality assessment and distortion type distinguishment, and the process of human brain handling two tasks exists information interaction inspire us to employ an auxiliary distortion discrimination task to facilitate the quality assessment task learning. Extensive experiments conducted on two public OI databases demonstrate the superiority of the proposed method to both traditional 2D quality metrics and existing metrics specific for OIs. Moreover, utilizing the assistant task is proven to be more effective than the single task learning for OI quality evaluation. Better generalization performance is also verified to be another valuable trait of the proposed method.
引用
收藏
页码:1767 / 1777
页数:11
相关论文
共 50 条
  • [31] Multi-stream Point-based model for Blind Geometric Point Cloud Quality Assessment
    Bourbia, Salima
    Karine, Ayoub
    Chetouani, Aladine
    El Hassouni, Mohammed
    Jridi, Maher
    20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 224 - 228
  • [32] Refined-mask guided multi-stream blending network
    Wang, Shuo
    Lv, Weijie
    Zhao, Xinyuan
    Zhang, Xinyu
    Su, Junyu
    Zeng, Long
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 56445 - 56462
  • [33] Multi-stream graph attention network for recommendation with knowledge graph
    Hu, Zhifei
    Xia, Feng
    JOURNAL OF WEB SEMANTICS, 2024, 82
  • [34] Multi-Stream Refining Network for Person Re-Identification
    Wang, Xu
    Huang, Yan
    Wang, Qicong
    Chen, Yan
    Shen, Yehu
    IEEE ACCESS, 2021, 9 : 6596 - 6607
  • [35] Virtual Path Implementation of Multi-stream Routing in Network on Chip
    Chojnacki, Bartosz
    Maka, Tomasz
    Dziurzanski, Piotr
    PARALLEL COMPUTING TECHNOLOGIES, 2011, 6873 : 431 - 436
  • [36] Multi-stream Network for Human-object Interaction Detection
    Wang, Chang
    Sun, Jinyu
    Ma, Shiwei
    Lu, Yuqiu
    Liu, Wang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (08)
  • [37] MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction
    Ham, Je-Seok
    Bae, Kangmin
    Moon, Jinyoung
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 13801 LNCS : 663 - 679
  • [38] Stochastic Fusion for Multi-stream Neural Network in Video Classification
    Huang, Yu-Min
    Tseng, Huan-Hsin
    Chien, Jen-Tzung
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 69 - 74
  • [39] Multi-stream Network With Temporal Attention For Environmental Sound Classification
    Li, Xinyu
    Chebiyyam, Venkata
    Kirchhoff, Katrin
    INTERSPEECH 2019, 2019, : 3604 - 3608
  • [40] Multi-Angle Projection Based Blind Omnidirectional Image Quality Assessment
    Jiang, Hao
    Jiang, Gangyi
    Yu, Mei
    Luo, Ting
    Xu, Haiyong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4211 - 4223