Dissecting the effectiveness of deep features as metric of perceptual image quality

被引:0
|
作者
Hernandez-Camara, Pablo [1 ]
Vila-Tomas, Jorge [1 ]
Laparra, Valero [1 ]
Malo, Jesus [1 ]
机构
[1] Univ Valencia, Image Proc Lab, Paterna 46980, Spain
关键词
Image quality; Neural networks; Visual neuroscience; Functional principle; Learning environment; Architecture; MODELS; INFORMATION; VISION;
D O I
10.1016/j.neunet.2025.107189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is an open debate on the role of artificial networks to understand the visual brain. Internal representations of images in artificial networks develop human-like properties. In particular, evaluating distortions using differences between internal features is correlated to human perception of distortion. However, the origins of this correlation are not well understood. Here, we dissect the different factors involved in the emergence of human-like behavior: function, architecture, and environment. To do so, we evaluate the aforementioned human-network correlation at different depths of 46 pre-trained model configurations that include no psycho-visual information. The results show that most of the models correlate better with human opinion than SSIM (a de-facto standard in subjective image quality). Moreover, some models are better than state-of-the-art networks specifically tuned for the application (LPIPS, DISTS). Regarding the function, supervised classification leads to nets that correlate better with humans than the explored models for self- and non-supervised tasks. However, we found that better performance in the task does not imply more human behavior. Regarding the architecture, simpler models correlate better with humans than very deep nets and generally, the highest correlation is not achieved in the last layer. Finally, regarding the environment, training with large natural datasets leads to bigger correlations than training in smaller databases with restricted content, as expected. We also found that the best classification models are not the best for predicting human distances. In the general debate about understanding human vision, our empirical findings imply that explanations have not to be focused on a single abstraction level, but all function, architecture, and environment are relevant.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
    Zhang, Richard
    Isola, Phillip
    Efros, Alexei A.
    Shechtman, Eli
    Wang, Oliver
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 586 - 595
  • [2] Perceptual metric for image quality
    Liu, Y
    Zhou, H
    IEEE-EMBS ASIA PACIFIC CONFERENCE ON BIOMEDICAL ENGINEERING - PROCEEDINGS, PTS 1 & 2, 2000, : 266 - 267
  • [3] Perceptual image quality assessment metric using mutual information of Gabor features
    DING Yong
    ZHANG Yuan
    WANG Xiang
    YAN XiaoLang
    KRYLOV Andrey S.
    ScienceChina(InformationSciences), 2014, 57 (03) : 130 - 138
  • [4] Perceptual image quality assessment metric using mutual information of Gabor features
    Yong Ding
    Yuan Zhang
    Xiang Wang
    XiaoLang Yan
    Andrey S. Krylov
    Science China Information Sciences, 2014, 57 : 1 - 9
  • [5] Perceptual image quality assessment metric using mutual information of Gabor features
    Ding Yong
    Zhang Yuan
    Wang Xiang
    Yan XiaoLang
    Krylov, Andrey S.
    SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (03) : 1 - 9
  • [6] A Perceptual Blind Blur Image Quality Metric
    Kerouh, Fatma
    Serir, Amina
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [7] A Perceptual Quality Metric for Performance Evaluation of Image Fusion
    Jian, Muwei
    Ma, Ping
    Jia, Jianfeng
    IITSI 2009: SECOND INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS, 2009, : 148 - +
  • [8] Perceptual Dissimilarity Metric: A Full Reference Objective Image Quality Measure to Quantify the Degradation of Perceptual Image Quality
    Saha, Sajib
    Tahtali, Murat
    Lambert, Andrew
    Pickering, Mark
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 327 - 332
  • [9] Dissecting Deep Metric Learning Losses for Image-Text Retrieval
    Xuan, Hong
    Chen, Xi
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2163 - 2172
  • [10] Deep ensembling for perceptual image quality assessment
    Ahmed, Nisar
    Asif, H. M. Shahzad
    Bhatti, Abdul Rauf
    Khan, Atif
    SOFT COMPUTING, 2022, 26 (16) : 7601 - 7622