Advancing Real-World Stereoscopic Image Super-Resolution via Vision-Language Model

被引:0
|
作者
Zhang, Zhe [1 ,2 ]
Lei, Jianjun [1 ]
Peng, Bo [1 ]
Zhu, Jie [1 ]
Xu, Liying [1 ]
Huang, Qingming [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ Commerce, Sch Informat Engn, Tianjin 300134, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo image processing; Degradation; Superresolution; Visualization; Image reconstruction; Training; Iterative methods; Solid modeling; Computational modeling; Cognition; Super-resolution; stereoscopic image; vision-language model;
D O I
10.1109/TIP.2025.3546470
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the remarkable success of the vision-language model in various computer vision tasks. However, how to exploit the semantic language knowledge of the vision-language model to advance real-world stereoscopic image super-resolution remains a challenging problem. This paper proposes a vision-language model-based stereoscopic image super-resolution (VLM-SSR) method, in which the semantic language knowledge in CLIP is exploited to facilitate stereoscopic image SR in a training-free manner. Specifically, by designing visual prompts for CLIP to infer the region similarity, a prompt-guided information aggregation mechanism is presented to capture inter-view information among relevant regions between the left and right views. Besides, driven by the prior knowledge of CLIP, a cognition prior-driven iterative enhancing mechanism is presented to optimize fuzzy regions adaptively. Experimental results on four datasets verify the effectiveness of the proposed method.
引用
收藏
页码:2187 / 2197
页数:11
相关论文
共 50 条
  • [21] Real-World Image Super-Resolution as Multi-Task Learning
    Zhang, Wenlong
    Li, Xiaohui
    Shi, Guangyuan
    Chen, Xiangyu
    Zhang, Xiaoyun
    Qiao, Yu
    Wu, Xiao-Ming
    Dong, Chao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [22] Unsupervised Learning for Real-World Super-Resolution
    Lugmayr, Andreas
    Danelljan, Martin
    Timofte, Radu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3408 - 3416
  • [23] Frequency Separation for Real-World Super-Resolution
    Fritsche, Manuel
    Gu, Shuhang
    Timofte, Radu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3599 - 3608
  • [24] Real-World Super-Resolution with Residual Consistency
    Saritas, Erdi
    Ekenel, Hazim Kemal
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [25] Real-World Super-Resolution via Kernel Estimation and Noise Injection
    Ji, Xiaozhong
    Cao, Yun
    Tai, Ying
    Wang, Chengjie
    Li, Jilin
    Huang, Feiyue
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1914 - 1923
  • [26] Toward Real-World Super-Resolution via Adaptive Downsampling Models
    Son, Sanghyun
    Kim, Jaeha
    Lai, Wei-Sheng
    Yang, Ming-Hsuan
    Lee, Kyoung Mu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8657 - 8670
  • [27] Deep Stereoscopic Image Super-Resolution via Interaction Module
    Lei, Jianjun
    Zhang, Zhe
    Fan, Xiaoting
    Yang, Bolan
    Li, Xinxin
    Chen, Ying
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3051 - 3061
  • [28] GDSSR: Toward Real-World Ultra-High-Resolution Image Super-Resolution
    Chi, Yichen
    Yang, Wenming
    Tian, Yapeng
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 95 - 99
  • [29] Toward Real-World Remote Sensing Image Super-Resolution: A New Benchmark and an Efficient Model
    Wang, Jia
    Xiang, Liuyu
    Liu, Lei
    Xu, Jiaochong
    Li, Peipei
    Xu, Qizhi
    He, Zhaofeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [30] Real-world remote sensing image super-resolution via a practical degradation model and a kernel-aware network
    Dong, Runmin
    Mou, Lichao
    Zhang, Lixian
    Fu, Haohuan
    Zhu, Xiao Xiang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 191 : 155 - 170