Advancing Real-World Stereoscopic Image Super-Resolution via Vision-Language Model

被引:0
|
作者
Zhang, Zhe [1 ,2 ]
Lei, Jianjun [1 ]
Peng, Bo [1 ]
Zhu, Jie [1 ]
Xu, Liying [1 ]
Huang, Qingming [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ Commerce, Sch Informat Engn, Tianjin 300134, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo image processing; Degradation; Superresolution; Visualization; Image reconstruction; Training; Iterative methods; Solid modeling; Computational modeling; Cognition; Super-resolution; stereoscopic image; vision-language model;
D O I
10.1109/TIP.2025.3546470
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the remarkable success of the vision-language model in various computer vision tasks. However, how to exploit the semantic language knowledge of the vision-language model to advance real-world stereoscopic image super-resolution remains a challenging problem. This paper proposes a vision-language model-based stereoscopic image super-resolution (VLM-SSR) method, in which the semantic language knowledge in CLIP is exploited to facilitate stereoscopic image SR in a training-free manner. Specifically, by designing visual prompts for CLIP to infer the region similarity, a prompt-guided information aggregation mechanism is presented to capture inter-view information among relevant regions between the left and right views. Besides, driven by the prior knowledge of CLIP, a cognition prior-driven iterative enhancing mechanism is presented to optimize fuzzy regions adaptively. Experimental results on four datasets verify the effectiveness of the proposed method.
引用
收藏
页码:2187 / 2197
页数:11
相关论文
共 50 条
  • [1] Real-World Thermal Image Super-Resolution
    Allahham, Moaaz
    Aakerberg, Andreas
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    ADVANCES IN VISUAL COMPUTING (ISVC 2021), PT I, 2021, 13017 : 3 - 14
  • [2] Frequency Generation for Real-World Image Super-Resolution
    Guan, Wenxue
    Li, Haobo
    Xu, Dawei
    Liu, Jiaxin
    Gong, Shenghua
    Liu, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7029 - 7040
  • [3] Real-World Light Field Image Super-Resolution Via Degradation Modulation
    Wang, Yingqian
    Liang, Zhengyu
    Wang, Longguang
    Yang, Jungang
    An, Wei
    Guo, Yulan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [4] REAL-WORLD IMAGE SUPER-RESOLUTION VIA KERNEL AUGMENTATION AND STOCHASTIC VARIATION
    Zhang, Haiyu
    Zhu, Yu
    Sun, Jinqiu
    Zhang, Yanning
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2506 - 2510
  • [5] Leveraging vision-language prompts for real-world image restoration and enhancement
    Wei, Yanyan
    Zhang, Yilin
    Li, Kun
    Wang, Fei
    Tang, Shengeng
    Zhang, Zhao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 250
  • [6] Real-infraredSR: real-world infrared image super-resolution via thermal imager
    Zhou, Yicheng
    Liu, Yuan
    Yuan, Liyin
    Chen, Qian
    Gu, Guohua
    Sui, Xiubao
    OPTICS EXPRESS, 2023, 31 (22) : 36171 - 36187
  • [7] Dynamic degradation learning for real-world image super-resolution
    Chunxiao Fan
    Qiong Wu
    Xiang Ye
    Signal, Image and Video Processing, 2023, 17 : 315 - 322
  • [8] Dynamic degradation learning for real-world image super-resolution
    Fan, Chunxiao
    Wu, Qiong
    Ye, Xiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (02) : 315 - 322
  • [9] SwinIBSR: Towards real-world infrared image super-resolution
    Shi, Yifeng
    Chen, Nan
    Pu, Yuesheng
    Zhang, Jiqing
    Yao, Libin
    INFRARED PHYSICS & TECHNOLOGY, 2024, 139
  • [10] Exploiting Diffusion Prior for Real-World Image Super-Resolution
    Wang, Jianyi
    Yue, Zongsheng
    Zhou, Shangchen
    Chan, Kelvin C. K.
    Loy, Chen Change
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 5929 - 5949