Advancing Real-World Stereoscopic Image Super-Resolution via Vision-Language Model

被引:0
|
作者
Zhang, Zhe [1 ,2 ]
Lei, Jianjun [1 ]
Peng, Bo [1 ]
Zhu, Jie [1 ]
Xu, Liying [1 ]
Huang, Qingming [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ Commerce, Sch Informat Engn, Tianjin 300134, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo image processing; Degradation; Superresolution; Visualization; Image reconstruction; Training; Iterative methods; Solid modeling; Computational modeling; Cognition; Super-resolution; stereoscopic image; vision-language model;
D O I
10.1109/TIP.2025.3546470
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the remarkable success of the vision-language model in various computer vision tasks. However, how to exploit the semantic language knowledge of the vision-language model to advance real-world stereoscopic image super-resolution remains a challenging problem. This paper proposes a vision-language model-based stereoscopic image super-resolution (VLM-SSR) method, in which the semantic language knowledge in CLIP is exploited to facilitate stereoscopic image SR in a training-free manner. Specifically, by designing visual prompts for CLIP to infer the region similarity, a prompt-guided information aggregation mechanism is presented to capture inter-view information among relevant regions between the left and right views. Besides, driven by the prior knowledge of CLIP, a cognition prior-driven iterative enhancing mechanism is presented to optimize fuzzy regions adaptively. Experimental results on four datasets verify the effectiveness of the proposed method.
引用
收藏
页码:2187 / 2197
页数:11
相关论文
共 50 条
  • [31] Real-World Video Super-Resolution with a Degradation-Adaptive Model
    Lu, Mingxuan
    Zhang, Peng
    SENSORS, 2024, 24 (07)
  • [32] Real-World License Plate Image Super-Resolution via Domain-specific Degradation Modeling
    Luo, Xin
    Huang, Yihao
    Miao, Weikai
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1175 - 1180
  • [33] Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution
    Liang, Jie
    Zeng, Hui
    Zhang, Lei
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 574 - 591
  • [34] A Data Enhancement Method toward Real-World Single Image Super-Resolution
    Yang, Jiaqi
    Li, Qi
    Chen, Yueting
    Xu, Zhihai
    Feng, Huajin
    Wang, Jing
    INTERNATIONAL CONFERENCE ON PHYSICS, PHOTONICS AND OPTICAL ENGINEERING, ICPPOE 2022, 2023, 2440
  • [35] Simple and Efficient Unpaired Real-world Super-Resolution using Image Statistics
    Yoon, Kwangjin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1983 - 1990
  • [36] AIM 2019 Challenge on Real-World Image Super-Resolution: Methods and Results
    Lugmayr, Andreas
    Danelljan, Martin
    Timofte, Radu
    Fritsche, Manuel
    Gu, Shuhang
    Purohit, Kuldeep
    Kandula, Praveen
    Suin, Maitreya
    Rajagopalan, A. N.
    Joon, Nam Hyung
    Won, Yu Seung
    Kim, Guisik
    Kwon, Dokyeong
    Hsu, Chih-Chung
    Lin, Chia-Hsiang
    Huang, Yuanfei
    Sun, Xiaopeng
    Lu, Wen
    Li, Jie
    Gao, Xinbo
    Bell-Kligler, Sefi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3575 - 3583
  • [37] Confidence-Based Iterative Generation for Real-World Image Super-Resolution
    Peng, Jialun
    Luo, Xin
    Fu, Jingjing
    Liu, Dong
    COMPUTER VISION - ECCV 2024, PT LXV, 2025, 15123 : 323 - 341
  • [38] NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results
    Lugmayr, Andreas
    Danelljan, Martin
    Timofte, Radu
    Ahn, Namhyuk
    Bai, Dongwoon
    Cai, Jie
    Cao, Yun
    Chen, Junyang
    Cheng, Kaihua
    Chun, SeYoung
    Deng, Wei
    El-Khamy, Mostafa
    Ho, Chiu Man
    Ji, Xiaozhong
    Kheradmand, Amin
    Kim, Gwantae
    Ko, Hanseok
    Lee, Kanghyu
    Lee, Jungwon
    Li, Hao
    Liu, Ziluan
    Liu, Zhi-Song
    Liu, Shuai
    Lu, Yunhua
    Meng, Zibo
    Michelini, Pablo Navarrete
    Micheloni, Christian
    Prajapati, Kalpesh
    Ren, Haoyu
    Seo, Yong Hyeok
    Siu, Wan-Chi
    Sohn, Kyung-Ah
    Tai, Ying
    Umer, Rao Muhammad
    Wang, Shuangquan
    Wang, Huibing
    Wu, Timothy Haoning
    Wu, Haoning
    Yang, Biao
    Yang, Fuzhi
    Yoo, Jaejun
    Zhao, Tongtong
    Zhou, Yuanbo
    Zhuo, Haijie
    Zong, Ziyao
    Zou, Xueyi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2058 - 2076
  • [39] Semantic Segmentation Guided Real-World Super-Resolution
    Aakerberg, Andreas
    Johansen, Anders S.
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 449 - 458
  • [40] StarSRGAN: Improving Real-World Blind Super-Resolution
    Vo K.D.
    Bui L.T.
    Computer Science Research Notes, 2023, 31 (1-2): : 62 - 72