Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

被引:0
|
作者
Xu, Jiaqi [1 ]
Wu, Mengyang [1 ]
Hu, Xiaowei [2 ]
Fu, Chi-Wing [1 ]
Dou, Qi [1 ]
Heng, Pheng-Ann [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
来源
基金
国家重点研发计划;
关键词
Adverse weather; Deraining; Dehazing; Desnowing;
D O I
10.1007/978-3-031-72649-1_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the limitations of adverse weather image restoration approaches trained on synthetic data when applied to real-world scenarios. We formulate a semi-supervised learning framework employing vision-language models to enhance restoration performance across diverse adverse weather conditions in real-world settings. Our approach involves assessing image clearness and providing semantics using vision-language models on real data, serving as supervision signals for training restoration models. For clearness enhancement, we use real-world data, utilizing a dual-step strategy with pseudo-labels assessed by vision-language models and weather prompt learning. For semantic enhancement, we integrate real-world data by adjusting weather conditions in vision-language model descriptions while preserving semantic meaning. Additionally, we introduce an effective training strategy to bootstrap restoration performance. Our approach achieves superior results in real-world adverse weather image restoration, demonstrated through qualitative and quantitative comparisons with state-of-the-art works.
引用
收藏
页码:147 / 164
页数:18
相关论文
共 7 条
  • [1] Leveraging vision-language prompts for real-world image restoration and enhancement
    Wei, Yanyan
    Zhang, Yilin
    Li, Kun
    Wang, Fei
    Tang, Shengeng
    Zhang, Zhao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 250
  • [2] Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
    Cheng, Kanzhi
    Song, Wenpo
    Ma, Zheng
    Zhu, Wenhao
    Zhu, Zixuan
    Zhang, Jianbing
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5038 - 5047
  • [3] Advancing Real-World Stereoscopic Image Super-Resolution via Vision-Language Model
    Zhang, Zhe
    Lei, Jianjun
    Peng, Bo
    Zhu, Jie
    Xu, Liying
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 2187 - 2197
  • [4] VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
    Bitton, Yonatan
    Bansal, Hritik
    Hessel, Jack
    Shao, Rulin
    Zhu, Wanrong
    Awadalla, Anas
    Gardner, Josh
    Taori, Rohan
    Schimdt, Ludwig
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data
    Varma, Maya
    Delbrouck, Jean-Benoit
    Hooper, Sarah
    Chaudhari, Akshay
    Langlotz, Curtis
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22168 - 22178
  • [6] Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery
    Wang, Chao
    Zheng, Zhedong
    Quan, Ruijie
    Yang, Yi
    COMPUTER VISION-ECCV 2024, PT LXXXII, 2025, 15140 : 379 - 397