Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

被引：0

作者：

Xu, Jiaqi ^{[1
]}

Wu, Mengyang ^{[1
]}

Hu, Xiaowei ^{[2
]}

Fu, Chi-Wing ^{[1
]}

Dou, Qi ^{[1
]}

Heng, Pheng-Ann ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

来源：

COMPUTER VISION-ECCV 2024, PT XVIII | 2025年 / 15076卷

基金：

国家重点研发计划;

关键词：

Adverse weather; Deraining; Dehazing; Desnowing;

D O I：

10.1007/978-3-031-72649-1_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the limitations of adverse weather image restoration approaches trained on synthetic data when applied to real-world scenarios. We formulate a semi-supervised learning framework employing vision-language models to enhance restoration performance across diverse adverse weather conditions in real-world settings. Our approach involves assessing image clearness and providing semantics using vision-language models on real data, serving as supervision signals for training restoration models. For clearness enhancement, we use real-world data, utilizing a dual-step strategy with pseudo-labels assessed by vision-language models and weather prompt learning. For semantic enhancement, we integrate real-world data by adjusting weather conditions in vision-language model descriptions while preserving semantic meaning. Additionally, we introduce an effective training strategy to bootstrap restoration performance. Our approach achieves superior results in real-world adverse weather image restoration, demonstrated through qualitative and quantitative comparisons with state-of-the-art works.

引用

页码：147 / 164

页数：18

共 7 条

[1] Leveraging vision-language prompts for real-world image restoration and enhancement
Wei, Yanyan
Zhang, Yilin
Li, Kun
Wang, Fei
Tang, Shengeng
Zhang, Zhao
COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 250
[2] Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Cheng, Kanzhi
Song, Wenpo
Ma, Zheng
Zhu, Wenhao
Zhu, Zixuan
Zhang, Jianbing
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5038 - 5047
[3] Advancing Real-World Stereoscopic Image Super-Resolution via Vision-Language Model
Zhang, Zhe
Lei, Jianjun
Peng, Bo
Zhu, Jie
Xu, Liying
Huang, Qingming
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 2187 - 2197
[4] VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Bitton, Yonatan
Bansal, Hritik
Hessel, Jack
Shao, Rulin
Zhu, Wanrong
Awadalla, Anas
Gardner, Josh
Taori, Rohan
Schimdt, Ludwig
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[5] ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data
Varma, Maya
Delbrouck, Jean-Benoit
Hooper, Sarah
Chaudhari, Akshay
Langlotz, Curtis
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22168 - 22178
[6] Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery
Wang, Chao
Zheng, Zhedong
Quan, Ruijie
Yang, Yi
COMPUTER VISION-ECCV 2024, PT LXXXII, 2025, 15140 : 379 - 397
[7] Active vision and image/video understanding systems built upon network-symbolic models for perception-based navigation of mobile robots in real-world environments
Kuvich, G
MOBILE ROBOTS XVII, 2004, 5609 : 35 - 49

← 1 →