STARNet: Low-light video enhancement using spatio-temporal consistency aggregation

被引:0
|
作者
Wu, Zhe [1 ]
Sheng, Zehua [1 ]
Zhang, Xue [1 ]
Cao, Si-Yuan [2 ]
Zhang, Runmin [1 ]
Yu, Beinan [3 ,4 ]
Zhang, Chenghao [1 ]
Yang, Bailin [5 ]
Shen, Hui-Liang [1 ,6 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China
[2] Zhejiang Univ, Ningbo Innovat Ctr, Ningbo, Peoples R China
[3] Zhejiang Univ, Jinhua Inst, Jinhua, Zhejiang, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[5] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
[6] Key Lab Collaborat Sensing & Autonomous Unmanned S, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Low-light enhancement; Image processing; Video enhancement; Spatio-temporal aggregation; NETWORK;
D O I
10.1016/j.patcog.2024.111180
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In low-light environments, capturing high-quality videos is an imaging challenge due to the limited number of photons. Previous low-light enhancement approaches usually result in over-smoothed details, temporal flickers, and color deviation. We propose STARNet, an end-to-end video enhancement network that leverages temporal consistency aggregation to address these issues. We introduce a spatio-temporal consistency aggregator, which extracts structures from multiple frames in hidden space to overcome detail corruption and temporal flickers. It parameterizes neighboring frames to extract and align consistent features, and then selectively fuses consistent features to restore clear structures. To further enhance temporal consistency, we develop a local temporal consistency constraint with robustness against the warping error from motion estimation. Furthermore, we employ a normalized low-frequency color constraint to regularize the color as the normal-light condition. Extensive experimental results on real datasets show that the proposed method achieves better detail fidelity, color accuracy, and temporal consistency, outperforming state-of-the-art approaches.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Historical spatio-temporal aggregation
    Tao, Y
    Papadias, D
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2005, 23 (01) : 61 - 102
  • [32] Low-Rank Tensor Completion with Spatio-Temporal Consistency
    Wang, Hua
    Nie, Feiping
    Huang, Heng
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2846 - 2852
  • [33] A Novel Framework for Extremely Low-light Video Enhancement
    Kim, Minjac
    Park, Dubok
    Han, David K.
    Ko, Hanscok
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2014, : 93 - 94
  • [34] Low-Light Video Enhancement with Synthetic Event Guidance
    Liu, Lin
    An, Junfeng
    Liu, Jianzhuang
    Yuan, Shanxin
    Chen, Xiangyu
    Zhou, Wengang
    Li, Houqiang
    Wang, Yan Feng
    Tian, Qi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1692 - 1700
  • [35] Modeling consistency of spatio-temporal graphs
    Del Mondo, G.
    Rodriguez, M. A.
    Claramunt, C.
    Bravo, L.
    Thibaud, R.
    DATA & KNOWLEDGE ENGINEERING, 2013, 84 : 59 - 80
  • [36] Spatio-temporal Sampling for Video
    Shankar, Mohan
    Pitsiauis, Nikos P.
    Brady, David
    IMAGE RECONSTRUCTION FROM INCOMPLETE DATA V, 2008, 7076
  • [37] Video Captioning With Object-Aware Spatio-Temporal Correlation and Aggregation
    Zhang, Junchao
    Peng, Yuxin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 6209 - 6222
  • [38] Learning Spatio-Temporal Aggregation for Fetal Heart Analysis in Ultrasound Video
    Patra, Arijit
    Huang, Weilin
    Noble, J. Alison
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2017, 10553 : 276 - 284
  • [39] Unsupervised Low-Light Video Enhancement With Spatial-Temporal Co-Attention Transformer
    Lv, Xiaoqian
    Zhang, Shengping
    Wang, Chenyang
    Zhang, Weigang
    Yao, Hongxun
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4701 - 4715
  • [40] A low-light video enhancement approach using novel intuitionistic fuzzy generator
    Chinnappan, Ravindar Raj
    Sundaram, Dhanasekar
    EUROPEAN PHYSICAL JOURNAL-SPECIAL TOPICS, 2024,