EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision

Cited by: 0
Authors
Qu, Qiang [1]
Chen, Xiaoming [2]
Chung, Yuk Ying [1]
Shen, Yiran [3]
Affiliations
[1] Univ Sydney, Sch Comp Sci, Sydney, NSW 2050, Australia
[2] Beijing Technol & Business Univ, Sch Comp & Artificial Intelligence, Beijing 102401, Peoples R China
[3] Shandong Univ, Sch Software, Jinan 250100, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Cameras; Event detection; Optical flow; Estimation; Self-supervised learning; Noise; Computer vision; Accuracy; Generators; Noise reduction; Dynamic vision sensor; neuromorphic vision; event camera; representation learning; event-based vision; SENSOR;
DOI
10.1109/TIP.2024.3497795
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Event-stream representation is the first step in many computer vision tasks that use event cameras. It converts asynchronous event-streams into a formatted structure so that conventional machine learning models can be applied easily. However, most state-of-the-art event-stream representations are manually designed, and their quality cannot be guaranteed because event-streams are inherently noisy. In this paper, we introduce a data-driven approach that aims to enhance the quality of event-stream representations. We first introduce a new event-stream representation based on spatial-temporal statistics, denoted EvRep. We then theoretically derive the intrinsic relationship between asynchronous event-streams and synchronous video frames. Building on this relationship, we train a representation generator, RepGen, in a self-supervised manner with EvRep as input. Finally, event-streams are converted to high-quality representations, termed EvRepSL, by passing them through the learned RepGen, without the need for fine-tuning or retraining. We validate the method through extensive evaluations on a variety of mainstream event-based classification and optical flow datasets captured with various types of event cameras. The experimental results show that our approach outperforms existing event-stream representations and is versatile, being agnostic to the event camera and the task.
Pages: 6579-6591
Page count: 13