Camera-LiDAR Cross-Modality Gait Recognition

被引:0
|
作者
Guo, Wenxuan [1 ]
Liang, Yingping [2 ]
Pan, Zhiyu [1 ]
Xi, Ziheng [1 ]
Feng, Jianjiang [1 ]
Zhou, Jie [1 ]
机构
[1] Tsinghua Univ, Dept Automat, BNRist, Beijing, Peoples R China
[2] Beijing Inst Technol, Beijing, Peoples R China
来源
关键词
Gait recognition; Cross-modality; Contrastive pre-training;
D O I
10.1007/978-3-031-72754-2_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gait recognition is a crucial biometric identification technique. Camera-based gait recognition has been widely applied in both research and industrial fields. LiDAR-based gait recognition has also begun to evolve most recently, due to the provision of 3D structural information. However, in certain applications, cameras fail to recognize persons, such as in low-light environments and long-distance recognition scenarios, where LiDARs work well. On the other hand, the deployment cost and complexity of LiDAR systems limit its wider application. Therefore, it is essential to consider cross-modality gait recognition between cameras and LiDARs for a broader range of applications. In this work, we propose the first cross-modality gait recognition framework between Camera and LiDAR, namely CL-Gait. It employs a two-stream network for feature embedding of both modalities. This poses a challenging recognition task due to the inherent matching between 3D and 2D data, exhibiting significant modality discrepancy. To align the feature spaces of the two modalities, i.e., camera silhouettes and LiDAR points, we propose a contrastive pre-training strategy to mitigate modality discrepancy. To make up for the absence of paired camera-LiDAR data for pre-training, we also introduce a strategy for generating data on a large scale. This strategy utilizes monocular depth estimated from single RGB images and virtual cameras to generate pseudo point clouds for contrastive pre-training. Extensive experiments show that the cross-modality gait recognition is very challenging but still contains potential and feasibility with our proposed model and pre-training strategy. To the best of our knowledge, this is the first work to address cross-modality gait recognition. The code and dataset are available at https://github.com/GWxuan/CL-Gait.
引用
收藏
页码:439 / 455
页数:17
相关论文
共 50 条
  • [31] Cross-modality online distillation for multi-view action recognition
    Xu, Chao
    Wu, Xia
    Li, Yachun
    Jin, Yining
    Wang, Mengmeng
    Liu, Yong
    NEUROCOMPUTING, 2021, 456 : 384 - 393
  • [32] Pedestrian Recognition through Different Cross-Modality Deep Learning Methods
    Pop, Danut Ovidiu
    Rogozan, Alexandrina
    Nashashibi, Fawzi
    Bensrhair, Abdelaziz
    2017 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY (ICVES), 2017, : 133 - 138
  • [33] Ship detection and recognition in SAR images with cross-modality domain adaption
    Song Y.
    Li J.
    Tian T.
    Tian J.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (11): : 107 - 113
  • [34] Significance of Image Features in Camera-LiDAR Based Object Detection
    Csontho, Mihaly
    Rovid, Andras
    Szalay, Zsolt
    IEEE ACCESS, 2022, 10 : 61034 - 61045
  • [35] Spatiotemporal Camera-LiDAR Calibration: A Targetless and Structureless Approach
    Park, Chanoh
    Moghadam, Peyman
    Kim, Soohwan
    Sridharan, Sridha
    Fookes, Clinton
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): : 1556 - 1563
  • [36] Camera-LiDAR Data Fusion for Autonomous Mooring Operation
    Subedi, Dipendra
    Jha, Ajit
    Tyapin, Ilya
    Hovland, Geir
    PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020), 2020, : 1176 - 1181
  • [37] Camera-LiDAR Fusion for Object Detection,Tracking and Prediction
    Huang Y.
    Zhou J.
    Huang Q.
    Li B.
    Wang L.
    Zhu J.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (06): : 945 - 951
  • [38] CMDFusion: Bidirectional Fusion Network With Cross-Modality Knowledge Distillation for LiDAR Semantic Segmentation
    Cen, Jun
    Zhang, Shiwei
    Pei, Yixuan
    Li, Kun
    Zheng, Hang
    Luo, Maochun
    Zhang, Yingya
    Chen, Qifeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 771 - 778
  • [39] Camera-LIDAR Integration: Probabilistic Sensor Fusion for Semantic Mapping
    Berrio, Julie Stephany
    Shan, Mao
    Worrall, Stewart
    Nebot, Eduardo
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 7637 - 7652
  • [40] Hybrid-Residual-Based Odometry for Camera-LiDAR Systems
    Shi, Chenghao
    Huang, Kaihong
    Yu, Qinghua
    Xiao, Junhao
    Lu, Huimin
    Xie, Chenggang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3847 - 3852