Lifting Monocular Events to 3D Human Poses

被引:9
|
作者
Scarpellini, Gianluca [1 ,2 ]
Morerio, Pietro [1 ]
Del Bue, Alessio [1 ,3 ]
机构
[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis, Genoa, Italy
[2] Univ Genoa, Genoa, Italy
[3] Ist Italiano Tecnol, Visual Geometry & Modelling, Genoa, Italy
关键词
D O I
10.1109/CVPRW53098.2021.00150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel 3D human pose estimation approach using a single stream of asynchronous events as input. Most of the state-of-the-art approaches solve this task with RGB cameras, however struggling when subjects are moving fast. On the other hand, event-based 3D pose estimation benefits from the advantages of event-cameras, especially their efficiency and robustness to appearance changes. Yet, finding human poses in asynchronous events is in general more challenging than standard RGB pose estimation, since little or no events are triggered in static scenes. Here we propose the first learning-based method for 3D human pose from a single stream of events. Our method consists of two steps. First, we process the event-camera stream to predict three orthogonal heatmaps per joint; each heatmap is the projection of of the joint onto one orthogonal plane. Next, we fuse the sets of heatmaps to estimate 3D localisation of the body joints. As a further contribution, we make available a new, challenging dataset for event-based human pose estimation by simulating events from the RGB Human3.6m dataset. Experiments demonstrate that our method achieves solid accuracy, narrowing the performance gap between standard RGB and event-based vision. The code is freely available at https://iit-pavis.github.io/lifting_events_to_3d_hpe.
引用
收藏
页码:1358 / 1368
页数:11
相关论文
共 50 条
  • [1] Modeling 3D Human Poses from Uncalibrated Monocular Images
    Wei, Xiaolin K.
    Chai, Jinxiang
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1873 - 1880
  • [2] Forecasting Characteristic 3D Poses of Human Actions
    Diller, Christian
    Funkhouser, Thomas
    Dai, Angela
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15893 - 15902
  • [3] LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses
    Stier, Noah
    Angles, Baptiste
    Yang, Liang
    Yan, Yajie
    Colburn, Alex
    Chuang, Ming
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7887 - 7896
  • [4] Monocular 3D Reconstruction of Human Body
    Zhang, Yuqi
    Li, Dewei
    Jin, Bihui
    Ku, Yunwen
    Xue, Shibei
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 7889 - 7894
  • [5] PoseScript: 3D Human Poses from Natural Language
    Delmas, Ginger
    Weinzaepfel, Philippe
    Lucas, Thomas
    Moreno-Noguer, Francesc
    Rogez, Gregory
    COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 346 - 362
  • [6] PoseFix: Correcting 3D Human Poses with Natural Language
    Delmas, Ginger
    Weinzaepfel, Philippe
    Moreno-Noguer, Francesc
    Rogez, Gregory
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14972 - 14982
  • [7] DEFORMATION TRANSFER OF 3D HUMAN SHAPES AND POSES ON MANIFOLDS
    Shabayek, Abd El Rahman
    Aouada, Djamila
    Saint, Alexandre
    Ottersten, Bjorn
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 220 - 224
  • [8] A survey on monocular 3D human pose estimation
    Ji X.
    Fang Q.
    Dong J.
    Shuai Q.
    Jiang W.
    Zhou X.
    Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500
  • [9] MONOCULAR 3D HUMAN POSE ESTIMATION BY CLASSIFICATION
    Greif, Thomas
    Lienhart, Rainer
    Sengupta, Debabrata
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [10] 3D Human Pose Lifting with Grid Convolution
    Kang, Yangyuxuan
    Liu, Yuyang
    Yao, Anbang
    Wang, Shandong
    Wu, Enhua
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 1105 - 1113