Real-Time Reinforcement Learning for Optimal Viewpoint Selection in Monocular 3D Human Pose Estimation

被引:0
|
作者
Lee, Sanghyeon [1 ]
Hwang, Yoonho [1 ]
Lee, Jong Taek [1 ]
机构
[1] Kyungpook Natl Univ, Sch Comp Sci & Engn, Daegu 41566, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Three-dimensional displays; Cameras; Real-time systems; Accuracy; Pose estimation; Heating systems; Uncertainty; Drones; Solid modeling; Feature extraction; 3D human pose estimation; next best viewpoint selection; deep learning; reinforcement learning;
D O I
10.1109/ACCESS.2024.3514146
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monocular 3D human pose estimation (HPE) presents an inherently ill-posed challenge, complicated by issues such as depth ambiguity and uncertainty. Estimating 3D poses with a single camera heavily depends on viewpoint, resulting in poor pose estimation accuracy. To address these challenges, we propose a real-time reinforcement learning-based viewpoint selection method that dynamically adjusts the camera viewpoint to optimize pose estimation. Our method extracts features encoding depth ambiguity and uncertainty from 2D-to-3D lifting, allowing the model to identify the optimal camera movements without requiring multiple cameras. We evaluate our approach on a publicly available real-world dataset, adjusted to simulate a realistic setting of drone flights capturing human motions. Our approach, compared against baseline strategies including fixed, random, and rotating camera movements with various 3D HPE models, significantly enhances the accuracy and robustness of pose estimation. In particular, it achieves a notable improvement, reducing pose estimation errors by over 30% compared to fixed and random camera movements. These results highlight the effectiveness of our method in optimizing viewpoint selection for real-time 3D HPE, making it a practical solution for single-camera setups in dynamic environments. Our code is available at https://github.com/knu-vis/nbv-pose.
引用
收藏
页码:191020 / 191029
页数:10
相关论文
共 50 条
  • [1] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [2] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    Applied Intelligence, 2022, 52 : 14491 - 14506
  • [3] Deep learning-based real-time 3D human pose estimation
    Zhang, Xiaoyan
    Zhou, Zhengchun
    Han, Ying
    Meng, Hua
    Yang, Meng
    Rajasegarar, Sutharshan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [4] LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION
    Chen, Ziyi
    Sugimoto, Akihiro
    Lai, Shang-Hong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4218 - 4222
  • [5] G2O-Pose: Real-Time Monocular 3D Human Pose Estimation Based on General Graph Optimization
    Sun, Haixun
    Zhang, Yanyan
    Zheng, Yijie
    Luo, Jianxin
    Pan, Zhisong
    SENSORS, 2022, 22 (21)
  • [6] Real-Time 3D Pose Reconstruction of Human Body from Monocular Video Sequences
    Zhu, LiangJia
    Hwang, Jenq-Neng
    Chen, Chih-Chang
    Lin, Ming-Hui
    Yen, Chen-Lan
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 717 - +
  • [7] Real-time 3D human pose and motion reconstruction from monocular RGB videos
    Yiannakides, Anastasios
    Aristidou, Andreas
    Chrysanthou, Yiorgos
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2019, 30 (3-4)
  • [8] Downsizing Heatmap Resolution for real-time 3D Human Pose Estimation
    Kong, Dae-hyeon
    Kang, Suk-ju
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [9] REAL-TIME 3D RECONSTRUCTION AND POSE ESTIMATION FOR HUMAN MOTION ANALYSIS
    Graf, Holger
    Yoon, Sang Min
    Malerczyk, Cornelius
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3981 - 3984
  • [10] A survey on monocular 3D human pose estimation
    Ji X.
    Fang Q.
    Dong J.
    Shuai Q.
    Jiang W.
    Zhou X.
    Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500