Real-Time Reinforcement Learning for Optimal Viewpoint Selection in Monocular 3D Human Pose Estimation

被引:0
|
作者
Lee, Sanghyeon [1 ]
Hwang, Yoonho [1 ]
Lee, Jong Taek [1 ]
机构
[1] Kyungpook Natl Univ, Sch Comp Sci & Engn, Daegu 41566, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Three-dimensional displays; Cameras; Real-time systems; Accuracy; Pose estimation; Heating systems; Uncertainty; Drones; Solid modeling; Feature extraction; 3D human pose estimation; next best viewpoint selection; deep learning; reinforcement learning;
D O I
10.1109/ACCESS.2024.3514146
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monocular 3D human pose estimation (HPE) presents an inherently ill-posed challenge, complicated by issues such as depth ambiguity and uncertainty. Estimating 3D poses with a single camera heavily depends on viewpoint, resulting in poor pose estimation accuracy. To address these challenges, we propose a real-time reinforcement learning-based viewpoint selection method that dynamically adjusts the camera viewpoint to optimize pose estimation. Our method extracts features encoding depth ambiguity and uncertainty from 2D-to-3D lifting, allowing the model to identify the optimal camera movements without requiring multiple cameras. We evaluate our approach on a publicly available real-world dataset, adjusted to simulate a realistic setting of drone flights capturing human motions. Our approach, compared against baseline strategies including fixed, random, and rotating camera movements with various 3D HPE models, significantly enhances the accuracy and robustness of pose estimation. In particular, it achieves a notable improvement, reducing pose estimation errors by over 30% compared to fixed and random camera movements. These results highlight the effectiveness of our method in optimizing viewpoint selection for real-time 3D HPE, making it a practical solution for single-camera setups in dynamic environments. Our code is available at https://github.com/knu-vis/nbv-pose.
引用
收藏
页码:191020 / 191029
页数:10
相关论文
共 50 条
  • [21] Aqua3DNet: Real-time 3D pose estimation of livestock in aquaculture by monocular machine vision
    Koh, Ming En
    Fong, Mark Wong Kei
    Ng, Eddie Yin Kwee
    AQUACULTURAL ENGINEERING, 2023, 103
  • [22] Real-Time Monocular Pose Estimation of 3D Objects using Temporally Consistent Local Color Histograms
    Tjaden, Henning
    Schwanecke, Ulrich
    Schoemer, Elmar
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 124 - 132
  • [23] Multi-Task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition
    Luvizon, Diogo C.
    Picard, David
    Tabia, Hedi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) : 2752 - 2764
  • [24] Generalizing Monocular 3D Human Pose Estimation in the Wild
    Wang, Luyang
    Chen, Yan
    Guo, Zhenhua
    Qian, Keyuan
    Lin, Mude
    Li, Hongsheng
    Ren, Jimmy S.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4024 - 4033
  • [25] Fast 3D Hand Pose Estimation for Real-time System
    Song, Jae-Hun
    Kang, Suk-Ju
    2020 17TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC 2020), 2020, : 121 - 122
  • [26] Modeling vs. Learning Approaches for Monocular 3D Human Pose Estimation
    Gong, Wenjuan
    Brauer, Juergen
    Arens, Michael
    Gonzalez, Jordi
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [27] Learning with privileged stereo knowledge for monocular absolute 3D human pose estimation
    Bian, Cunling
    Lu, Weigang
    Feng, Wei
    Wang, Song
    PATTERN RECOGNITION LETTERS, 2025, 189 : 143 - 149
  • [28] Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) : 956 - 970
  • [29] RGBD-Based Real-Time 3D Human Pose Estimation for Fitness Assessment
    Jiang, Yujie
    Cao, Chuang
    Zhu, Xiaoxiao
    Ma, Yanhong
    Cao, Qixin
    2020 3RD WORLD CONFERENCE ON MECHANICAL ENGINEERING AND INTELLIGENT MANUFACTURING (WCMEIM 2020), 2020, : 103 - 108
  • [30] TransPose: Real-time 3D Human Translation and Pose Estimation with Six Inertial Sensors
    Yi, Xinyu
    Zhou, Yuxiao
    Xu, Feng
    ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04):