Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

被引:0
|
作者
Kokic, Mia [1 ]
Kragic, Danica [1 ]
Bohg, Jeannette [2 ]
机构
[1] KTH, EECS, Div Robot Percept & Learning, Stockholm, Sweden
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
基金
瑞典研究理事会;
关键词
GRASP;
D O I
10.1109/iros40897.2019.8967961
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a system for modeling hand-object interactions in 3D from RGB images that show a hand which is holding a novel object from a known category. We design a Convolutional Neural Network (CNN) for Hand-held Object Pose and Shape estimation called HOPS-Net and utilize prior work to estimate the hand pose and configuration. We leverage the insight that information about the hand facilitates object pose and shape estimation by incorporating the hand into both training and inference of the object pose and shape as well as the refinement of the estimated pose. The network is trained on a large synthetic dataset of objects in interaction with a human hand. To bridge the gap between real and synthetic images, we employ an image-to-image translation model (Augmented CycleGAN) that generates realistically textured objects given a synthetic rendering. This provides a scalable way of generating annotated data for training HOPS-Net. Our quantitative experiments show that even noisy hand parameters significantly help object pose and shape estimation. The qualitative experiments show results of pose and shape estimation of objects held by a hand "in the wild".
引用
收藏
页码:3980 / 3987
页数:8
相关论文
共 50 条
  • [21] Crystal Palace: Merging Virtual Objects and Physical Hand-held Tools
    Kashiwagi, Toshiro
    Sumi, Kaoru
    Fels, Sidney
    Zhou, Qian
    Wu, Fan
    2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2019, : 1411 - 1412
  • [22] A Hybrid Genetic Algorithm for Relative Pose Estimation Captured by Hand-held Camera
    Zhou, Yongjun
    Deng, Caihua
    INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY AND ENVIRONMENT PROTECTION (ICSEEP 2015), 2015, : 811 - 819
  • [23] A RESPIRATORY PROFILE FROM A HAND-HELD COMPUTER
    AQUINO, MM
    HEART & LUNG, 1985, 14 (01): : 88 - 90
  • [24] Hand-held methane detector from Crowcon
    不详
    INTERNATIONAL GAS ENGINEERING AND MANAGEMENT, 2006, 46 (09): : 35 - 35
  • [25] 3D interacting hand pose and shape estimation from a single RGB image
    Gao, Chengying
    Yang, Yujia
    Li, Wensheng
    NEUROCOMPUTING, 2022, 474 : 25 - 36
  • [26] RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion
    Lv, Xiong
    Jiang, Shu-Qiang
    Herranz, Luis
    Wang, Shuang
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (02) : 340 - 352
  • [27] RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion
    Xiong Lv
    Shu-Qiang Jiang
    Luis Herranz
    Shuang Wang
    Journal of Computer Science and Technology, 2015, 30 : 340 - 352
  • [28] Realistic surface geometry reconstruction using a hand-held RGB-D camera
    Kyoung-Rok Lee
    Truong Nguyen
    Machine Vision and Applications, 2016, 27 : 377 - 385
  • [29] Estimation of Hand Pressure and Pose From RGB Images Based on Cross-Modal Cues
    Tang, Wei
    Shao, Liangjing
    Chen, Xinrong
    IEEE SENSORS JOURNAL, 2025, 25 (01) : 2030 - 2039
  • [30] Realistic surface geometry reconstruction using a hand-held RGB-D camera
    Lee, Kyoung-Rok
    Truong Nguyen
    MACHINE VISION AND APPLICATIONS, 2016, 27 (03) : 377 - 385