Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

被引:0
|
作者
Kokic, Mia [1 ]
Kragic, Danica [1 ]
Bohg, Jeannette [2 ]
机构
[1] KTH, EECS, Div Robot Percept & Learning, Stockholm, Sweden
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
基金
瑞典研究理事会;
关键词
GRASP;
D O I
10.1109/iros40897.2019.8967961
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a system for modeling hand-object interactions in 3D from RGB images that show a hand which is holding a novel object from a known category. We design a Convolutional Neural Network (CNN) for Hand-held Object Pose and Shape estimation called HOPS-Net and utilize prior work to estimate the hand pose and configuration. We leverage the insight that information about the hand facilitates object pose and shape estimation by incorporating the hand into both training and inference of the object pose and shape as well as the refinement of the estimated pose. The network is trained on a large synthetic dataset of objects in interaction with a human hand. To bridge the gap between real and synthetic images, we employ an image-to-image translation model (Augmented CycleGAN) that generates realistically textured objects given a synthetic rendering. This provides a scalable way of generating annotated data for training HOPS-Net. Our quantitative experiments show that even noisy hand parameters significantly help object pose and shape estimation. The qualitative experiments show results of pose and shape estimation of objects held by a hand "in the wild".
引用
收藏
页码:3980 / 3987
页数:8
相关论文
共 50 条
  • [41] THE SOURCES AND CONTROL OF VIBRATION FROM HAND-HELD GRINDERS
    CLARKE, JB
    DALBY, W
    JOURNAL OF SOUND AND VIBRATION, 1988, 121 (03) : 583 - 583
  • [42] CELLULAR RECEIVER METAMORPHOSIS - FROM VMEBUS TO HAND-HELD
    MENDELSOHN, A
    COMPUTER DESIGN, 1994, 33 (05): : 85 - 88
  • [43] Weakly-Supervised 3D Hand Pose Estimation from Monocular RGB Images
    Cai, Yujun
    Ge, Liuhao
    Cai, Jianfei
    Yuan, Junsong
    COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 678 - 694
  • [44] Towards Automatic Estimation of the Body Condition Score of Dairy Cattle Using Hand-held Images and Active Shape Models
    Tedin, Rafael
    Becerra, J. A.
    Duro, Richard J.
    Martinez Lede, Ismael
    ADVANCES IN KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, 2012, 243 : 2150 - 2159
  • [45] Unsupervised Incremental Learning for Hand Shape and Pose Estimation
    Kalshetti, Pratik
    Chaudhuri, Parag
    SIGGRAPH '19 - ACM SIGGRAPH 2019 POSTERS, 2019,
  • [46] Computational Design of Hand-Held VR Controllers Using Haptic Shape Illusion
    Fujinawa, Eisuke
    Yoshida, Shigeo
    Koyama, Yuki
    Narumi, Takuji
    Tanikawa, Tomohiro
    Hirose, Michitaka
    VRST'17: PROCEEDINGS OF THE 23RD ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, 2017,
  • [47] Learning of Shape Models from Exemplars of Biological Objects in Images
    Perner, Petra
    ADVANCES IN COMPUTER VISION, CVC, VOL 1, 2020, 943 : 580 - 599
  • [48] A Study on Improving Close and Distant Device Movement Pose Manipulation for Hand-Held Augmented Reality
    Samini, Ali
    Palmeriust, Karljohan Lundin
    22ND ACM CONFERENCE ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY (VRST 2016), 2016, : 121 - 128
  • [49] HandFormer: Hand pose reconstructing from a single RGB image
    Jiao, Zixun
    Wang, Xihan
    Li, Jingcao
    Gao, Rongxin
    He, Miao
    Liang, Jiao
    Xia, Zhaoqiang
    Gao, Quanli
    PATTERN RECOGNITION LETTERS, 2024, 183 : 155 - 164
  • [50] Hand-held 3D scanner without sensor pose tracking or surface markers
    Kofman, J.
    Borribanbunpotkat, K.
    HIGH VALUE MANUFACTURING: ADVANCED RESEARCH IN VIRTUAL AND RAPID PROTOTYPING, 2014, : 429 - 434