Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments

Cited by: 5
Authors
Xu, Chi [1, 2, 3]
Zhou, Jun [1, 2]
Cai, Wendi [1, 2]
Jiang, Yunkai [1, 2]
Li, Yongbo [1, 2]
Liu, Yi [4, 5]
Affiliations
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China
[4] CRRC Zhuzhou Elect Locomot Co Ltd, Zhuzhou 412000, Peoples R China
[5] Natl Innovat Ctr Adv Rail Transit Equipment, Zhuzhou 412000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
3D hand detection; RGB-D sensor; human-computer interaction; unseen lighting condition; adaptive RGB-D fusion; OBJECT DETECTION; RECOGNITION; NETWORK;
DOI
10.3390/s20216360
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Three-dimensional hand detection from a single RGB-D image is an important technology that supports many useful applications. In practice, it is challenging to detect human hands robustly in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as lighting changes. To tackle this problem, we propose a 3D hand detection approach that improves robustness and accuracy by adaptively fusing the complementary features extracted from the RGB and D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we present a challenging RGB-D hand detection dataset collected in unconstrained environments. Unlike previous works, which rely primarily on either the RGB or the D channel, we adaptively fuse the RGB-D channels for hand detection. Evaluation results show that the D channel is crucial for hand detection in unconstrained environments. Compared with a state-of-the-art RGB-based hand detector, our RGB-D fusion-based approach improves hand detection accuracy from 69.1 to 74.1. Existing RGB-based and D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method drops to 48.9, and in back-light conditions, the accuracy of the D-based method drops to 28.3. Compared with these methods, our RGB-D fusion-based approach is much more robust and avoids such severe accuracy degradation, reaching accuracies of 62.5 and 65.9, respectively, in these two extreme lighting conditions.
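The idea of adaptively fusing complementary RGB and depth features can be illustrated with a minimal sketch. This is not the network described in the paper: the record does not specify the fusion module, so the softmax gating used here is a stand-in technique, and the function name `adaptive_fuse` and the per-modality reliability scores `rgb_score` and `depth_score` are hypothetical names introduced only for illustration.

```python
import math


def adaptive_fuse(rgb_feat, depth_feat, rgb_score, depth_score):
    """Blend two equal-length feature vectors with softmax weights.

    rgb_score and depth_score are hypothetical reliability scores
    (assumed here to come from some learned gating branch); they are
    not taken from the paper.
    """
    if len(rgb_feat) != len(depth_feat):
        raise ValueError("feature vectors must have the same length")

    # Numerically stable two-way softmax over the reliability scores.
    m = max(rgb_score, depth_score)
    e_rgb = math.exp(rgb_score - m)
    e_depth = math.exp(depth_score - m)
    w_rgb = e_rgb / (e_rgb + e_depth)
    w_depth = 1.0 - w_rgb

    # Weighted element-wise combination of the two modalities.
    fused = [w_rgb * r + w_depth * d for r, d in zip(rgb_feat, depth_feat)]
    return fused, (w_rgb, w_depth)


# Dark scene (illustrative numbers): the RGB branch is scored as less
# reliable, so the depth features dominate the fused result.
fused, weights = adaptive_fuse([0.2, 0.4], [1.0, 0.0],
                               rgb_score=-2.0, depth_score=2.0)
```

Under this assumption, a low reliability score for one channel (for example RGB in the dark, or depth under back-light) shrinks that channel's weight rather than discarding it, which is one simple way a detector could avoid depending on a single degraded modality.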
Pages: 1 / 22
Number of pages: 22