A Smart Dual-modal Aligned Transformer Deep Network for Robotic Grasp Detection

被引:0
|
作者
Cang, Xin [1 ]
Zhang, Haojun [1 ]
Yang, Yuequan [1 ]
Cao, Zhiqiang [2 ]
Li, Fudong [1 ]
Zhu, Jiaming [1 ]
机构
[1] Yangzhou Univ, Sch Informat Engn, Yangzhou, Jiangsu, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Dual modalities; Feature alignment; Robotic grasping; Transformer;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robotic grasp is one of crucial visual tasks for service robots as well as industrial robots. The existing deep vision learning approaches for robotic grasp most utilize RGB-D as single modality or indiscriminating usage of them, which often overlook the valuable depth information in RGB-D images. To address this limitation, this paper proposes a smart dual-modal aligned transformer deep network (SATNet), which is not only very lightweight but also well performed for robotic grasping tasks using RGB-D images. Specifically, a novel ATFormer module with the two parallel aligned transformer encoder blocks are elaborated to fuse global feature maps efficiently. The experiments on Cornell dataset demonstrate that the proposed model outperforms existing methods, which not only enjoys impressively lightweight framework with only 0.27M parameters, but also achieves accuracy of 97.8% and inference time of 16.3ms.
引用
收藏
页码:1230 / 1235
页数:6
相关论文
共 50 条
  • [1] Dual-Modal Information Bottleneck Network for Seizure Detection
    Wang, Jiale
    Ge, Xinting
    Shi, Yunfeng
    Sun, Mengxue
    Gong, Qingtao
    Wang, Haipeng
    Huang, Wenhui
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (01)
  • [2] Robotic Grasp Detection Based on Transformer
    Dong, Mingshuai
    Bai, Yuxuan
    Wei, Shimin
    Yu, Xiuli
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT IV, 2022, 13458 : 437 - 448
  • [3] Object Detection Algorithm Based on Dual-modal Fusion Network
    Sun Ying
    Hou Zhiqiang
    Yang Chen
    Ma Sugang
    Fan Jiulun
    ACTA PHOTONICA SINICA, 2023, 52 (01)
  • [4] IMAGE FUSION NETWORK FOR DUAL-MODAL RESTORATION
    Zhang, Ying
    Ren, Xuhua
    Clifford, Bryan Alexander
    Wang, Qian
    Zhang, Xiaoqun
    INVERSE PROBLEMS AND IMAGING, 2021, 15 (06) : 1409 - 1419
  • [5] Dual-modal aptasensor for the detection of isocarbophos in vegetables
    Wang, Rong-Hua
    Zhu, Cheng-Long
    Wang, Ling-Ling
    Xu, Li-Zhi
    Wang, Wen-Long
    Yang, Cheng
    Zhang, Yi
    TALANTA, 2019, 205
  • [6] Compressed Video Action Recognition With Dual-Stream and Dual-Modal Transformer
    Mou, Yuting
    Jiang, Xinghao
    Xu, Ke
    Sun, Tanfeng
    Wang, Zepeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3299 - 3312
  • [7] Robust dual-modal image quality assessment aware deep learning network for traffic targets detection of autonomous vehicles
    Keke Geng
    Ge Dong
    Wenhan Huang
    Multimedia Tools and Applications, 2022, 81 : 6801 - 6826
  • [8] Robust dual-modal image quality assessment aware deep learning network for traffic targets detection of autonomous vehicles
    Geng, Keke
    Dong, Ge
    Huang, Wenhan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (05) : 6801 - 6826
  • [9] Quality-aware dual-modal saliency detection via deep reinforcement learning
    Wang, Xiao
    Sun, Tao
    Yang, Rui
    Li, Chenglong
    Luo, Bin
    Tang, Jin
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 75 : 158 - 167
  • [10] Dual-Modal Drowsiness Detection to Enhance Driver Safety
    Chew, Yi Xuan
    Razak, Siti Fatimah Abdul
    Yogarayan, Sumendra
    Ismail, Sharifah Noor Masidayu Sayed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4397 - 4417