A Smart Dual-modal Aligned Transformer Deep Network for Robotic Grasp Detection

被引：0

作者：

Cang, Xin ^{[1
]}

Zhang, Haojun ^{[1
]}

Yang, Yuequan ^{[1
]}

Cao, Zhiqiang ^{[2
]}

Li, Fudong ^{[1
]}

Zhu, Jiaming ^{[1
]}

机构：

[1] Yangzhou Univ, Sch Informat Engn, Yangzhou, Jiangsu, Peoples R China

[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China

来源：

2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

Dual modalities; Feature alignment; Robotic grasping; Transformer;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Robotic grasp is one of crucial visual tasks for service robots as well as industrial robots. The existing deep vision learning approaches for robotic grasp most utilize RGB-D as single modality or indiscriminating usage of them, which often overlook the valuable depth information in RGB-D images. To address this limitation, this paper proposes a smart dual-modal aligned transformer deep network (SATNet), which is not only very lightweight but also well performed for robotic grasping tasks using RGB-D images. Specifically, a novel ATFormer module with the two parallel aligned transformer encoder blocks are elaborated to fuse global feature maps efficiently. The experiments on Cornell dataset demonstrate that the proposed model outperforms existing methods, which not only enjoys impressively lightweight framework with only 0.27M parameters, but also achieves accuracy of 97.8% and inference time of 16.3ms.

引用

页码：1230 / 1235

页数：6

共 50 条

[1] Dual-Modal Information Bottleneck Network for Seizure Detection
Wang, Jiale
Ge, Xinting
Shi, Yunfeng
Sun, Mengxue
Gong, Qingtao
Wang, Haipeng
Huang, Wenhui
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (01)
[2] Robotic Grasp Detection Based on Transformer
Dong, Mingshuai
Bai, Yuxuan
Wei, Shimin
Yu, Xiuli
INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT IV, 2022, 13458 : 437 - 448
[3] Object Detection Algorithm Based on Dual-modal Fusion Network
Sun Ying
Hou Zhiqiang
Yang Chen
Ma Sugang
Fan Jiulun
ACTA PHOTONICA SINICA, 2023, 52 (01)
[4] IMAGE FUSION NETWORK FOR DUAL-MODAL RESTORATION
Zhang, Ying
Ren, Xuhua
Clifford, Bryan Alexander
Wang, Qian
Zhang, Xiaoqun
INVERSE PROBLEMS AND IMAGING, 2021, 15 (06) : 1409 - 1419
[5] Dual-modal aptasensor for the detection of isocarbophos in vegetables
Wang, Rong-Hua
Zhu, Cheng-Long
Wang, Ling-Ling
Xu, Li-Zhi
Wang, Wen-Long
Yang, Cheng
Zhang, Yi
TALANTA, 2019, 205
[6] Compressed Video Action Recognition With Dual-Stream and Dual-Modal Transformer
Mou, Yuting
Jiang, Xinghao
Xu, Ke
Sun, Tanfeng
Wang, Zepeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3299 - 3312
[7] Robust dual-modal image quality assessment aware deep learning network for traffic targets detection of autonomous vehicles
Keke Geng
Ge Dong
Wenhan Huang
Multimedia Tools and Applications, 2022, 81 : 6801 - 6826
[8] Robust dual-modal image quality assessment aware deep learning network for traffic targets detection of autonomous vehicles
Geng, Keke
Dong, Ge
Huang, Wenhan
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (05) : 6801 - 6826
[9] Quality-aware dual-modal saliency detection via deep reinforcement learning
Wang, Xiao
Sun, Tao
Yang, Rui
Li, Chenglong
Luo, Bin
Tang, Jin
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 75 : 158 - 167
[10] Dual-Modal Drowsiness Detection to Enhance Driver Safety
Chew, Yi Xuan
Razak, Siti Fatimah Abdul
Yogarayan, Sumendra
Ismail, Sharifah Noor Masidayu Sayed
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4397 - 4417

← 1 2 3 4 5 →