Multi-view 3D object retrieval leveraging the aggregation of view and instance attentive features

被引:12
|
作者
Lin, Dongyun [1 ]
Li, Yiqun [1 ]
Cheng, Yi [1 ]
Prasad, Shitala [1 ]
Nwe, Tin Lay [1 ]
Dong, Sheng [1 ]
Guo, Aiyuan [1 ]
机构
[1] ASTAR, Inst Infocomm Res, 1 Fusionopolis Way,21-01 Connexis South Tower, Singapore 138632, Singapore
关键词
View-based 3D object retrieval; View attention module; Instance attention module; ArcFace loss; Cosine distance triplet -center loss; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1016/j.knosys.2022.108754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-view 3D object retrieval tasks, it is pivotal to aggregate visual features extracted from multiple view images to generate a discriminative representation for a 3D object. The existing multi-view convolutional neural network employs view pooling for feature aggregation, which ignores the local view-relevant discriminative information within each view image and the global correlative information across all view images. To leverage both types of information, we propose two self -attention modules, namely, View Attention Module and Instance Attention Module, to learn view and instance attentive features, respectively. The final representation of a 3D object is the aggregation of three features: original, view-attentive, and instance-attentive. Furthermore, we propose employing the ArcFace loss together with the cosine-distance-based triplet-center loss as the metric learning guidance to train our model. As the cosine distance is used to rank the retrieval results, our angular metric learning losses achieve a consistent objective between the training and testing processes, thereby facilitating discriminative feature learning. Extensive experiments and ablation studies are conducted on four publicly available datasets on 3D object retrieval to show the superiority of the proposed method over multiple state-of-the-art methods. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Group-Pair Convolutional Neural Networks for Multi-View Based 3D Object Retrieval
    Gao, Zan
    Wang, Deyu
    He, Xiangnan
    Zhang, Hua
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2223 - 2231
  • [42] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 3201 - 3212
  • [43] Multi-View Object Class Detection with a 3D Geometric Model
    Liebelt, Joerg
    Schmid, Cordelia
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1688 - 1695
  • [44] Multi-view Harmonized Bilinear Network for 3D Object Recognition
    Yu, Tan
    Meng, Jingjing
    Yuan, Junsong
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 186 - 194
  • [45] SCA-PVNet: Self-and-cross attention based aggregation of point cloud and multi-view for 3D object retrieval
    Lin, Dongyun
    Cheng, Yi
    Guo, Aiyuan
    Mao, Shangbo
    Li, Yiqun
    KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [46] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
    Gurbuz, Sabri
    Yano, Sumio
    Iwasawa, Shoichiro
    Ando, Hiroshi
    IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
  • [47] Multi-view 3D model retrieval based on enhanced detail features with contrastive center loss
    Chen, Qiang
    Chen, Yinong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (08) : 10407 - 10426
  • [48] Multi-view 3D model retrieval based on enhanced detail features with contrastive center loss
    Qiang Chen
    Yinong Chen
    Multimedia Tools and Applications, 2022, 81 : 10407 - 10426
  • [49] Learning Multi-view Deep Features for Small Object Retrieval in Surveillance Scenarios
    Guo, Haiyun
    Wang, Jinqiao
    Xu, Min
    Zha, Zheng-Jun
    Lu, Hanqing
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 859 - 862
  • [50] View-based 3D model retrieval via supervised multi-view feature learning
    An-An Liu
    Yang Shi
    Wei-Zhi Nie
    Yu-Ting Su
    Multimedia Tools and Applications, 2018, 77 : 3229 - 3243