CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval

被引：22

作者：

Nie, Weizhi ^{[1
]}

Zhao, Yue ^{[1
]}

Nie, Jie ^{[2
]}

Liu, An-An ^{[1
]}

Zhao, Sicheng ^{[3
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China

[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022年 / 32卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Shape; Three-dimensional displays; Two dimensional displays; Feature extraction; Task analysis; Visualization; Computer architecture; Image processing; information retrieval; content-based retrieval; multimedia computing; MODEL RETRIEVAL; NEURAL-NETWORK; FEATURES;

D O I：

10.1109/TCSVT.2021.3070969

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Retrieving 3D shapes based on 2D images is a challenging research topic, due to the significant gap between different domains. Recently, various approaches have been proposed to handle this problem. However, the majority of methods target the cross-domain retrieval task as a pure domain adaptation problem, which focuses on the alignment but ignores the visual relevance between the 2D images and their corresponding 3D shapes. To fundamentally decrease the divergence between different domains, we propose a novel cross-domain learning network (CLN) for 2D image-based 3D shape retrieval task. First, we estimate the pose information from the 2D image to guide the view rendering of 3D shapes, which increases the visual correlations of the cross-domain data to eliminate the divergence between them. Second, we introduce a novel joint learning network, considering both the domain-specific characteristics and the cross-domain interactions for data alignment, which further compensates for the gap between different domains by controlling the distance of intra- and inter-classes. After the metric learning process, discriminative descriptors of images and shapes are generated for the cross-domain retrieval task. To prove the effectiveness and robustness of the proposed method, we conduct extensive experiments on the MI3DOR, SHREC'13, and SHREC'14 datasets. The experimental results demonstrate the superiority of our proposed method, and significant improvements have been achieved compared with state-of-the-art methods.

引用

页码：992 / 1005

页数：14

共 50 条

[31] An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
Yasin, Hashim
Krueger, Bjoern
SENSORS, 2021, 21 (07)
[32] Cross-Domain 3D Equivariant Image Embeddings
Esteves, Carlos
Sud, Avneesh
Luo, Zhengyi
Daniilidis, Kostas
Makadia, Ameesh
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[33] Image-based 3D model retrieval using manifold learning
Pan-pan MU
San-yuan ZHANG
Yin ZHANG
Xiu-zi YE
Xiang PAN
Frontiers of Information Technology & Electronic Engineering, 2018, 19 (11) : 1397 - 1408
[34] Image-based 3D model retrieval using manifold learning
Pan-pan Mu
San-yuan Zhang
Yin Zhang
Xiu-zi Ye
Xiang Pan
Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 1397 - 1408
[35] Image-based 3D model retrieval using manifold learning
Mu, Pan-pan
Zhang, San-yuan
Zhang, Yin
Ye, Xiu-zi
Pan, Xiang
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (11) : 1397 - 1408
[36] Domain-Specific Alignment Network for Multi-Domain Image-Based 3D Object Retrieval
Su, Yuting
Li, Yuqian
Song, Dan
Mao, Zhendong
Li, Xuanya
Liu, An-An
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3496 - 3504
[37] Learning Pairwise Neural Network Encoder for Depth Image-based 3D Model Retrieval
Zhu, Jing
Zhu, Fan
Wong, Edward K.
Fang, Yi
MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1227 - 1230
[38] OPEN: Occlusion-Invariant Perception Network for Single Image-Based 3D Shape Retrieval
Chu, Fupeng
Cong, Yang
Chen, Ronghan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 7998 - 8012
[39] M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval
Nie, Wei-Zhi
Ren, Min-Jie
Liu, An-An
Mao, Zhendong
Nie, Jie
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1962 - 1976
[40] Shape-aware speckle matching network for cross-domain 3D reconstruction
Dong, Yanzhen
Wu, Haitao
Yang, Xiao
Chen, Xiaobo
Xi, Juntong
NEUROCOMPUTING, 2024, 585

← 1 2 3 4 5 →