Deep multi-view representation learning for social images

被引：11

作者：

Huang, Feiran ^{[1
]}

Zhang, Xiaoming ^{[2
]}

Zhao, Zhonghua ^{[3
]}

Li, Zhoujun ^{[1
]}

He, Yueying ^{[3
]}

机构：

[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China

[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China

[3] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing 100029, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2018年 / 73卷

基金：

中国国家自然科学基金;

关键词：

Multi-view learning; Image embedding; Representation learning; Stacked autoencoder;

D O I：

10.1016/j.asoc.2018.08.010

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-view representation learning for social images has recently made remarkable achievements in many tasks, such as cross-view classification and cross-modal retrieval. Since social images usually contain link information besides the multi-modal contents (e.g., text description, and visual content), simply employing the data content may result in sub-optimal multi-view representation of the social images. In this paper, we propose a Deep Multi-View Embedding Model (DMVEM) to learn joint embeddings for the three views including the visual content, the associated text descriptions, and their relations. To effectively encode the link information, a weighted relation network is built based on the linkages between social images, which is then embedded into a low dimensional vector space using the Skip-Gram model. The learned vector is regarded as the third view besides the visual content and text description. To learn a joint representation from the three views, a deep learning model with three-branch nonlinear neural network is proposed. A three-view bi-directional loss function is used to capture the correlation between the three views. The stacked autoencoder is adopted to preserve the self-structure and reconstructability of the learned representation for each view. Comprehensive experiments are conducted in the tasks of image-to-text, text-to-image, and image-to-image searches. Compared to the state-of-the-art multi-view embedding methods, our approach achieves significant improvement of performance. (C) 2018 Elsevier B.V. All rights reserved.

引用

页码：106 / 118

页数：13

共 50 条

[21] Semi-supervised Deep Representation Learning for Multi-View Problems
Noroozi, Vahid
Bahaadini, Sara
Zheng, Lei
Xie, Sihong
Shao, Weixiang
Yu, Philip S.
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 56 - 64
[22] Unified Representation Learning for Multi-View Clustering by Between/Within View Deep Majorization
Zhang, Yue
Yang, Sirui
Huang, Weitian
Wang, Chang-Dong
Cai, Hongmin
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 615 - 626
[23] Unbalanced Multi-view Deep Learning
Xu, Cai
Li, Zehui
Guan, Ziyu
Zhao, Wei
Song, Xiangyu
Wu, Yue
Li, Jianxin
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3051 - 3059
[24] Deep Multi-View Concept Learning
Xu, Cai
Guan, Ziyu
Zhao, Wei
Niu, Yunfei
Wang, Quan
Wang, Zhiheng
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2898 - 2904
[25] Conversion of two dimensional images into multi-view images of bone using deep learning
Pradhan, Nitesh
Singh, Vaibhav
Kumar, Virat
Goel, Parth
Dhaka, Vijaypal Singh
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2021, 9 (01): : 106 - 113
[26] Deep Multi-View Learning to Rank
Cao, Guanqun
Iosifidis, Alexandros
Gabbouj, Moncef
Raghavan, Vijay
Gottumukkala, Raju
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1426 - 1438
[27] Deep Partial Multi-View Learning
Zhang, Changqing
Cui, Yajie
Han, Zongbo
Zhou, Joey Tianyi
Fu, Huazhu
Hu, Qinghua
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) : 2402 - 2415
[28] Deep Generative Multi-view Learning
Karami, Mahdi
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 1167 : 465 - 477
[29] Multi-View Concept Learning for Data Representation
Guan, Ziyu
Zhang, Lijun
Peng, Jinye
Fan, Jianping
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (11) : 3016 - 3028
[30] A survey on representation learning for multi-view data
Qin, Yalan
Zhang, Xinpeng
Yu, Shui
Feng, Guorui
NEURAL NETWORKS, 2025, 181

← 1 2 3 4 5 →