Deep multi-view representation learning for social images

Cited by: 11
Authors
Huang, Feiran [1 ]
Zhang, Xiaoming [2 ]
Zhao, Zhonghua [3 ]
Li, Zhoujun [1 ]
He, Yueying [3 ]
Affiliations
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[3] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing 100029, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-view learning; Image embedding; Representation learning; Stacked autoencoder
DOI
10.1016/j.asoc.2018.08.010
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-view representation learning for social images has recently achieved remarkable results in many tasks, such as cross-view classification and cross-modal retrieval. Since social images usually contain link information in addition to their multi-modal content (e.g., text descriptions and visual content), relying on the data content alone may yield a sub-optimal multi-view representation of the social images. In this paper, we propose a Deep Multi-View Embedding Model (DMVEM) to learn joint embeddings for three views: the visual content, the associated text descriptions, and the relations between images. To effectively encode the link information, a weighted relation network is built from the linkages between social images and then embedded into a low-dimensional vector space using the Skip-Gram model. The learned vector is treated as the third view alongside the visual content and the text description. To learn a joint representation from the three views, a deep learning model with a three-branch nonlinear neural network is proposed. A three-view bi-directional loss function is used to capture the correlation between the three views. A stacked autoencoder is adopted to preserve the self-structure and reconstructability of the learned representation for each view. Comprehensive experiments are conducted on image-to-text, text-to-image, and image-to-image search tasks. Compared with state-of-the-art multi-view embedding methods, our approach achieves a significant performance improvement. © 2018 Elsevier B.V. All rights reserved.
Pages: 106-118
Page count: 13
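The abstract mentions a three-view bi-directional loss without giving its form. As a rough illustration only, the sketch below assumes a common choice for such losses: a margin-based hinge ranking loss over cosine similarities, applied in both retrieval directions and summed over the three view pairs. The function names, the margin value, and the equal weighting of the pairs are all assumptions and are not taken from the paper.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale each row to (approximately) unit length."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def bidirectional_ranking_loss(a, b, margin=0.1):
    """Hinge ranking loss in both directions (a -> b and b -> a).

    Assumption: row i of `a` and row i of `b` describe the same image,
    and every other row is treated as a negative example.
    """
    sim = l2_normalize(a) @ l2_normalize(b).T      # cosine similarities, shape (n, n)
    pos = np.diag(sim)                             # similarity of matched pairs
    # Query a_i: a wrong b_j should score at least `margin` below the matching b_i.
    cost_ab = np.maximum(0.0, margin + sim - pos[:, None])
    # Query b_j: a wrong a_i should score at least `margin` below the matching a_j.
    cost_ba = np.maximum(0.0, margin + sim - pos[None, :])
    off_diag = ~np.eye(len(a), dtype=bool)         # exclude the matched pairs themselves
    return cost_ab[off_diag].sum() + cost_ba[off_diag].sum()

def three_view_loss(visual, text, relation, margin=0.1):
    """Sum the pairwise loss over the three view pairs (equal weights assumed)."""
    return (bidirectional_ranking_loss(visual, text, margin)
            + bidirectional_ranking_loss(visual, relation, margin)
            + bidirectional_ranking_loss(text, relation, margin))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical embeddings: 4 images, 8-dimensional joint space per view.
    visual, text, relation = (rng.normal(size=(4, 8)) for _ in range(3))
    print(three_view_loss(visual, text, relation))
```

In this sketch the loss is zero when every matched pair is more similar than every mismatched pair by at least the margin, in all three view pairs and in both directions. The reconstruction term from the stacked autoencoder is deliberately left out.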