Learning Multimodal Neural Network with Ranking Examples

Cited by: 7
Authors
Lu, Xinyan [1]
Wu, Fei [1]
Li, Xi [1]
Zhang, Yin [1]
Lu, Weiming [1]
Wang, Donghui [1]
Zhuang, Yueting [1]
Affiliation
[1] Zhejiang University, College of Computer Science, Hangzhou, Zhejiang, People's Republic of China
Source
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14) | 2014
Keywords
Cross-modal Ranking; Learning to rank; Representation Learning;
DOI
10.1145/2647868.2655001
Chinese Library Classification number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
To support cross-modal information retrieval, cross-modal learning-to-rank approaches use ranking examples (e.g., a text query and its corresponding ranked images) to learn an appropriate ranking (similarity) function. However, each modality is represented with intrinsically different low-level features, which hinders these approaches from reducing the heterogeneity gap between the modalities and thus from giving satisfactory retrieval results. In this paper, we consider learning with neural networks from the perspective of optimizing the listwise ranking loss of the cross-modal ranking examples. The proposed model, named Cross-Modal Ranking Neural Network (CMRNN), benefits from advances both in neural networks for learning high-level semantics and in learning-to-rank techniques for learning ranking functions, such that the learned cross-modal ranking function is implicitly embedded in the learned high-level representation of data objects from different modalities (e.g., text and imagery) and can be used to perform cross-modal retrieval directly. We compare CMRNN to existing state-of-the-art cross-modal ranking methods on two datasets and show that it achieves better performance.
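The abstract does not say which listwise loss or network architecture CMRNN uses, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a ListNet-style top-one cross-entropy loss, a single tanh layer per modality, and dot-product similarity in the shared space. The names `embed`, `listwise_loss`, `W_text`, and `W_image`, along with all weights and feature values, are hypothetical.

```python
import math


def embed(x, W):
    """Map a low-level feature vector into the shared space with one tanh layer."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W]


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def listwise_loss(query_vec, doc_vecs, relevance):
    """Top-one cross-entropy between the relevance and score distributions.

    Assumption: scores are dot products in the shared space. The loss is
    smallest when the score distribution matches the relevance distribution.
    """
    scores = [sum(q * d for q, d in zip(query_vec, dv)) for dv in doc_vecs]
    target = softmax(relevance)
    predicted = softmax(scores)
    return -sum(t * math.log(p) for t, p in zip(target, predicted))


# Hypothetical weights: 3-d text features and 4-d image features, both mapped to 2-d.
W_text = [[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]]
W_image = [[0.2, 0.4, -0.1, 0.3], [-0.6, 0.1, 0.7, 0.2]]

# One ranking example: a text query and three images with graded relevance.
text_query = [1.0, 0.0, 2.0]
images = [
    [0.9, 0.1, 0.0, 0.5],
    [0.2, 0.7, 0.3, 0.1],
    [0.0, 0.1, 0.9, 0.4],
]
relevance = [2.0, 1.0, 0.0]

query_vec = embed(text_query, W_text)
image_vecs = [embed(img, W_image) for img in images]
loss = listwise_loss(query_vec, image_vecs, relevance)
print(loss)
```

In a trainable version, the gradient of this loss with respect to `W_text` and `W_image` would update both branches jointly, which is one way the ranking function could end up embedded in the learned representations as the abstract describes.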
Pages: 985-988
Page count: 4