MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video

Cited by: 344
Authors
Wei, Yinwei [1]
Wang, Xiang [2]
Nie, Liqiang [1]
He, Xiangnan [3]
Hong, Richang [4]
Chua, Tat-Seng [2]
Affiliations
[1] Shandong Univ, Jinan, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Peoples R China
[4] Hefei Univ Technol, Hefei, Peoples R China
Funding
National Research Foundation, Singapore; National Natural Science Foundation of China
Keywords
Graph Convolution Network; Multi-modal Recommendation; Micro-video Understanding
DOI
10.1145/3343031.3351034
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Personalized recommendation plays a central role in many online content sharing platforms. To provide quality micro-video recommendation service, it is of crucial importance to consider the interactions between users and items (i.e., micro-videos) as well as the item contents from various modalities (e.g., visual, acoustic, and textual). Existing works on multimedia recommendation largely exploit multi-modal contents to enrich item representations, while less effort is made to leverage information interchange between users and items to enhance user representations and further capture users' fine-grained preferences on different modalities. In this paper, we propose to exploit user-item interactions to guide the representation learning in each modality, and to further personalize micro-video recommendation. We design a Multimodal Graph Convolution Network (MMGCN) framework built upon the message-passing idea of graph neural networks, which can yield modal-specific representations of users and micro-videos to better capture user preferences. Specifically, we construct a user-item bipartite graph in each modality, and enrich the representation of each node with the topological structure and features of its neighbors. Through extensive experiments on three publicly available datasets, Tiktok, Kwai, and MovieLens, we demonstrate that our proposed model significantly outperforms state-of-the-art multi-modal recommendation methods.
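The idea of building a user-item bipartite graph per modality and enriching each node with its neighbors' features can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`build_bipartite_graph`, `aggregate_once`, `modal_representations`, `score`), the plain mean aggregation, the equal-weight combination of a node's own vector with its neighborhood mean, and the summed dot-product score are all assumptions chosen for brevity. The abstract does not specify the actual aggregation or combination layers.

```python
from collections import defaultdict


def build_bipartite_graph(interactions):
    """Build undirected adjacency sets from (user, item) pairs.

    Nodes are tagged tuples, ("u", id) or ("i", id), so that user and
    item ids can never collide.
    """
    neighbors = defaultdict(set)
    for user, item in interactions:
        neighbors[("u", user)].add(("i", item))
        neighbors[("i", item)].add(("u", user))
    return neighbors


def aggregate_once(neighbors, features):
    """One round of message passing (assumed rule, not the paper's layer).

    Each node's new vector is the average of its own vector and the mean
    of its neighbors' vectors. All updates read the old features.
    """
    updated = {}
    for node, own in features.items():
        nbr_vecs = [features[n] for n in neighbors.get(node, ()) if n in features]
        if not nbr_vecs:
            updated[node] = list(own)  # isolated node: keep as-is
            continue
        mean = [sum(col) / len(nbr_vecs) for col in zip(*nbr_vecs)]
        updated[node] = [(a + b) / 2 for a, b in zip(own, mean)]
    return updated


def modal_representations(interactions, modal_features):
    """Run the same graph aggregation separately in every modality."""
    neighbors = build_bipartite_graph(interactions)
    return {
        modality: aggregate_once(neighbors, feats)
        for modality, feats in modal_features.items()
    }


def score(reps, user, item):
    """Sum of per-modality dot products between a user and an item."""
    total = 0.0
    for feats in reps.values():
        u, v = feats[("u", user)], feats[("i", item)]
        total += sum(a * b for a, b in zip(u, v))
    return total


if __name__ == "__main__":
    # Toy data: two users, two items, two modalities (all values made up).
    interactions = [(0, 0), (0, 1), (1, 1)]
    modal_features = {
        "visual": {("u", 0): [0.0, 0.0], ("u", 1): [0.0, 0.0],
                   ("i", 0): [2.0, 0.0], ("i", 1): [0.0, 2.0]},
        "acoustic": {("u", 0): [1.0], ("u", 1): [1.0],
                     ("i", 0): [1.0], ("i", 1): [3.0]},
    }
    reps = modal_representations(interactions, modal_features)
    print(score(reps, 1, 1), score(reps, 1, 0))
```

In this toy setup, user 1 has interacted only with item 1, so after one aggregation round the user's vector in each modality moves toward that item's features and the item scores higher than item 0. Keeping one feature table per modality is what lets the user representation differ between, say, visual and acoustic content.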
Pages: 1437-1445
Page count: 9
Related papers
50 in total
  • [41] Collaborative denoised graph contrastive learning for multi-modal recommendation
    Xu, Fuyong
    Zhu, Zhenfang
    Fu, Yixin
    Wang, Ru
    Liu, Peiyu
    INFORMATION SCIENCES, 2024, 679
  • [42] A Unified Graph Transformer for Overcoming Isolations in Multi-modal Recommendation
    Yi, Zixuan
    Ounis, Iadh
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 518 - 527
  • [43] Guiding Graph Learning with Denoised Modality for Multi-modal Recommendation
    Wang, Yuexian
    Ma, Wenze
    Zhu, Yanmin
    Wang, Chunyang
    Wang, Zhaobo
    Tang, Feilong
    Yu, Jiadi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT VI, DASFAA 2024, 2024, 14855 : 220 - 235
  • [44] Video recommendation based on multi-modal information and multiple kernel
    Zhan Li
    Jin-Ye Peng
    Guo-Hua Geng
    Xiao-Jiang Chen
    Pan-Pan Zheng
    Multimedia Tools and Applications, 2015, 74 : 4599 - 4616
  • [45] MULTI-MODAL REPRESENTATION LEARNING FOR SHORT VIDEO UNDERSTANDING AND RECOMMENDATION
    Guo, Daya
    Hong, Jiangshui
    Luo, Binli
    Yan, Qirui
    Niu, Zhangming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 687 - 690
  • [46] Video recommendation based on multi-modal information and multiple kernel
    Li, Zhan
    Peng, Jin-Ye
    Geng, Guo-Hua
    Chen, Xiao-Jiang
    Zheng, Pan-Pan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (13) : 4599 - 4616
  • [47] Multi-modal visual adversarial Bayesian personalized ranking model for recommendation
    Li, Guangli
    Zhuo, Jianwu
    Li, Chuanxiu
    Hua, Jin
    Yuan, Tian
    Niu, Zhengyu
    Ji, Donghong
    Wu, Renzhong
    Zhang, Hongbin
    INFORMATION SCIENCES, 2021, 572 : 378 - 403
  • [48] TriGCN: Graph Convolution Network Based on Tripartite Graph for Personalized Medicine Recommendation System
    Zhou, Huan
    Liao, Sisi
    Guo, Fanying
    SYSTEMS, 2024, 12 (10):
  • [49] Object Interaction Recommendation with Multi-Modal Attention-based Hierarchical Graph Neural Network
    Zhang, Huijuan
    Liang, Lipeng
    Wang, Dongqing
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 295 - 305
  • [50] Alleviating Video-length Effect for Micro-video Recommendation
    Quan, Yuhan
    Ding, Jingtao
    Gao, Chen
    Li, Nian
    Yi, Lingling
    Jin, Depeng
    Li, Yong
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)