MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video

被引:344
|
作者
Wei, Yinwei [1 ]
Wang, Xiang [2 ]
Nie, Liqiang [1 ]
He, Xiangnan [3 ]
Hong, Richang [4 ]
Chua, Tat-Seng [2 ]
机构
[1] Shandong Univ, Jinan, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Peoples R China
[4] Hefei Univ Technol, Hefei, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Graph Convolution Network; Multi-modal Recommendation; Micro-video Understanding;
D O I
10.1145/3343031.3351034
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Personalized recommendation plays a central role in many online content sharing platforms. To provide quality micro-video recommendation service, it is of crucial importance to consider the interactions between users and items (i.e., micro-videos) as well as the item contents from various modalities (e.g., visual, acoustic, and textual). Existing works on multimedia recommendation largely exploit multi-modal contents to enrich item representations, while less effort is made to leverage information interchange between users and items to enhance user representations and further capture user's fine-grained preferences on different modalities. In this paper, we propose to exploit user-item interactions to guide the representation learning in each modality, and further personalized micro-video recommendation. We design a Multimodal Graph Convolution Network (MMGCN) framework built upon the message-passing idea of graph neural networks, which can yield modal-specific representations of users and micro-videos to better capture user preferences. Specifically, we construct a user-item bipartite graph in each modality, and enrich the representation of each node with the topological structure and features of its neighbors. Through extensive experiments on three publicly available datasets, Tiktok, Kwai, and MovieLens, we demonstrate that our proposed model is able to significantly outperform state-of-the-art multi-modal recommendation methods.
引用
收藏
页码:1437 / 1445
页数:9
相关论文
共 50 条
  • [21] Personalized Micro-Video Recommendation via Hierarchical User Interest Modeling
    Huang, Lei
    Luo, Bin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 564 - 574
  • [22] Heterogeneous-Grained Multi-Modal Graph Network for Outfit Recommendation
    Xu, Rucong
    Wang, Jianfeng
    Li, Yun
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1788 - 1799
  • [23] Enhancing Micro-Video Venue Recognition via Multi-Modal and Multi-Granularity Object Relations
    Liu, Weijia
    Cao, Jiuxin
    Wei, Ran
    Zhu, Xuelin
    Liu, Bo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5440 - 5451
  • [24] Multi-Modal Correction Network for Recommendation
    Wang, Zengmao
    Feng, Yunzhen
    Zhang, Xin
    Yang, Renjie
    Du, Bo
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (02) : 810 - 822
  • [25] MMGCN: Multi-modal multi-view graph convolutional networks for cancer prognosis prediction
    Yang, Ping
    Chen, Wengxiang
    Qiu, Hang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 257
  • [26] Multi-modal sequence model with gated fully convolutional blocks for micro-video venue classification
    Wei Liu
    Xianglin Huang
    Gang Cao
    Jianglong Zhang
    Gege Song
    Lifang Yang
    Multimedia Tools and Applications, 2020, 79 : 6709 - 6726
  • [27] Multi-modal sequence model with gated fully convolutional blocks for micro-video venue classification
    Liu, Wei
    Huang, Xianglin
    Cao, Gang
    Zhang, Jianglong
    Song, Gege
    Yang, Lifang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (9-10) : 6709 - 6726
  • [28] Optimizing Personalized E-Commerce Micro-Video Recommendation with Self-Adaption Generative Gating Graph
    Chen, Peng
    Tan, Yingshui
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1095 - 1106
  • [29] Multi-trends Enhanced Dynamic Micro-video Recommendation
    Lu, Yujie
    Huang, Yingxuan
    Zhang, Shengyu
    Han, Wei
    Chen, Hui
    Fan, Wenyan
    Lai, Jiangliang
    Zhao, Zhou
    Wu, Fei
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 430 - 441
  • [30] MMM-GCN: Multi-Level Multi-Modal Graph Convolution Network for Video-Based Person Identification
    Liao, Ziyan
    Di, Dening
    Hao, Jingsong
    Zhang, Jiang
    Zhu, Shulei
    Yin, Jun
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 3 - 15