MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video

被引:344
|
作者
Wei, Yinwei [1 ]
Wang, Xiang [2 ]
Nie, Liqiang [1 ]
He, Xiangnan [3 ]
Hong, Richang [4 ]
Chua, Tat-Seng [2 ]
机构
[1] Shandong Univ, Jinan, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Peoples R China
[4] Hefei Univ Technol, Hefei, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Graph Convolution Network; Multi-modal Recommendation; Micro-video Understanding;
D O I
10.1145/3343031.3351034
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Personalized recommendation plays a central role in many online content sharing platforms. To provide quality micro-video recommendation service, it is of crucial importance to consider the interactions between users and items (i.e., micro-videos) as well as the item contents from various modalities (e.g., visual, acoustic, and textual). Existing works on multimedia recommendation largely exploit multi-modal contents to enrich item representations, while less effort is made to leverage information interchange between users and items to enhance user representations and further capture user's fine-grained preferences on different modalities. In this paper, we propose to exploit user-item interactions to guide the representation learning in each modality, and further personalized micro-video recommendation. We design a Multimodal Graph Convolution Network (MMGCN) framework built upon the message-passing idea of graph neural networks, which can yield modal-specific representations of users and micro-videos to better capture user preferences. Specifically, we construct a user-item bipartite graph in each modality, and enrich the representation of each node with the topological structure and features of its neighbors. Through extensive experiments on three publicly available datasets, Tiktok, Kwai, and MovieLens, we demonstrate that our proposed model is able to significantly outperform state-of-the-art multi-modal recommendation methods.
引用
收藏
页码:1437 / 1445
页数:9
相关论文
共 50 条
  • [31] A hybrid filtering for micro-video hashtag recommendation using graph-based deep neural network
    Bansal, Shubhi
    Gowda, Kushaan
    Rehman, Mohammad Zia Ur
    Raghaw, Chandravardhan Singh
    Kumar, Nagendra
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [32] Multi-modal Graph and Sequence Fusion Learning for Recommendation
    Wang, Zejun
    Wu, Xinglong
    Yang, Hongwei
    He, Hui
    Tai, Yu
    Zhang, Weizhe
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 357 - 369
  • [33] Towards Developing a Multi-Modal Video Recommendation System
    Pingali, Sriram
    Mondal, Prabir
    Chakder, Daipayan
    Saha, Sriparna
    Ghosh, Angshuman
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [34] Personalized Multi-modal Video Retrieval on Mobile Devices
    Zhang, Haotian
    Jepson, Allan D.
    Mohomed, Iqbal
    Derpanis, Konstantinos G.
    Zhang, Ran
    Fazly, Afsaneh
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1185 - 1191
  • [35] Meta-path based graph contrastive learning for micro-video recommendation
    He, Ying
    Wu, Gongqing
    Cai, Desheng
    Hu, Xuegang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 222
  • [36] Modeling multi-behavior sequence via HyperGRU contrastive network for micro-video recommendation
    Gu, Pan
    Hu, Haiyang
    Xu, Guandong
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [37] Personalized Context-Aware Multi-Modal Transportation Recommendation
    Chen, Xianda
    Zhu, Meixin
    Tiu, PakHin
    Wang, Yinhai
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 3276 - 3281
  • [38] Personalized clothing matching recommendation based on multi-modal fusion
    Liu J.
    Zhang F.
    Hu X.
    Peng T.
    Li L.
    Zhu Q.
    Zhang J.
    Fangzhi Xuebao/Journal of Textile Research, 2023, 44 (03): : 176 - 186
  • [39] Hierarchical Multi-Modal Attention Network for Time-Sync Comment Video Recommendation
    Zhao, Weihao
    Wu, Han
    He, Weidong
    Bi, Haoyang
    Wang, Hao
    Zhu, Chen
    Xu, Tong
    Chen, Enhong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2694 - 2705
  • [40] Multi-Modal Transportation Recommendation Based on Graph Embedding and CaGBDT
    Sun Q.-M.
    Chang L.
    Ma C.
    Qu Z.-J.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2021, 44 (05): : 81 - 87and106