Deconfounded Cross-modal Matching for Content-based Micro-video Background Music Recommendation

被引:0
|
作者
Yi, Jing [1 ]
Chen, Zhenzhong [1 ,2 ,3 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Luoyu Rd 129, Wuhan 430079, Hubei, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Luoyu Rd 129, Wuhan 430079, Hubei, Peoples R China
[3] Hubei Luojia Lab, Luoyu Rd 129, Wuhan 430079, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal matching; debiased recommender systems; knowledge distillation; variational auto-encoder;
D O I
10.1145/3650042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object-oriented micro-video background music recommendation is a complicated task where the matching degree between videos and background music is a major issue. However, music selections in user-generated content ( UGC) are prone to selection bias caused by historical preferences of uploaders. Since historical preferences are not fully reliable and may reflect obsolete behaviors, over-reliance on them should be avoided as knowledge and interests dynamically evolve. In this article, we propose a Deconfounded Cross-Modal matching model to mitigate such bias. Specifically, uploaders' personal preferences of music genres are identified as confounders that spuriously correlate music embeddings and background music selections, causing the learned system to over-recommend music from majority groups. To resolve such confounders, backdoor adjustment is utilized to deconfound the spurious correlation between music embeddings and prediction scores. We further utilize Monte Carlo estimator with batch-level average as the approximations to avoid integrating the entire confounder space calculated by the adjustment. Furthermore, we design a teacher-student network to utilize the matching of music videos, which is professionally generated content (PGC) with specialized matching, to better recommend content-matching background music. The PGC data are modeled by a teacher network to guide the matching of uploader-selected UGC data of student network by KullbackLeibler-based knowledge transfer. Extensive experiments on the TT-150k-genre dataset demonstrate the effectiveness of the proposed method. The code is publicly available on https://github.com/jing- 1/DecCM
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Implicit Rating Methods Based on Interest Preferences of Categories for Micro-Video Recommendation
    Chen, Jie
    Peng, Junjie
    Qi, Lizhe
    Chen, Gan
    Zhang, Wenqiang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 371 - 381
  • [22] Cross-modal contrastive learning for aspect-based recommendation
    Won, Heesoo
    Oh, Byungkook
    Yang, Hyeongjun
    Lee, Kyong-Ho
    INFORMATION FUSION, 2023, 99
  • [23] Video-Based Cross-Modal Recipe Retrieval
    Cao, Da
    Yu, Zhiwang
    Zhang, Hanling
    Fang, Jiansheng
    Nie, Liqiang
    Tian, Qi
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1685 - 1693
  • [24] Cross-Modal Content Inference and Feature Enrichment for Cold-Start Recommendation
    Ma, Haokai
    Qi, Zhuang
    Dong, Xinxin
    Li, Xiangxian
    Zheng, Yuze
    Meng, Xiangxu
    Meng, Lei
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [25] Music recommendation using dynamic feedback and content-based filtering
    Magadum, Hrishikesh
    Azad, Hiteshwar Kumar
    Patel, Harpal
    Rohan, H. R.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (32) : 77469 - 77488
  • [26] Effective social content-based collaborative filtering for music recommendation
    Su, Ja-Hwung
    Chang, Wei-Yi
    Tseng, Vincent S.
    INTELLIGENT DATA ANALYSIS, 2017, 21 : S195 - S216
  • [27] Towards Developing a Content-Based Recommendation System for Classical Music
    Cruz, Ana Felicia T.
    Coronel, Andrei D.
    INFORMATION SCIENCE AND APPLICATIONS, 2020, 621 : 451 - 462
  • [28] A Kernel Framework for Content-Based Artist Recommendation System in Music
    Chen, Zhi-Sheng
    Jang, Jyh-Shing Roger
    Lee, Chin-Hui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (06) : 1371 - 1380
  • [29] Music-CRN: an Efficient Content-Based Music Classification and Recommendation Network
    Yuxu Mao
    Guoqiang Zhong
    Haizhen Wang
    Kaizhu Huang
    Cognitive Computation, 2022, 14 : 2306 - 2316
  • [30] Music-CRN: an Efficient Content-Based Music Classification and Recommendation Network
    Mao, Yuxu
    Zhong, Guoqiang
    Wang, Haizhen
    Huang, Kaizhu
    COGNITIVE COMPUTATION, 2022, 14 (06) : 2306 - 2316