An Audiovisual Correlation Matching Method Based on Fine-Grained Emotion and Feature Fusion

被引:1
|
作者
Su, Zhibin [1 ,2 ,3 ]
Feng, Yiming [2 ,3 ]
Liu, Jinyu [2 ,3 ]
Peng, Jing [3 ]
Jiang, Wei [1 ,2 ,3 ]
Liu, Jingyu [1 ,2 ,3 ]
机构
[1] State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
[2] Minist Culture & Tourism, Key Lab Acoust Visual Technol & Intelligent Contro, Beijing 100024, Peoples R China
[3] Commun Univ China, Sch Informat & Commun Engn, Beijing 100024, Peoples R China
基金
中国国家自然科学基金;
关键词
fine-grained affects; music-video matching; audiovisual association; CCA feature fusion; factor analysis; hybrid matching model; affective similarity;
D O I
10.3390/s24175681
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Most existing intelligent editing tools for music and video rely on the cross-modal matching technology of the affective consistency or the similarity of feature representations. However, these methods are not fully applicable to complex audiovisual matching scenarios, resulting in low matching accuracy and suboptimal audience perceptual effects due to ambiguous matching rules and associated factors. To address these limitations, this paper focuses on both the similarity and integration of affective distribution for the artistic audiovisual works of movie and television video and music. Based on the rich emotional perception elements, we propose a hybrid matching model based on feature canonical correlation analysis (CCA) and fine-grained affective similarity. The model refines KCCA fusion features by analyzing both matched and unmatched music-video pairs. Subsequently, the model employs XGBoost to predict relevance and to compute similarity by considering fine-grained affective semantic distance as well as affective factor distance. Ultimately, the matching prediction values are obtained through weight allocation. Experimental results on a self-built dataset demonstrate that the proposed affective matching model balances feature parameters and affective semantic cognitions, yielding relatively high prediction accuracy and better subjective experience of audiovisual association. This paper is crucial for exploring the affective association mechanisms of audiovisual objects from a sensory perspective and improving related intelligent tools, thereby offering a novel technical approach to retrieval and matching in music-video editing.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] A fine-grained human facial key feature extraction and fusion method for emotion recognition
    Shiwei Li
    Jisen Wang
    Linbo Tian
    Jianqiang Wang
    Yan Huang
    Scientific Reports, 15 (1)
  • [2] Partial Multimodal Hashing Based on Fine-grained Feature Fusion
    Yin Z.-Z.
    Li B.-H.
    Wang M.
    Huang R.-L.
    Wu W.-L.
    Wang H.-F.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (03): : 1074 - 1089
  • [3] Detecting Android malware: A multimodal fusion method with fine-grained feature
    Li, Xun
    Liu, Lei
    Liu, Yuzhou
    Liu, Huaxiao
    Information Fusion, 2025, 114
  • [4] Fine-Grained Image Classification Based on Feature Fusion and Ensemble Learning
    Zhang, Wenli
    Wei, Song
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (22)
  • [5] Fine-Grained Image Classification Based on Target Acquisition and Feature Fusion
    Chu, Yan
    Wang, Zhengkui
    Wang, Lina
    Zhao, Qingchao
    Shan, Wen
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 209 - 221
  • [6] Target Detection Optimization Model Based On Fine-grained Feature Fusion
    Bao, Xianfu
    Qiang, Zanxia
    Bai, Guangyao
    Yang, Rui
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND INTELLIGENT CONTROL (IPIC 2021), 2021, 11928
  • [7] A Feature Fusion Method Based on Multi-Classification Losses for Fine-Grained Visual Categorization
    Zhu, Mengmeng
    Wan, Shouhong
    Jin, Peiquan
    Tian, Qijun
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 6072 - 6074
  • [8] A cross-granularity feature fusion method for fine-grained image recognition
    Wu, Shan
    Hu, Jun
    Sun, Chen
    Zhong, Fujin
    Zhang, Qinghua
    Wang, Guoyin
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [9] A NOVEL PART FEATURE INTEGRATION AND FUSION METHOD FOR FINE-GRAINED VEHICLE RECOGNITION
    Wang, Ping
    Cao, Yijie
    Lu, Lei
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1990 - 1994
  • [10] Dynamic Pavement Distress Image Stitching Based on Fine-Grained Feature Matching
    Du, Yuchuan
    Weng, Zihang
    Liu, Chenglong
    Wu, Difei
    JOURNAL OF ADVANCED TRANSPORTATION, 2020, 2020