An Audiovisual Correlation Matching Method Based on Fine-Grained Emotion and Feature Fusion

被引:1
|
作者
Su, Zhibin [1 ,2 ,3 ]
Feng, Yiming [2 ,3 ]
Liu, Jinyu [2 ,3 ]
Peng, Jing [3 ]
Jiang, Wei [1 ,2 ,3 ]
Liu, Jingyu [1 ,2 ,3 ]
机构
[1] State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
[2] Minist Culture & Tourism, Key Lab Acoust Visual Technol & Intelligent Contro, Beijing 100024, Peoples R China
[3] Commun Univ China, Sch Informat & Commun Engn, Beijing 100024, Peoples R China
基金
中国国家自然科学基金;
关键词
fine-grained affects; music-video matching; audiovisual association; CCA feature fusion; factor analysis; hybrid matching model; affective similarity;
D O I
10.3390/s24175681
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Most existing intelligent editing tools for music and video rely on the cross-modal matching technology of the affective consistency or the similarity of feature representations. However, these methods are not fully applicable to complex audiovisual matching scenarios, resulting in low matching accuracy and suboptimal audience perceptual effects due to ambiguous matching rules and associated factors. To address these limitations, this paper focuses on both the similarity and integration of affective distribution for the artistic audiovisual works of movie and television video and music. Based on the rich emotional perception elements, we propose a hybrid matching model based on feature canonical correlation analysis (CCA) and fine-grained affective similarity. The model refines KCCA fusion features by analyzing both matched and unmatched music-video pairs. Subsequently, the model employs XGBoost to predict relevance and to compute similarity by considering fine-grained affective semantic distance as well as affective factor distance. Ultimately, the matching prediction values are obtained through weight allocation. Experimental results on a self-built dataset demonstrate that the proposed affective matching model balances feature parameters and affective semantic cognitions, yielding relatively high prediction accuracy and better subjective experience of audiovisual association. This paper is crucial for exploring the affective association mechanisms of audiovisual objects from a sensory perspective and improving related intelligent tools, thereby offering a novel technical approach to retrieval and matching in music-video editing.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Multi-Scale Feature Transformer Based Fine-Grained Image Classification Method
    Zhang T.
    Cai C.
    Luo X.
    Zhu Y.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (04): : 70 - 75
  • [42] Fine-grained image recognition method for digital media based on feature enhancement strategy
    Tieyu Zhou
    Linyi Gao
    Ranjun Hua
    Junhong Zhou
    Jinao Li
    Yawen Guo
    Yan Zhang
    Neural Computing and Applications, 2024, 36 : 2323 - 2335
  • [43] Fine-grained pornographic image recognition with multiple feature fusion transfer learning
    Xinnan Lin
    Feiwei Qin
    Yong Peng
    Yanli Shao
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 73 - 86
  • [44] Fine-grained pornographic image recognition with multiple feature fusion transfer learning
    Lin, Xinnan
    Qin, Feiwei
    Peng, Yong
    Shao, Yanli
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (01) : 73 - 86
  • [45] CANCEREMO : A Dataset for Fine-Grained Emotion Detection
    Sosea, Tiberiu
    Caragea, Cornelia
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8892 - 8904
  • [46] Complemental Attention Multi-Feature Fusion Network for Fine-Grained Classification
    Miao, Zhuang
    Zhao, Xun
    Wang, Jiabao
    Li, Yang
    Li, Hang
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1983 - 1987
  • [47] Multilayer feature fusion with parallel convolutional block for fine-grained image classification
    Wang, Lei
    He, Kai
    Feng, Xu
    Ma, Xitao
    APPLIED INTELLIGENCE, 2022, 52 (03) : 2872 - 2883
  • [48] Fine-Grained Visual Categorization: A Spatial-Frequency Feature Fusion Perspective
    Wang, Min
    Zhao, Peng
    Lu, Xin
    Min, Fan
    Wang, Xizhao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2798 - 2812
  • [49] Multilayer feature fusion with parallel convolutional block for fine-grained image classification
    Lei Wang
    Kai He
    Xu Feng
    Xitao Ma
    Applied Intelligence, 2022, 52 : 2872 - 2883
  • [50] Multilayer feature descriptors fusion CNN models for fine-grained visual recognition
    Hou, Yong
    Luo, Hangzai
    Zhao, Wanqing
    Zhang, Xiang
    Wang, Jun
    Peng, Jinye
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2019, 30 (3-4)