Music genre classification based on fusing audio and lyric information

被引:0
|
作者
You Li
Zhihai Zhang
Han Ding
Liang Chang
机构
[1] Guilin University of Electronic Technology,Guangxi Key Laboratory of Trusted Software
[2] Guilin University of Electronic Technology,School of Electronic Engineering and Automation
来源
关键词
Music genre classification; Audio information; Lyric information; Information fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Music genre classification (MGC) has a wide range of application scenarios. Traditional MGC methods only consider either audio information or lyric information, resulting in an unsatisfactory recognition effect. In this paper, we propose a multimodal music genre classification framework that integrates both audio information and lyric information. By using the complementarity of multimodal information, music genres can be represented more comprehensively. First, the framework extracts the mel-spectrogram of audio, and a convolutional neural network is used to extract audio features. Simultaneously, BERT is used to obtain the distributed representation of the lyrics. Then, the two modal pieces of information are fused through different strategies, such as at the feature level and decision level. To solve the serious inconsistency between the convergence speed of the audio channel and the lyric channel, we adopt the strategy of asynchronous start training of two channels and different learning rates. A series of experiments are carried out to verify the effectiveness of the proposed model. The F1 score of the proposed model is 0.87 for music genre classification, which is approximately 4% higher than that of the best baseline in the experiment.
引用
收藏
页码:20157 / 20176
页数:19
相关论文
共 50 条
  • [41] Improving Automatic Music Genre Classification Systems by Using Descriptive Statistical Features of Audio Signals
    Perera, Ravindu
    Wickramasinghe, Manjusri
    Jayaratne, Lakshman
    ARTIFICIAL INTELLIGENCE IN MUSIC, SOUND, ART AND DESIGN, EVOMUSART 2023, 2023, 13988 : 399 - 412
  • [42] CULTURAL STYLE BASED MUSIC CLASSIFICATION OF AUDIO SIGNALS
    Liu, Yuxiang
    Xiang, Qiaoliang
    Wang, Ye
    Cai, Lianhong
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 57 - +
  • [43] Music Genre Classification via Joint Sparse Low-Rank Representation of Audio Features
    Panagakis, Yannis
    Kotropoulos, Constantine L.
    Arce, Gonzalo R.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1905 - 1917
  • [44] A Survey of Audio-Based Music Classification and Annotation
    Fu, Zhouyu
    Lu, Guojun
    Ting, Kai Ming
    Zhang, Dengsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) : 303 - 319
  • [45] Research on Music Emotion Classification Based on Lyrics and Audio
    Shi, Wanglei
    Feng, Shuang
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1154 - 1159
  • [46] A sequential naive Bayes method for music genre classification based on transitional information from pitch and beat
    Ren, Tunan
    Wang, Feifei
    Wang, Hansheng
    STATISTICS AND ITS INTERFACE, 2020, 13 (03) : 361 - 371
  • [47] Web Application for Machine Learning based Music Genre Classification
    Chauhan, Jugal
    Shah, Jash
    Mundhe, Eeshan
    Jain, Ishan
    2021 7th IEEE International Conference on Advances in Computing, Communication and Control, ICAC3 2021, 2021,
  • [48] Music Genre Classification Algorithm Based on Multihead Attention Mechanism
    Cheng, Peng
    ADVANCES IN MULTIMEDIA, 2022, 2022
  • [49] Music Genre Classification using EMD and Pitch Based Feature
    Sarkar, Rajib
    Saha, Sanjoy Kumar
    2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 257 - +
  • [50] A survey on symbolic data-based music genre classification
    Correa, Debora C.
    Ap Rodrigues, Francisco
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 60 : 190 - 210