Deep attention based music genre classification

被引:46
|
作者
Yu, Yang [1 ]
Luo, Sen [2 ]
Liu, Shenglan [2 ]
Qiao, Hong [3 ]
Liu, Yang [2 ]
Feng, Lin [2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Sch Innovat & Enterpreneurship, Dalian 116024, Peoples R China
[3] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Inst Automat, Beijing 100190, Peoples R China
关键词
Music genre classification; Deep neural networks; Serial attention; Parallelized attention; FEATURES; NETWORKS;
D O I
10.1016/j.neucom.2019.09.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an important component of music information retrieval, music genre classification attracts great attentions these years. Benefitting from the outstanding performance of deep neural networks in computer vision, some researchers apply CNN on music genre classification tasks with audio spectrograms as input instead, which has similarities with RGB images. These methods are based on a latent assumption that spectrums with different temporal steps have equal importance. However, it goes against the theory of processing bottleneck in psychology as well as our observation from audio spectrograms. By considering the differences of spectrums, we propose a new model incorporating with attention mechanism based on Bidirectional Recurrent Neural Network. Furthermore, two attention-based models (serial attention and parallelized attention) are implemented in this paper. Comparing with serial attention, parallelized attention is more flexible and gets better results in our experiments. Especially, the CNN-based parallelized attention models with taking STFT spectrograms as input outperform the previous work. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [41] Music genre classification using deep learning: a comparative analysis of CNNs and RNNs
    Xu, Wenyi
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [42] Automated Music Genre Classification using Modified MobileNet Deep Learning Model
    Bohra, Manvi
    Kumar, Indrajeet
    Shivam
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 767 - 772
  • [43] Evaluation of Music Features for PUK Kernel based Genre Classification
    Chapaneri, Santhosh
    Lopes, Renia
    Jayaswal, Deepak
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING TECHNOLOGIES AND APPLICATIONS (ICACTA), 2015, 45 : 186 - 196
  • [44] Generative Adversarial Networks Based Framework for Music Genre Classification
    Pulkit Dwivedi
    Benazir Islam
    SN Computer Science, 5 (8)
  • [45] Artificial Immune System-Based Music Genre Classification
    Sotiropoulos, D. N.
    Lampropoulos, A. S.
    Tsihrintzis, G. A.
    NEW DIRECTIONS IN INTELLIGENT INTERACTIVE MULTIMEDIA, 2008, 142 : 191 - 200
  • [46] Music genre classification based on fusing audio and lyric information
    Li, You
    Zhang, Zhihai
    Ding, Han
    Chang, Liang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20157 - 20176
  • [47] Unsupervised Music Genre Classification with a Model-Based Approach
    Barreira, Luis
    Cavaco, Sofia
    da Silva, Joaquim Ferreira
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BOOK, 2011, 7026 : 268 - 281
  • [48] Symbolic music genre classification based on note pitch and duration
    Karydis, Ioannis
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4152 : 329 - 338
  • [49] Music Genre Classification Based on VMD-IWOA-XGBOOST
    Gan, Rumeijiang
    Huang, Tichen
    Shao, Jin
    Wang, Fuyu
    MATHEMATICS, 2024, 12 (10)
  • [50] Research on music genre recognition method based on deep learning
    Guo, Yuchen
    MCB Molecular and Cellular Biomechanics, 2024, 21 (01):