Music style classification by jointly using CNN and Transformer

被引:0
|
作者
Tang, Rui [1 ]
Qi, Miao [1 ]
Wang, Qingnan [1 ]
机构
[1] Northeast Normal Univ, Coll Informat Sci & Technol, Changchun 130117, Peoples R China
关键词
Music style; Audio classification; CNN; Transformer;
D O I
10.1145/3651671.3651696
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Music influences people in many ways and plays an important role in human life from emotional expression to social interaction to cognitive development. However, the variety of musical styles is often difficult to distinguish. In this paper, different from existing methods that music presented in the form of audio information can be classified as a sequence of features divided by time through RNN or LSTM, a novel music style classification method is proposed by transforming music audio into audio image. Moreover, Convolutional Neural Network (CNN) and Transformer are combined to jointly extract rich audio image features for music style classification. The effectiveness of the proposed method is evaluated by a large number of ablation and comparative experiments. The experimental results demonstrate that the classification accuracy of our proposed method can achieve satisfactory classification accuracy and is better than some existing classification methods on GTZAN dataset.
引用
收藏
页码:707 / 712
页数:6
相关论文
共 50 条
  • [41] CULTURAL STYLE BASED MUSIC CLASSIFICATION OF AUDIO SIGNALS
    Liu, Yuxiang
    Xiang, Qiaoliang
    Wang, Ye
    Cai, Lianhong
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 57 - +
  • [42] Artificial Neural Network for Folk Music Style Classification
    Ning, Qinliang
    Shi, Junyan
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [43] A Deep CNN Transformer Hybrid Model for Skin Lesion Classification of Dermoscopic Images Using Focal Loss
    Nie, Yali
    Sommella, Paolo
    Carratu, Marco
    O'Nils, Mattias
    Lundgren, Jan
    DIAGNOSTICS, 2023, 13 (01)
  • [44] Stylized Image Generation based on Music-image Synesthesia Emotional Style Transfer using CNN Network
    Xing, Baixi
    Dou, Jian
    Huang, Qing
    Si, Huahao
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (04): : 1464 - 1485
  • [45] 1D CNN Architectures for Music Genre Classification
    Allamy, Safaa
    Koerich, Alessandro Lameiras
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [46] A cable insulation defect classification method based on CNN-transformer
    Zhao, Ning
    Duan, Zhiguo
    Li, Qian
    Guo, Kang
    Zhang, Ziguang
    Liu, Baoan
    FRONTIERS IN PHYSICS, 2024, 12
  • [47] Interactive transformer and CNN network for fusion classification of hyperspectral and LiDAR data
    Wang, Leiquan
    Liu, Wenwen
    Lyu, Dong
    Zhang, Peiying
    Guo, Fangming
    Hu, Yabin
    Xu, Mingming
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024,
  • [48] Fine-Grained Ship Classification by Combining CNN and Swin Transformer
    Huang, Liang
    Wang, Fengxiang
    Zhang, Yalun
    Xu, Qingxia
    REMOTE SENSING, 2022, 14 (13)
  • [49] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [50] Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer
    Wu H.
    Dai S.
    Wang A.
    Yuji I.
    Yu X.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (07): : 1087 - 1100