Music style classification by jointly using CNN and Transformer

Authors
Tang, Rui [1]
Qi, Miao [1]
Wang, Qingnan [1]
Affiliations
[1] Northeast Normal Univ, Coll Informat Sci & Technol, Changchun 130117, Peoples R China
Keywords
Music style; Audio classification; CNN; Transformer
DOI
10.1145/3651671.3651696
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Music influences people in many ways and plays an important role in human life, from emotional expression to social interaction to cognitive development. However, the many musical styles are often difficult to tell apart. Existing methods treat music audio as a time-ordered sequence of features and classify it with an RNN or LSTM. In contrast, this paper proposes a novel music style classification method that transforms music audio into an audio image. A Convolutional Neural Network (CNN) and a Transformer are then combined to jointly extract rich audio image features for music style classification. The effectiveness of the proposed method is evaluated through a large number of ablation and comparative experiments. The experimental results demonstrate that the proposed method achieves satisfactory classification accuracy and outperforms some existing classification methods on the GTZAN dataset.
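The abstract does not say which kind of audio image the authors use. As a rough illustration of the general idea only, the sketch below assumes a log-magnitude short-time Fourier spectrogram, a common way to turn a waveform into a 2D array that an image model can consume. The function name `spectrogram` and the parameters `frame_len` and `hop` are hypothetical and do not come from the paper. The naive DFT is written for clarity, not speed.

```python
import cmath
import math


def spectrogram(samples, frame_len=64, hop=32):
    """Return a log-magnitude spectrogram as image[frequency_bin][frame]."""
    # Periodic Hann window to reduce spectral leakage at frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # non-negative frequencies only
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        column = []
        for k in range(n_bins):
            # Naive DFT of the windowed frame at frequency bin k.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            column.append(math.log1p(abs(acc)))  # compress dynamic range
        frames.append(column)
    # Transpose so rows are frequency bins and columns are time frames.
    return [list(row) for row in zip(*frames)]


# Hypothetical input: a 1000 Hz tone sampled at 8000 Hz (512 samples).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(512)]
image = spectrogram(tone)
print(len(image), "frequency bins x", len(image[0]), "time frames")
```

The resulting 2D array could then be passed to an image model such as a CNN or a Transformer. How the paper builds and fuses those two branches is not described in the abstract, so it is not sketched here.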
Pages: 707-712
Number of pages: 6