Music style classification by jointly using CNN and Transformer

Cited by: 0
Authors
Tang, Rui [1]
Qi, Miao [1]
Wang, Qingnan [1]
Affiliations
[1] Northeast Normal Univ, Coll Informat Sci & Technol, Changchun 130117, Peoples R China
Keywords
Music style; Audio classification; CNN; Transformer
DOI
10.1145/3651671.3651696
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Music influences people in many ways and plays an important role in human life, from emotional expression to social interaction to cognitive development. However, the many musical styles are often difficult to distinguish. Existing methods typically treat music audio as a time-ordered sequence of features and classify it with an RNN or LSTM. In contrast, this paper proposes a novel music style classification method that transforms music audio into an audio image. A Convolutional Neural Network (CNN) and a Transformer are then combined to jointly extract rich audio image features for music style classification. The effectiveness of the proposed method is evaluated through extensive ablation and comparative experiments. The experimental results demonstrate that the proposed method achieves satisfactory classification accuracy and outperforms some existing classification methods on the GTZAN dataset.
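The abstract does not say how the audio image is produced. As a rough illustration only, the sketch below assumes a log-magnitude short-time Fourier spectrogram, which is one common way to turn a waveform into a 2-D image; the function name `spectrogram`, the frame length, the hop size, and the test tone are all hypothetical and are not taken from the paper.

```python
import cmath
import math


def spectrogram(samples, frame_len=64, hop=32):
    """Return a log-magnitude spectrogram as a list of rows (one per time frame)."""
    # A Hann window reduces spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # keep only the non-negative frequencies
    image = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        row = []
        for k in range(n_bins):
            # Naive DFT for bin k; an FFT would normally be used instead.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            row.append(20 * math.log10(abs(acc) + 1e-10))  # magnitude in dB
        image.append(row)
    return image


# Hypothetical input: a 1 kHz sine tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(512)]
image = spectrogram(tone)
# Index of the strongest frequency bin in the first frame.
peak_bin = max(range(len(image[0])), key=lambda k: image[0][k])
```

With these assumed settings the image has 15 time frames of 33 frequency bins, and the strongest bin in the first frame is bin 8, which corresponds to 1000 Hz at 125 Hz per bin. A 2-D array of this kind is what a CNN or Transformer image model could take as input; the feature extractor actually used by the authors may differ.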
Pages: 707-712
Number of pages: 6