Musical Genre Classification Based on Deep Residual Auto-Encoder and Support Vector Machine

被引:0
|
作者
Han, Xue [1 ]
Chen, Wenzhuo [2 ]
Zhou, Changjian [2 ]
机构
[1] Northeast Agr Univ, Sch Arts, Harbin, Peoples R China
[2] Northeast Agr Univ, Dept Data & Comp, Harbin, Peoples R China
来源
关键词
Deep Residual Auto-Encoder; MFCC; Music Artificial Intelligence; Musical Genre Classification;
D O I
10.3745/JIPS.04.0300
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Music brings pleasure and relaxation to people. Therefore, it is necessary to classify musical genres based on scenes. Identifying favorite musical genres from massive music data is a time-consuming and laborious task. Recent studies have suggested that machine learning algorithms are effective in distinguishing between various musical genres. However, meeting the actual requirements in terms of accuracy or timeliness is challenging. In this study, a hybrid machine learning model that combines a deep residual auto -encoder (DRAE) and support vector machine (SVM) for musical genre recognition was proposed. Eight manually extracted features from the Mel -frequency cepstral coefficients (MFCC) were employed in the preprocessing stage as the hybrid music data source. During the training stage, DRAE was employed to extract feature maps, which were then used as input for the SVM classifier. The experimental results indicated that this method achieved a 91.54% F1 -score and 91.58% top -1 accuracy, outperforming existing approaches. This novel approach leverages deep architecture and conventional machine learning algorithms and provides a new horizon for musical genre classification tasks.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 50 条
  • [41] Active Learning Music Genre Classification Based on Support Vector Machine
    Deng G.
    Ko Y.C.
    Advances in Multimedia, 2022, 2022
  • [42] Stacked sparse auto-encoder for deep clustering
    Cai, Jinyu
    Wang, Shiping
    Guo, Wenzhong
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1532 - 1538
  • [43] A DEEP CONVOLUTIONAL AUTO-ENCODER WITH EMBEDDED CLUSTERING
    Alqahtani, A.
    Xie, X.
    Deng, J.
    Jones, M. W.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 4058 - 4062
  • [44] Creation of a Deep Convolutional Auto-Encoder in Caffe
    Turchenko, Volodymyr
    Luczak, Artur
    PROCEEDINGS OF THE 2017 9TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS), VOL 2, 2017, : 651 - 659
  • [45] Exploiting the Auto-Encoder Residual Error for Intrusion Detection
    Andresini, Giuseppina
    Appice, Annalisa
    Di Mauro, Nicola
    Loglisci, Corrado
    Malerba, Donato
    2019 4TH IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (EUROS&PW), 2019, : 281 - 290
  • [46] Audio Genre Classification Employing Support Vector Machine
    Mali, Gopal
    Mahajan, Shrinivas P.
    2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [47] Image Geo-Site Estimation Using Convolutional Auto-Encoder and Multi-Label Support Vector Machine
    Jain, Arpit
    Verma, Chaman
    Kumar, Neerendra
    Raboaca, Maria Simona
    Baliya, Jyoti Narayan
    Suciu, George
    INFORMATION, 2023, 14 (01)
  • [48] Classification of power loads based on an improved denoising deconvolutional auto-encoder
    Wu, Jianhua
    Liu, Jiahan
    Ma, Jian
    Chen, Kexu
    Xu, Chunhua
    APPLIED SOFT COMPUTING, 2020, 87 (87)
  • [49] Auto-encoder Based for High Spectral Dimensional Data Classification and Visualization
    Zhu, Jiang
    Wu, Lingda
    Hao, Hongxing
    Song, Xiaorui
    Lu, Yi
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 350 - 354
  • [50] Efficient Feature Coding Based on Auto-encoder Network for Image Classification
    Xie, Guo-Sen
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 628 - 642