Deep learning of chroma representation for cover song identification in compression domain

被引:0
|
作者
Jiunn-Tsair Fang
Yu-Ruey Chang
Pao-Chi Chang
机构
[1] Ming Chuan University,Department of Electronic Engineering
[2] National Central University,Department of Communication Engineering
关键词
Cover song; Music retrieval; Sparse autoencoder; Descriptor; Advanced audio coding;
D O I
暂无
中图分类号
学科分类号
摘要
Methods for identifying a cover song typically involve comparing the similarity of chroma features between the query song and another song in the data set. However, considerable time is required for pairwise comparisons. In addition, to save disk space, most songs stored in the data set are in a compressed format. Therefore, to eliminate some decoding procedures, this study extracted music information directly from the modified discrete cosine transform coefficients of advanced audio coding and then mapped these coefficients to 12-dimensional chroma features. The chroma features were segmented to preserve the melodies. Each chroma feature segment was trained and learned by a sparse autoencoder, a deep learning architecture of artificial neural networks. The deep learning procedure was to transform chroma features into an intermediate representation for dimension reduction. Experimental results from a covers80 data set showed that the mean reciprocal rank increased to 0.5 and the matching time was reduced by over 94% compared with traditional approaches.
引用
收藏
页码:887 / 902
页数:15
相关论文
共 50 条
  • [31] Author Identification Using Chaos Game Representation and Deep Learning
    Stoean, Catalin
    Lichtblau, Daniel
    MATHEMATICS, 2020, 8 (11) : 1 - 19
  • [32] Video forensic compression with chroma and luminance domain information fusion for surveillance videos
    Gong, Yanchao
    Wang, Zilin
    Yang, Kaifang
    Liu, Ying
    Lim, Kengpang
    Wang, Fuping
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2024, 56 (05): : 46 - 55
  • [33] Music Genre Classification Based on Chroma Features and Deep Learning
    Shi, Leisi
    Li, Chen
    Tian, Lihua
    2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2019, : 81 - 86
  • [34] Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation
    Liu, Yen-Cheng
    Yeh, Yu-Ying
    Fu, Tzu-Chien
    Wang, Sheng-De
    Chiu, Wei-Chen
    Wang, Yu-Chiang Frank
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8867 - 8876
  • [35] Zero-Shot Deep Domain Adaptation With Common Representation Learning
    Kutbi, Mohammed
    Peng, Kuan-Chuan
    Wu, Ziyan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3909 - 3924
  • [36] Crop Identification Using Deep Learning on LUCAS Crop Cover Photos
    Yordanov, Momchil
    d'Andrimont, Raphael
    Martinez-Sanchez, Laura
    Lemoine, Guido
    Fasbender, Dominique
    van der Velde, Marijn
    SENSORS, 2023, 23 (14)
  • [37] Multimodal crop cover identification using deep learning and remote sensing
    Zeeshan Ramzan
    H. M. Shahzad Asif
    Muhammad Shahbaz
    Multimedia Tools and Applications, 2024, 83 : 33141 - 33159
  • [38] A NOVEL CHROMA REPRESENTATION FOR IMPROVED HDR VIDEO COMPRESSION EFFICIENCY USING THE HEVC STANDARD
    Azimi, Maryam
    Nasiopoulos, Panos
    Pourazad, Mahsa T.
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1211 - 1215
  • [39] Multimodal crop cover identification using deep learning and remote sensing
    Ramzan, Zeeshan
    Asif, H. M. Shahzad
    Shahbaz, Muhammad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 33141 - 33159
  • [40] Enhancing cover song identification with hierarchical rank aggregation
    Osmalskyj, Julien
    Van Droogenbroeck, Marc
    Embrechts, Jean-Jaques
    Proceedings of the 17th International Society for Music Information Retrieval Conference, ISMIR 2016, 2016, : 136 - 142