Deep learning of chroma representation for cover song identification in compression domain

被引:0
|
作者
Jiunn-Tsair Fang
Yu-Ruey Chang
Pao-Chi Chang
机构
[1] Ming Chuan University,Department of Electronic Engineering
[2] National Central University,Department of Communication Engineering
关键词
Cover song; Music retrieval; Sparse autoencoder; Descriptor; Advanced audio coding;
D O I
暂无
中图分类号
学科分类号
摘要
Methods for identifying a cover song typically involve comparing the similarity of chroma features between the query song and another song in the data set. However, considerable time is required for pairwise comparisons. In addition, to save disk space, most songs stored in the data set are in a compressed format. Therefore, to eliminate some decoding procedures, this study extracted music information directly from the modified discrete cosine transform coefficients of advanced audio coding and then mapped these coefficients to 12-dimensional chroma features. The chroma features were segmented to preserve the melodies. Each chroma feature segment was trained and learned by a sparse autoencoder, a deep learning architecture of artificial neural networks. The deep learning procedure was to transform chroma features into an intermediate representation for dimension reduction. Experimental results from a covers80 data set showed that the mean reciprocal rank increased to 0.5 and the matching time was reduced by over 94% compared with traditional approaches.
引用
收藏
页码:887 / 902
页数:15
相关论文
共 50 条
  • [21] Similarity fusion scheme for cover song identification
    Chen, Ning
    Xiao, Hai-dong
    ELECTRONICS LETTERS, 2016, 52 (13) : 1173 - 1174
  • [22] Training audio transformers for cover song identification
    Zeng, Te
    Lau, Francis C. M.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [23] A HEURISTIC FOR DISTANCE FUSION IN COVER SONG IDENTIFICATION
    Degani, Alessio
    Dalai, Marco
    Leonardi, Riccardo
    Migliorati, Pierangelo
    2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS), 2013,
  • [24] Fusing similarity functions for cover song identification
    Ning Chen
    Wei Li
    Haidong Xiao
    Multimedia Tools and Applications, 2018, 77 : 2629 - 2652
  • [25] Training audio transformers for cover song identification
    Te Zeng
    Francis C. M. Lau
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [26] Cross recurrence quantification for cover song identification
    Serra, Joan
    Serra, Xavier
    Andrzejak, Ralph G.
    NEW JOURNAL OF PHYSICS, 2009, 11
  • [27] Fusing similarity functions for cover song identification
    Chen, Ning
    Li, Wei
    Xiao, Haidong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (02) : 2629 - 2652
  • [28] Cover Song Identification by Sequence Alignment Algorithms
    Wang, Chih-Li
    Zhong, Qian
    Wang, Szu-Ying
    Roychowdhury, Vwani
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [29] LEARN A ROBUST REPRESENTATION FOR COVER SONG IDENTIFICATION VIA AGGREGATING LOCAL AND GLOBAL MUSIC TEMPORAL CONTEXT
    Jiang, Chaoya
    Yang, Deshun
    Chen, Xiaoou
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [30] Deep Learning Face Representation by Joint Identification-Verification
    Sun, Yi
    Chen, Yuheng
    Wang, Xiaogang
    Tang, Xiaoou
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27