Deep learning of chroma representation for cover song identification in compression domain

被引:0
|
作者
Jiunn-Tsair Fang
Yu-Ruey Chang
Pao-Chi Chang
机构
[1] Ming Chuan University,Department of Electronic Engineering
[2] National Central University,Department of Communication Engineering
关键词
Cover song; Music retrieval; Sparse autoencoder; Descriptor; Advanced audio coding;
D O I
暂无
中图分类号
学科分类号
摘要
Methods for identifying a cover song typically involve comparing the similarity of chroma features between the query song and another song in the data set. However, considerable time is required for pairwise comparisons. In addition, to save disk space, most songs stored in the data set are in a compressed format. Therefore, to eliminate some decoding procedures, this study extracted music information directly from the modified discrete cosine transform coefficients of advanced audio coding and then mapped these coefficients to 12-dimensional chroma features. The chroma features were segmented to preserve the melodies. Each chroma feature segment was trained and learned by a sparse autoencoder, a deep learning architecture of artificial neural networks. The deep learning procedure was to transform chroma features into an intermediate representation for dimension reduction. Experimental results from a covers80 data set showed that the mean reciprocal rank increased to 0.5 and the matching time was reduced by over 94% compared with traditional approaches.
引用
收藏
页码:887 / 902
页数:15
相关论文
共 50 条
  • [41] CoverHunter: Cover Song Identification with Refined Attention and Alignments
    Liu, Feng
    Tuo, Deyi
    Xu, Yinan
    Han, Xintong
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1080 - 1085
  • [42] Time complexity evaluation of cover song identification algorithms
    Ferreira, Martha Dais
    de Mello, Rodrigo Fernandes
    Applied Acoustics, 2021, 175
  • [43] Time complexity evaluation of cover song identification algorithms
    Ferreira, Martha Dais
    de Mello, Rodrigo Fernandes
    APPLIED ACOUSTICS, 2021, 175
  • [44] EFFECTIVE COVER SONG IDENTIFICATION BASED ON SKIPPING BIGRAMS
    Xu, Xiaoshuo
    Chen, Xiaoou
    Yang, Deshun
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 96 - 100
  • [45] Cochlear pitch class profile for cover song identification
    Chen, Ning
    Downie, J. Stephen
    Xiao, Hai-dong
    Zhu, Yu
    APPLIED ACOUSTICS, 2015, 99 : 92 - 96
  • [46] Enhanced Feature Summarizing for Effective Cover Song Identification
    Hu, Jingyi
    Chen, Ning
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2113 - 2126
  • [47] Improved similarity fusion scheme for cover song identification
    Fan, Yanlan
    Chen, Ning
    ELECTRONICS LETTERS, 2018, 54 (24) : 1403 - 1404
  • [48] Learning domain invariant and specific representation for cross-domain person re-identification
    Chong, Yanwen
    Peng, Chengwei
    Zhang, Chen
    Wang, Yujie
    Feng, Wenqiang
    Pan, Shaoming
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5219 - 5232
  • [49] Learning domain invariant and specific representation for cross-domain person re-identification
    Yanwen Chong
    Chengwei Peng
    Chen Zhang
    Yujie Wang
    Wenqiang Feng
    Shaoming Pan
    Applied Intelligence, 2021, 51 : 5219 - 5232
  • [50] Identification of plant vacuole proteins by exploiting deep representation learning features
    Jiao, Shihu
    Zou, Quan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2921 - 2927