Tonal representations for music retrieval: from version identification to query-by-humming

被引:37
|
作者
Salamon, Justin [1 ]
Serra, Joan [2 ]
Gomez, Emilia [1 ]
机构
[1] Univ Pompeu Fabra, Mus Technol Grp, Barcelona, Spain
[2] Spanish Natl Res Council, Artificial Intelligence Res Inst IIIA CSIC, Bellaterra, Spain
关键词
Music similarity; Version identification; Cover song detection; Query by humming; Melody extraction; Bass line; Harmony; Music retrieval;
D O I
10.1007/s13735-012-0026-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study we compare the use of different music representations for retrieving alternative performances of the same musical piece, a task commonly referred to as version identification. Given the audio signal of a song, we compute descriptors representing its melody, bass line and harmonic progression using state-of-the-art algorithms. These descriptors are then employed to retrieve different versions of the same musical piece using a dynamic programming algorithm based on nonlinear time series analysis. First, we evaluate the accuracy obtained using individual descriptors, and then we examine whether performance can be improved by combining these music representations (i.e. descriptor fusion). Our results show that whilst harmony is themost reliablemusic representation for version identification, the melody and bass line representations also carry useful information for this task. Furthermore, we show that by combining these tonal representations we can increase version detection accuracy. Finally, we demonstrate how the proposed version identification method can be adapted for the task of query-by-humming. We propose a melody-based retrieval approach, and demonstrate how melody representations extracted from recordings of a cappella singing can be successfully used to retrieve the original song from a collection of polyphonic audio. The current limitations of the proposed approach are discussed in the context of version identification and query-by-humming, and possible solutions and future research directions are proposed.
引用
收藏
页码:45 / 58
页数:14
相关论文
共 11 条
  • [1] A robust music retrieval method for query-by-humming
    Zhu, YW
    Kankanhalli, MS
    ITRE2003: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, 2003, : 89 - 93
  • [2] Music Information Retrieval using Query-by-Humming based on the Dynamic Time Warping
    Putri, Rifki Afina
    Lestari, Dessi Puji
    5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 65 - 70
  • [3] A peer-to-peer music sharing system based on query-by-humming
    Wang, Jianrong
    Chang, Xinglong
    Zhao, Zheng
    Zhang, Yebin
    Shi, Qingwei
    NEXT-GENERATION COMMUNICATION AND SENSOR NETWORKS 2007, 2007, 6773
  • [4] A repeating pattern based Query-by-Humming fuzzy system for polyphonic melody retrieval
    Huang, Yo-Ping
    Lai, Shin-Liang
    Sandnes, Frode Eika
    APPLIED SOFT COMPUTING, 2015, 33 : 197 - 206
  • [5] Query by Singing/Humming (QbSH) System for Polyphonic Music Retrieval
    Park, Sungjoo
    Chung, Kwangsue
    2012 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2012, : 245 - +
  • [6] Query by humming: Automatically building the database from music recordings
    Rocamora, Martin
    Cancela, Pablo
    Pardo, Alvaro
    PATTERN RECOGNITION LETTERS, 2014, 36 : 272 - 280
  • [7] Music signal spotting retrieval by a humming query using model driven path continuous dynamic program
    Hashiguchi, Hiroki
    Nishimura, Takuichi
    Zhang, J. Xin
    Takita, Junko
    Oka, Ryuichi
    Systems and Computers in Japan, 2007, 38 (10): : 95 - 104
  • [8] INVESTIGATING THE EFFICACY OF MUSIC VERSION RETRIEVAL SYSTEMS FOR SETLIST IDENTIFICATION
    Yesiler, Furkan
    Molina, Emilio
    Serra, Joan
    Gomez, Emilia
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 541 - 545
  • [9] From Music Scores to Audio Recordings: Deep Pitch-Class Representations for Measuring Tonal Structures
    Weiss, Christof
    Mueller, Meinard
    ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2024, 17 (03):
  • [10] A Combined Model of Sensory and Cognitive Representations Underlying Tonal Expectations in Music: From Audio Signals to Behavior
    Collins, Tom
    Tillmann, Barbara
    Barrett, Frederick S.
    Delbe, Charles
    Janata, Petr
    PSYCHOLOGICAL REVIEW, 2014, 121 (01) : 33 - 65