A Query-by-Singing System for Retrieving Karaoke Music

被引:26
|
作者
Yu, Hung-Ming [1 ]
Tsai, Wei-Ho [2 ,3 ]
Wang, Hsin-Min
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taipei Univ Technol, Dept Elect Engn, Taipei, Taiwan
[3] Natl Taipei Univ Technol, Grad Inst Comp & Commun Engn, Taipei, Taiwan
关键词
Bayesian information criterion; dynamic time warping; karaoke; music information retrieval; query-by-singing;
D O I
10.1109/TMM.2008.2007345
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the problem of retrieving karaoke music using query-by-singing techniques. Unlike regular CD music, where the stereo sound involves two audio channels that usually sound the same, karaoke music encompasses two distinct channels in each track: one is a mixture of the lead vocals and background accompaniment, and the other consists of accompaniment only. Although the two audio channels are distinct, the accompaniments in the two channels often resemble each other. We exploit this characteristic to: i) infer the background accompaniment for the lead vocals from the accompaniment-only channel, so that the main melody underlying the lead vocals can be extracted more effectively; and ii) detect phrase onsets based on the Bayesian information criterion (BIC) to predict the onset points of a song where a user's sung query may begin, so that the similarity between the melodies of the query and the song can be examined more efficiently. To further refine extraction of the main melody, we propose correcting potential errors in the estimated sung notes by exploiting a composition characteristic of popular songs whereby the sung notes within a verse or chorus section usually vary no more than two octaves. In addition, to facilitate an efficient and accurate search of a large music database, we employ multiple-pass dynamic time warping (DTW) combined with multiple-level data abstraction (MLDA) to compare the similarities of melodies. The results of experiments conducted on a karaoke database comprised of 1071 popular songs demonstrate the feasibility of query-by-singing retrieval for karaoke music.
引用
收藏
页码:1626 / 1637
页数:12
相关论文
共 50 条
  • [1] A music retrieval system based on query-by-singing for Karaoke jukebox
    Yu, Hung-Ming
    Tsai, Wei-Ho
    Wang, Hsin-Min
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 445 - 459
  • [2] A query-by-singing technique for retrieving polyphonic objects of popular music
    Yu, HM
    Tsai, WH
    Wang, HM
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 439 - 453
  • [3] Multi-Classifier Based on a Query-by-Singing/Humming System
    Nam, Gi Pyo
    Park, Kang Ryoung
    SYMMETRY-BASEL, 2015, 7 (02): : 994 - 1016
  • [4] IMPLEMENTATION OF A MATCHING ENGINE FOR A PRACTICAL QUERY-BY-SINGING/HUMMING SYSTEM
    Jang, Dalwon
    Song, Chai-Jong
    Shin, Saim
    Park, Sung-Joo
    Jang, Sei-Jin
    Lee, Seok-Pil
    2011 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2011, : 258 - 263
  • [5] Test of pitch extraction algorithms for query-by-singing/humming system
    Jang, Dalwon
    Jang, Sei-Jin
    Lee, Seok-Pil
    2012 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2012,
  • [6] Robust Query-by-Singing/Humming System against Background Noise Environments
    Kim, Kichul
    Park, Kang Ryoung
    Park, Sung-Joo
    Lee, Soek-Pil
    Kim, Moo Young
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (02) : 720 - 725
  • [7] Implementation of a Practical Query-by-Singing/Humming (QbSH) System and Its Commercial Applications
    Song, Chai-Jong
    Park, Hochong
    Yang, Chang-Mo
    Jang, Sei-Jin
    Lee, Seok-Pil
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2013, 59 (02) : 407 - 414
  • [8] A Design of Matching Engine for a Practical Query-by-Singing/Humming System with Polyphonic Recordings
    Lee, Seok-Pil
    Yoo, Hoon
    Jang, Dalwon
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2014, 8 (02): : 723 - 736
  • [9] Implementation of a Practical Query-by-Singing/Humming (QbSH) System and Its Commercial Applications
    Song, Chai-Jong
    Park, Hochong
    Yang, Chang-Mo
    Jang, Sei-Jin
    Lee, Seok-Phil
    2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 102 - +
  • [10] Improving Query-by-Singing/Humming by Combining Melody and Lyric Information
    Wang, Chung-Che
    Jang, Jyh-Shing Roger
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 798 - 806