Transcription of Polyphonic Vocal Music with a Repetitive Melodic Structure

被引:2
|
作者
Bohak, Ciril [1 ]
Marolt, Matija [1 ]
机构
[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia
来源
关键词
AUTOMATIC TRANSCRIPTION;
D O I
10.17743/jaes.2016.0033
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel method for transcription of folk music that exploits its specifics to improve transcription accuracy. In contrast to most commercial music, folk music recordings may contain various inaccuracies as they are usually performed by amateur musicians and recorded in the field. If we use standard approaches for transcription, these inaccuracies are reflected in erroneous pitch estimates. On the other hand, the structure of western folk music is usually simple as songs are often composed of repeated melodic parts. In our approach we make use of these repetitions to increase transcription robustness and improve its accuracy. The proposed method fuses three sources of information: (1) frame-based multiple FO estimates, (2) song structure, and (3) pitch drift estimates. It first selects a representative segment of the analyzed song and aligns all the other segments to it considering temporal as well as frequency deviations. Information from all segments is summarized and used in a two-layer probabilistic model based on explicit duration HMMs, to segment frame-based information into notes. The method is evaluated with state-of-the-art transcription methods where we show that significant improvement in accuracy can be achieved.
引用
收藏
页码:664 / 672
页数:9
相关论文
共 50 条
  • [22] Transcription and separation of drum signals from polyphonic music
    Gillet, Olivier
    Richard, Gael
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 529 - 540
  • [23] Vocal Pitch Extraction in Polyphonic Music using Convolutional Residual Network
    Dong, Mingye
    Wu, Jie
    Luan, Jian
    INTERSPEECH 2019, 2019, : 2010 - 2014
  • [24] An Effective Approach for Vocal Melody Extraction from Polyphonic Music on GPU
    Yao, Guangchao
    Zheng, Yao
    Xiao, Limin
    Ruan, Li
    Lin, Zhen
    Peng, Junjie
    NETWORK AND PARALLEL COMPUTING, NPC 2013, 2013, 8147 : 284 - 297
  • [25] AUTOMATIC LYRICS ALIGNMENT AND TRANSCRIPTION IN POLYPHONIC MUSIC: DOES BACKGROUND MUSIC HELP?
    Gupta, Chitralekha
    Yilmaz, Emre
    Li, Haizhou
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 496 - 500
  • [26] Computationally inexpensive and effective scheme for automatic transcription of polyphonic music
    Lao, WL
    Tan, ET
    Kam, AH
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1775 - 1778
  • [27] AUTOMATIC TRANSCRIPTION OF PITCHED AND UNPITCHED SOUNDS FROM POLYPHONIC MUSIC
    Benetos, Emmanouil
    Ewert, Sebastian
    Weyde, Tillman
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [28] Automatic transcription of melody, bass line, and chords in polyphonic music
    Ryynanen, Matti P.
    Klapuri, Anssi P.
    COMPUTER MUSIC JOURNAL, 2008, 32 (03) : 72 - 86
  • [29] Non-negative matrix factorization for polyphonic music transcription
    Smaragdis, P
    Brown, JC
    2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, : 177 - 180
  • [30] POLYPHONIC MUSIC TRANSCRIPTION USING NOTE ONSET AND OFFSET DETECTION
    Benetos, Emmanouil
    Dixon, Simon
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 37 - 40