Transcription of Polyphonic Vocal Music with a Repetitive Melodic Structure

Cited by: 2
Authors
Bohak, Ciril [1 ]
Marolt, Matija [1 ]
Affiliations
[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia
Source
Keywords
AUTOMATIC TRANSCRIPTION;
DOI
10.17743/jaes.2016.0033
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents a novel method for transcribing folk music that exploits the specific properties of such music to improve transcription accuracy. In contrast to most commercial music, folk music recordings may contain various inaccuracies, as they are usually performed by amateur musicians and recorded in the field. With standard transcription approaches, these inaccuracies lead to erroneous pitch estimates. On the other hand, the structure of Western folk music is usually simple, as songs are often composed of repeated melodic parts. Our approach uses these repetitions to make transcription more robust and more accurate. The proposed method fuses three sources of information: (1) frame-based multiple-F0 estimates, (2) song structure, and (3) pitch drift estimates. It first selects a representative segment of the analyzed song and aligns all other segments to it, taking both temporal and frequency deviations into account. Information from all segments is then summarized and used in a two-layer probabilistic model, based on explicit-duration HMMs, that segments the frame-based information into notes. The method is evaluated against state-of-the-art transcription methods, and the results show that a significant improvement in accuracy can be achieved.
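The idea of pooling evidence from repeated melodic parts can be illustrated with a minimal Python sketch. It is not the authors' method: it assumes a single pitch track per repetition (rather than multiple-F0 estimates), assumes the repetitions are already time-aligned frame by frame, models pitch drift as one constant offset per repetition, and uses a plain median in place of the two-layer explicit-duration HMM. The function name `fuse_repetitions` and its parameters are hypothetical.

```python
from statistics import median


def fuse_repetitions(segments, ref_index=0):
    """Fuse time-aligned pitch tracks of repeated melodic segments.

    segments: list of equal-length lists of MIDI pitch values,
              with None marking an unvoiced frame (assumed input format).
    ref_index: index of the representative segment (assumed given).
    """
    ref = segments[ref_index]
    corrected = []
    for seg in segments:
        # Estimate pitch drift as the median offset from the representative
        # segment over frames that are voiced in both tracks.
        diffs = [s - r for s, r in zip(seg, ref)
                 if s is not None and r is not None]
        drift = median(diffs) if diffs else 0.0
        corrected.append([None if s is None else s - drift for s in seg])

    fused = []
    for frame in zip(*corrected):
        voiced = [p for p in frame if p is not None]
        # Keep a frame only if more than half of the repetitions are voiced;
        # the median then suppresses isolated pitch errors.
        fused.append(median(voiced) if 2 * len(voiced) > len(frame) else None)
    return fused


# Three repetitions of the same phrase: the second is sung about half a
# semitone sharp, the third contains one outlier frame.
tracks = [
    [60, 60, 62, None],
    [60.5, 60.5, 62.5, 64.5],
    [59.8, 67.0, 61.8, None],
]
fused_track = fuse_repetitions(tracks)
```

In this toy input, the outlier in the third repetition is outvoted by the other two, and the last frame is dropped because only one repetition is voiced there.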
Pages: 664-672
Page count: 9