Transcription of Polyphonic Vocal Music with a Repetitive Melodic Structure

被引:2
|
作者
Bohak, Ciril [1 ]
Marolt, Matija [1 ]
机构
[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia
来源
关键词
AUTOMATIC TRANSCRIPTION;
D O I
10.17743/jaes.2016.0033
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel method for transcription of folk music that exploits its specifics to improve transcription accuracy. In contrast to most commercial music, folk music recordings may contain various inaccuracies as they are usually performed by amateur musicians and recorded in the field. If we use standard approaches for transcription, these inaccuracies are reflected in erroneous pitch estimates. On the other hand, the structure of western folk music is usually simple as songs are often composed of repeated melodic parts. In our approach we make use of these repetitions to increase transcription robustness and improve its accuracy. The proposed method fuses three sources of information: (1) frame-based multiple FO estimates, (2) song structure, and (3) pitch drift estimates. It first selects a representative segment of the analyzed song and aligns all the other segments to it considering temporal as well as frequency deviations. Information from all segments is summarized and used in a two-layer probabilistic model based on explicit duration HMMs, to segment frame-based information into notes. The method is evaluated with state-of-the-art transcription methods where we show that significant improvement in accuracy can be achieved.
引用
收藏
页码:664 / 672
页数:9
相关论文
共 50 条
  • [31] Automatic transcription of polyphonic music using the multiresolution Fourier Transform
    Keren, R
    Zeevi, YY
    Chazan, D
    MELECON '98 - 9TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2, 1998, : 654 - 657
  • [32] Instrument Learning and Sparse NMD for Automatic Polyphonic Music Transcription
    Rizzi, Antonello
    Antonelli, Mario
    Luzi, Massimiliano
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1405 - 1415
  • [33] Automatic Transcription of Flamenco Singing From Polyphonic Music Recordings
    Kroher, Nadine
    Gomez, Emilia
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 901 - 913
  • [34] DRUM TRANSCRIPTION FROM POLYPHONIC MUSIC WITH RECURRENT NEURAL NETWORKS
    Vogl, Richard
    Dorfer, Matthias
    Knees, Peter
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 201 - 205
  • [35] Application of Auditory Filter-Banks in Polyphonic Music Transcription
    Velazquez Lopez, Omar
    Oropeza Rodriguez, Jose Luis
    Suarez Guerra, Sergio
    COMPUTACION Y SISTEMAS, 2022, 26 (04): : 1421 - 1428
  • [36] Melodic Shape Stylization for Robust and Efficient Motif Detection in Hindustani Vocal Music
    Ganguli, Kaustuv Kanti
    Lele, Ashwin
    Pinjani, Saurabh
    Rao, Preeti
    Srinivasamurthy, Ajay
    Gulati, Sankalp
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [37] DISCOVERY OF REPEATED VOCAL PATTERNS IN POLYPHONIC AUDIO: A CASE STUDY ON FLAMENCO MUSIC
    Kroher, Nadine
    Pikrakis, Aggelos
    Moreno, Jesus
    Diaz-Banez, Jose-Miguel
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 41 - 45
  • [38] Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals
    Reddy, Gurunath M.
    Rao, K. Sreenivasa
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3309 - 3313
  • [39] Emotional expression in music:: Effects of melodic structure and performance
    Lindström, E
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 391 - 391
  • [40] On the Effect of Memory Width in Automatic Transcription Systems for Polyphonic Piano Music
    Costantini, Giovanni
    Todisco, Massimiliano
    Saggio, Giovanni
    IMCIC'11: THE 2ND INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I, 2011, : 124 - 127