Transcription of Polyphonic Vocal Music with a Repetitive Melodic Structure

被引：2

作者：

Bohak, Ciril ^{[1
]}

Marolt, Matija ^{[1
]}

机构：

[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia

来源：

JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2016年 / 64卷 / 09期

关键词：

AUTOMATIC TRANSCRIPTION;

D O I：

10.17743/jaes.2016.0033

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a novel method for transcription of folk music that exploits its specifics to improve transcription accuracy. In contrast to most commercial music, folk music recordings may contain various inaccuracies as they are usually performed by amateur musicians and recorded in the field. If we use standard approaches for transcription, these inaccuracies are reflected in erroneous pitch estimates. On the other hand, the structure of western folk music is usually simple as songs are often composed of repeated melodic parts. In our approach we make use of these repetitions to increase transcription robustness and improve its accuracy. The proposed method fuses three sources of information: (1) frame-based multiple FO estimates, (2) song structure, and (3) pitch drift estimates. It first selects a representative segment of the analyzed song and aligns all the other segments to it considering temporal as well as frequency deviations. Information from all segments is summarized and used in a two-layer probabilistic model based on explicit duration HMMs, to segment frame-based information into notes. The method is evaluated with state-of-the-art transcription methods where we show that significant improvement in accuracy can be achieved.

引用

页码：664 / 672

页数：9

共 50 条

[31] Automatic transcription of polyphonic music using the multiresolution Fourier Transform
Keren, R
Zeevi, YY
Chazan, D
MELECON '98 - 9TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2, 1998, : 654 - 657
[32] Instrument Learning and Sparse NMD for Automatic Polyphonic Music Transcription
Rizzi, Antonello
Antonelli, Mario
Luzi, Massimiliano
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1405 - 1415
[33] Automatic Transcription of Flamenco Singing From Polyphonic Music Recordings
Kroher, Nadine
Gomez, Emilia
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 901 - 913
[34] DRUM TRANSCRIPTION FROM POLYPHONIC MUSIC WITH RECURRENT NEURAL NETWORKS
Vogl, Richard
Dorfer, Matthias
Knees, Peter
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 201 - 205
[35] Application of Auditory Filter-Banks in Polyphonic Music Transcription
Velazquez Lopez, Omar
Oropeza Rodriguez, Jose Luis
Suarez Guerra, Sergio
COMPUTACION Y SISTEMAS, 2022, 26 (04): : 1421 - 1428
[36] Melodic Shape Stylization for Robust and Efficient Motif Detection in Hindustani Vocal Music
Ganguli, Kaustuv Kanti
Lele, Ashwin
Pinjani, Saurabh
Rao, Preeti
Srinivasamurthy, Ajay
Gulati, Sankalp
2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
[37] DISCOVERY OF REPEATED VOCAL PATTERNS IN POLYPHONIC AUDIO: A CASE STUDY ON FLAMENCO MUSIC
Kroher, Nadine
Pikrakis, Aggelos
Moreno, Jesus
Diaz-Banez, Jose-Miguel
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 41 - 45
[38] Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals
Reddy, Gurunath M.
Rao, K. Sreenivasa
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3309 - 3313
[39] Emotional expression in music:: Effects of melodic structure and performance
Lindström, E
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 391 - 391
[40] On the Effect of Memory Width in Automatic Transcription Systems for Polyphonic Piano Music
Costantini, Giovanni
Todisco, Massimiliano
Saggio, Giovanni
IMCIC'11: THE 2ND INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I, 2011, : 124 - 127

← 1 2 3 4 5 →