Melody transcription from music audio: Approaches and evaluation

Cited by: 86

Authors:

Poliner, Graham E. [1]

Ellis, Daniel P. W.

Ehmann, Andreas F.

Gomez, Emilia

Streich, Sebastian

Ong, Beesuan

Affiliations:

[1] Columbia Univ, Dept Elect Engn, LabROSA, New York, NY 10027 USA

[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA

[3] Univ Pompeu Fabra, Mus Technol Grp, Barcelona 08002, Spain

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007 / Vol. 15 / No. 04

Funding:

U.S. National Science Foundation; Andrew Mellon Foundation (USA);

Keywords:

audio; evaluation; melody transcription; music;

DOI:

10.1109/TASL.2006.889797

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Although the process of analyzing an audio recording of a music performance is complex and difficult even for a human listener, there are limited forms of information that may be tractably extracted and yet still enable interesting applications. We discuss melody (roughly, the part a listener might whistle or hum) as one such reduced descriptor of music audio, and consider how to define it, and what use it might be. We go on to describe the results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results. For our definition of melody, current systems can achieve around 70% correct transcription at the frame level, including distinguishing between the presence or absence of the melody. Melodies transcribed at this level are readily recognizable, and show promise for practical applications.

Pages: 1247-1256

Page count: 10

50 items in total

[21] An Evaluation of Different Evolutionary Approaches Applied in the Process of Automatic Transcription of Music Scores into Tablatures
Ramos, Joao Victor
Ramos, Andre Stylianos
Silla, Carlos N., Jr.
Sanches, Danilo Sipoli
2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016 : 663 - 669
[22] Audio-Based Melody Categorization: Exploring Signal Representations and Evaluation Strategies
Kroher, Nadine
Diaz-Banez, Jose-Miguel
COMPUTER MUSIC JOURNAL, 2018, 41 (04) : 64 - 82
[23] Visualizing similarity among estimated melody sequences from musical audio
Hashiguchi, Hiroki
GRAMMAR OF TECHNOLOGY DEVELOPMENT, 2008 : 213 - 221
[24] MELODY LINE ESTIMATION IN HOMOPHONIC MUSIC AUDIO SIGNALS BASED ON TEMPORAL-VARIABILITY OF MELODIC SOURCE
Tachibana, Hideyuki
Ono, Takuma
Ono, Nobutaka
Sagayama, Shigeki
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010 : 425 - 428
[25] Melody on the Threshold in Spectral Music
Donaldson, James
MUSIC THEORY ONLINE, 2021, 27 (02)
[26] TOWARDS END-TO-END POLYPHONIC MUSIC TRANSCRIPTION: TRANSFORMING MUSIC AUDIO DIRECTLY TO A SCORE
Correa Carvalho, Ralf Gunter
Smaragdis, Paris
2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017 : 151 - 155
[27] MELODY IS SEPARABLE FROM RHYTHM IN MUSIC DISCRIMINATION - EVIDENCE FROM NEUROPSYCHOLOGY
PERETZ, I
KOLINSKY, R
BULLETIN OF THE PSYCHONOMIC SOCIETY, 1991, 29 (06) : 476 - 476
[28] Audio-to-Score Alignment Using Deep Automatic Music Transcription
Simonetta, Federico
Ntalampiras, Stavros
Avanzini, Federico
IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021
[29] Automatic Piano Music Transcription Using Audio-Visual Features
Wan Yulong
Wang Xianliang
Zhou Ruohua
Yan Yonghong
CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 596 - 603
