Melody transcription from music audio:: Approaches and evaluation

被引:86
|
作者
Poliner, Graham E. [1 ]
Ellis, Daniel P. W.
Ehmann, Andreas F.
Gomez, Emilia
Streich, Sebastian
Ong, Beesuan
机构
[1] Columbia Univ, Dept Elect Engn, LabROSA, New York, NY 10027 USA
[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
[3] Univ Pompeu Fabra, Mus Techol Grp, Barcelona 08002, Spain
基金
美国国家科学基金会; 美国安德鲁·梅隆基金会;
关键词
audio; evaluation; melody transcription; music;
D O I
10.1109/TASL.2006.889797
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Although the process of analyzing an audio recording of a music performance is complex and difficult even for a human listener, there are limited forms of information that may be tractably extracted and yet still enable interesting applications. We discuss melody-roughly, the part a listener might whistle or hum-as one such reduced descriptor of music audio, and consider how to define it, and what use it might be. We go on to describe the results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results. For our definition of melody, current systems can achieve around 70% correct transcription at the frame level, including distinguishing between the presence or absence of the melody. Melodies transcribed at this level are readily recognizable, and show promise for practical applications.
引用
收藏
页码:1247 / 1256
页数:10
相关论文
共 50 条
  • [21] An Evaluation of Different Evolutionary Approaches Applied in the Process of Automatic Transcription of Music Scores into Tablatures
    Ramos, Joao Victor
    Ramos, Andre Stylianos
    Silla, Carlos N., Jr.
    Sanches, Danilo Sipoli
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 663 - 669
  • [22] Audio-Based Melody Categorization: Exploring Signal Representations and Evaluation Strategies
    Kroher, Nadine
    Diaz-Banez, Jose-Miguel
    COMPUTER MUSIC JOURNAL, 2018, 41 (04) : 64 - 82
  • [23] Visualizing similarity among estimated melody sequences from musical audio
    Hashiguchi, Hiroki
    GRAMMAR OF TECHNOLOGY DEVELOPMENT, 2008, : 213 - 221
  • [24] MELODY LINE ESTIMATION IN HOMOPHONIC MUSIC AUDIO SIGNALS BASED ON TEMPORAL-VARIABILITY OF MELODIC SOURCE
    Tachibana, Hideyuki
    Ono, Takuma
    Ono, Nobutaka
    Sagayama, Shigeki
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 425 - 428
  • [25] Melody on the Threshold in Spectral Music
    Donaldson, James
    MUSIC THEORY ONLINE, 2021, 27 (02):
  • [26] TOWARDS END-TO-END POLYPHONIC MUSIC TRANSCRIPTION: TRANSFORMING MUSIC AUDIO DIRECTLY TO A SCORE
    Correa Carvalho, Ralf Gunter
    Smaragdis, Paris
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 151 - 155
  • [27] MELODY IS SEPARABLE FROM RHYTHM IN MUSIC DISCRIMINATION - EVIDENCE FROM NEUROPSYCHOLOGY
    PERETZ, I
    KOLINSKY, R
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1991, 29 (06) : 476 - 476
  • [28] Audio-to-Score Alignment Using Deep Automatic Music Transcription
    Simonetta, Federico
    Ntalampiras, Stavros
    Avanzini, Federico
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [29] Automatic Piano Music Transcription Using Audio-Visual Features
    Wan Yulong
    Wang Xianliang
    Zhou Ruohua
    Yan Yonghong
    CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 596 - 603
  • [30] Automatic Piano Music Transcription Using Audio-Visual Features
    WAN Yulong
    WANG Xianliang
    ZHOU Ruohua
    YAN Yonghong
    Chinese Journal of Electronics, 2015, 24 (03) : 596 - 603