Probabilistic approach to automatic music transcription from audio signals

被引：0

作者：

Miyamoto, Kenichi ^{[1
]}

Kameoka, Hirokazu ^{[1
]}

Takeda, Haruto ^{[1
]}

Nishimoto, Takuya ^{[1
]}

Sagayama, Shigeki ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1138656, Japan

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3 | 2007年

关键词：

music transcription; harmonic-temporal-structured clustering; rhythm estimation; HMM;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We discuss automatic music transcription from audio input to music score by integrating our probabilistic approaches to multipitch spectral analysis, rhythm recognition and tempo estimation. In spectral analysis, acoustic energies in spectrogram are clustered into acoustic objects (i.e., music notes) with our method called Harmonic-Temporal-structured Clustering (HTC) utilizing EM algorithm over a structured Gaussian mixture with constraints of harmonic structure and temporal smoothness. After onset and offset timings are found from separated energies of music notes through note power envelope modeling to obtain the piano-roll representation, the rhythm and tempo are simultaneously recognized and estimated in terms of maximum posterior probability given a probabilistic note duration models with HMM (Hidden Markov Model) and probabilistic "rhythm vocabulary." Variable tempo is also modeled by a smooth analytic curve. Rhythm recognition and tempo estimation is alternately performed to iteratively maximize the joint posterior probability. Experimental results are also shown.

引用

页码：697 / +

页数：2

共 50 条

[1] Automatic music transcription and audio source separation
Plumbley, MD
Abdallah, SA
Bello, JP
Davies, ME
Monti, G
Sandler, MB
CYBERNETICS AND SYSTEMS, 2002, 33 (06) : 603 - 627
[2] A Multimodal Approach for Percussion Music Transcription from Audio and Video
Marenco, Bernardo
Fuentes, Magdalena
Lanzaro, Florencia
Rocamora, Martin
Gomez, Alvaro
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2015, 2015, 9423 : 92 - 99
[3] Automatic mood detection and tracking of music audio signals
Lu, L
Liu, D
Zhang, HJ
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 5 - 18
[4] Software Tool for Audio Signal Analysis and Automatic Music Transcription
Chis, Lucian-Gheorghe
Marcu, Marius
Dragan, Florin
2018 IEEE 12TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI), 2018, : 497 - 501
[5] AUTOMATIC TRANSCRIPTION OF GUITAR TABLATURE FROM AUDIO SIGNALS IN ACCORDANCE WITH PLAYER'S PROFICIENCY
Yazawa, Kazuki
Itoyama, Katsutoshi
Okuno, Hiroshi G.
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[6] Audio-to-Score Alignment Using Deep Automatic Music Transcription
Simonetta, Federico
Ntalampiras, Stavros
Avanzini, Federico
IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
[7] Automatic Piano Music Transcription Using Audio-Visual Features
Wan Yulong
Wang Xianliang
Zhou Ruohua
Yan Yonghong
CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 596 - 603
[8] Automatic Piano Music Transcription Using Audio-Visual Features
WAN Yulong
WANG Xianliang
ZHOU Ruohua
YAN Yonghong
Chinese Journal of Electronics, 2015, 24 (03) : 596 - 603
[9] Automatic transcription of piano music using audio-vision fusion
Wan, Yulong
Wu, Zhigang
Zhou, Ruohua
Yan, Yonghong
MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 742 - +
[10] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
Gu, Xiangming
Ou, Longshen
Zeng, Wei
Zhang, Jianan
Wong, Nicholas
Wang, Ye
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)

← 1 2 3 4 5 →