Probabilistic approach to automatic music transcription from audio signals

Cited by: 0
Authors
Miyamoto, Kenichi [1]
Kameoka, Hirokazu [1]
Takeda, Haruto [1]
Nishimoto, Takuya [1]
Sagayama, Shigeki [1]
Affiliation
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1138656, Japan
Source
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3 | 2007
Keywords
music transcription; harmonic-temporal-structured clustering; rhythm estimation; HMM
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We discuss automatic music transcription from audio input to a music score by integrating our probabilistic approaches to multipitch spectral analysis, rhythm recognition, and tempo estimation. In spectral analysis, the acoustic energies in the spectrogram are clustered into acoustic objects (i.e., music notes) with our method called Harmonic-Temporal-structured Clustering (HTC), which applies the EM algorithm to a structured Gaussian mixture constrained by harmonic structure and temporal smoothness. Onset and offset timings are then found from the separated energies of the music notes through note power envelope modeling, which yields the piano-roll representation. The rhythm is recognized and the tempo is estimated simultaneously in terms of maximum posterior probability, given probabilistic note duration models based on an HMM (Hidden Markov Model) and a probabilistic "rhythm vocabulary." Variable tempo is also modeled by a smooth analytic curve. Rhythm recognition and tempo estimation are performed alternately to iteratively maximize the joint posterior probability. Experimental results are also shown.
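The idea of running EM over a Gaussian mixture with a harmonic constraint can be illustrated with a much smaller problem than the one the abstract describes. The sketch below is not the authors' HTC: it assumes a single spectral frame, a single sound source, a log-frequency axis, and a fixed shared variance, and it omits the temporal smoothness constraint entirely. The function names `harmonic_em` and `synth_spectrum`, the parameter defaults, and the synthetic test spectrum are all hypothetical choices made for illustration.

```python
import math

def harmonic_em(log_freqs, energies, f0_init, n_harmonics=4, sigma=0.05, n_iter=30):
    """EM for a 1-D Gaussian mixture whose means are tied to one pitch.

    Component n has mean mu + log(n), so only mu (the log fundamental)
    and the per-harmonic weights are free. sigma is fixed and shared
    (a simplifying assumption), so its normalizing constant cancels.
    """
    mu = math.log(f0_init)
    weights = [1.0 / n_harmonics] * n_harmonics
    offsets = [math.log(n) for n in range(1, n_harmonics + 1)]
    for _ in range(n_iter):
        mu_num = 0.0
        weight_acc = [0.0] * n_harmonics
        total_energy = 0.0
        for x, e in zip(log_freqs, energies):
            # E-step: unnormalized posterior of each harmonic for this bin.
            dens = [w * math.exp(-0.5 * ((x - mu - off) / sigma) ** 2)
                    for w, off in zip(weights, offsets)]
            norm = sum(dens)
            if e <= 0.0 or norm <= 0.0:
                continue  # bin carries no energy or is far from every harmonic
            for n, d in enumerate(dens):
                g = e * d / norm  # energy of this bin assigned to harmonic n
                mu_num += g * (x - offsets[n])
                weight_acc[n] += g
            total_energy += e
        # M-step: energy-weighted mean of (x - log n), then harmonic weights.
        mu = mu_num / total_energy
        weights = [a / total_energy for a in weight_acc]
    return math.exp(mu), weights

def synth_spectrum(f0, n_points=1500, n_harmonics=4, width=0.02):
    """Synthetic spectrum: Gaussian peaks at f0 * n with amplitude 1 / n."""
    lo, hi = math.log(50.0), math.log(2000.0)
    xs = [lo + (hi - lo) * i / (n_points - 1) for i in range(n_points)]
    es = [sum((1.0 / n) * math.exp(-0.5 * ((x - math.log(f0 * n)) / width) ** 2)
              for n in range(1, n_harmonics + 1)) for x in xs]
    return xs, es

xs, es = synth_spectrum(220.0)
f0_est, harmonic_weights = harmonic_em(xs, es, f0_init=200.0)
print(round(f0_est, 1), [round(w, 2) for w in harmonic_weights])
```

Tying every component mean to a single parameter is what makes the mixture "structured": the M-step updates one pitch per source rather than one mean per peak, so the energy in all harmonics jointly pulls the estimate. A fuller model in the spirit of the abstract would add multiple sources and a time axis.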
Pages: 697 / +
Page count: 2