FUSING TRANSCRIPTION RESULTS FROM POLYPHONIC AND MONOPHONIC AUDIO FOR SINGING MELODY TRANSCRIPTION IN POLYPHONIC MUSIC

Cited by: 0
Authors
Zhu, Bilei [1]
Wu, Fuzhang [1]
Li, Ke [1]
Wu, Yongjian [1]
Huang, Feiyue [1]
Wu, Yunsheng [1]
Affiliations
[1] Tencent Youtu AI Lab, Shenzhen, People's Republic of China
Keywords
Singing melody transcription; polyphonic audio; monophonic singing recordings; deep neural network (DNN); pitch sequence selection; SPEECH;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper presents a new system for singing melody transcription from polyphonic songs. Instead of operating solely on the polyphonic audio of each song to be processed (as most existing systems do), our system additionally takes as input multiple monophonic recordings of people singing the song. To transcribe the singing melody in a song, our system first tracks the singing pitch from the polyphonic audio of the song using a deep neural network (DNN)-based method, and then uses the estimated pitch series as a reference to select among the pitch sequences extracted from the multiple monophonic singing recordings. The selected monophonic pitch sequences, as well as the DNN pitch series from the polyphonic audio, are then transcribed separately, and their transcription results are fused to form the final note sequence. Experimental results show that introducing monophonic singing recordings into transcription significantly improves the performance of singing melody transcription from polyphonic songs.
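The select-then-fuse idea in the abstract can be illustrated with a small sketch. The abstract does not specify the selection criterion or the fusion rule, so everything below is an assumption made for illustration: pitch is given per frame as a MIDI note number with 0 marking unvoiced frames, a monophonic sequence is kept when enough of its voiced frames agree with the reference within a tolerance, and fusion is a frame-wise median over semitone-rounded pitches. The function names `agreement`, `select_sequences`, and `fuse`, and the thresholds `tol` and `min_agreement`, are hypothetical and do not come from the paper.

```python
import numpy as np

def agreement(reference, candidate, tol=0.5):
    """Fraction of frames voiced in both sequences whose pitches
    differ by at most `tol` semitones (assumed criterion)."""
    ref = np.asarray(reference, dtype=float)
    cand = np.asarray(candidate, dtype=float)
    both = (ref > 0) & (cand > 0)  # 0 marks an unvoiced frame
    if not both.any():
        return 0.0
    return float(np.mean(np.abs(ref[both] - cand[both]) <= tol))

def select_sequences(reference, candidates, min_agreement=0.6):
    """Keep candidates that agree well enough with the reference."""
    return [c for c in candidates if agreement(reference, c) >= min_agreement]

def fuse(sequences):
    """Frame-wise median of semitone-rounded pitches; a frame is voiced
    only if more than half of the sequences are voiced there."""
    stack = np.round(np.asarray(sequences, dtype=float))
    voiced = stack > 0
    fused = np.zeros(stack.shape[1])
    for t in range(stack.shape[1]):
        if voiced[:, t].sum() * 2 > stack.shape[0]:
            fused[t] = np.median(stack[voiced[:, t], t])
    return fused

# Toy data: a reference pitch series (standing in for the DNN output)
# and three monophonic pitch sequences of the same length.
reference = [60, 60, 62, 62, 0, 64]
candidates = [
    [60.1, 59.9, 62.2, 61.8, 0, 64.0],  # close to the reference
    [55, 55, 70, 70, 0, 50],            # far from the reference
    [60, 60, 62, 0, 0, 64.3],           # close, with one missing frame
]
selected = select_sequences(reference, candidates)
fused = fuse([reference] + selected)
print(len(selected), fused.tolist())
```

In this toy case the second candidate is rejected and the remaining sequences are combined with the reference. The sketch assumes all sequences are already time-aligned and in the same octave, which real singing recordings would generally not be.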
Pages: 296-300
Page count: 5