FUSING TRANSCRIPTION RESULTS FROM POLYPHONIC AND MONOPHONIC AUDIO FOR SINGING MELODY TRANSCRIPTION IN POLYPHONIC MUSIC

被引:0
|
作者
Zhu, Bilei [1 ]
Wu, Fuzhang [1 ]
Li, Ke [1 ]
Wu, Yongjian [1 ]
Huang, Feiyue [1 ]
Wu, Yunsheng [1 ]
机构
[1] Tencent Youtu AI Lab, Shenzhen, Peoples R China
关键词
Singing melody transcription; polyphonic audio; monophonic singing recordings; deep neural network (DNN); pitch sequence selection; SPEECH;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a new system for singing melody transcription from polyphonic songs. Instead of operating solely on polyphonic audio of each song to be processed (as most existing systems do), our system takes as inputs additionally multiple monophonic recordings of people singing the song. To transcribe the singing melody in a song, our system first tracks the singing pitch from polyphonic audio of the song by using a deep neural network (DNN)-based method, and then uses the estimated pitch series as reference to select the pitch sequences extracted from the multiple monophonic singing recordings. The selected monophonic pitch sequences, as well as the DNN pitch series from the polyphonic audio, are then transcribed separately, and their transcriptions results are fused to form the final note sequence. Experimental results show that, by introducing monophonic singings into transcription, the performance of singing melody transcription from polyphonic songs can be significantly improved.
引用
收藏
页码:296 / 300
页数:5
相关论文
共 50 条
  • [41] Instrument Learning and Sparse NMD for Automatic Polyphonic Music Transcription
    Rizzi, Antonello
    Antonelli, Mario
    Luzi, Massimiliano
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1405 - 1415
  • [42] Application of Auditory Filter-Banks in Polyphonic Music Transcription
    Velazquez Lopez, Omar
    Oropeza Rodriguez, Jose Luis
    Suarez Guerra, Sergio
    COMPUTACION Y SISTEMAS, 2022, 26 (04): : 1421 - 1428
  • [43] Singer Diarization for Polyphonic Music With Unison Singing
    Suda, Hitoshi
    Saito, Daisuke
    Fukayama, Satoru
    Nakano, Tomoyasu
    Goto, Masataka
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1531 - 1545
  • [44] AUTOMATIC LYRICS ALIGNMENT AND TRANSCRIPTION IN POLYPHONIC MUSIC: DOES BACKGROUND MUSIC HELP?
    Gupta, Chitralekha
    Yilmaz, Emre
    Li, Haizhou
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 496 - 500
  • [45] Audio Melody Extraction from Monophonic Turkish Maqam Music
    Simsek, Berrak Ozturk
    Akan, Aydin
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [46] Towards a Computational Model of Melody Identification in Polyphonic Music
    Madsen, Soren Tjagvad
    Widmer, Gerhard
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 459 - 464
  • [47] Extraction of melody from polyphonic music using modified morlet wavelet
    Kumar, Neeraj
    Kumar, Raubin
    Murmu, Govind
    Sethy, Prabira Kumar
    Sethy, Prabira Kumar (prabirsethy.05@gmail.com), 1600, Elsevier B.V. (80):
  • [48] Data representations for audio-to-score monophonic music transcription
    Roman, Miguel A.
    Pertusa, Antonio
    Calvo-Zaragoza, Jorge
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
  • [49] An Effective Approach for Vocal Melody Extraction from Polyphonic Music on GPU
    Yao, Guangchao
    Zheng, Yao
    Xiao, Limin
    Ruan, Li
    Lin, Zhen
    Peng, Junjie
    NETWORK AND PARALLEL COMPUTING, NPC 2013, 2013, 8147 : 284 - 297
  • [50] On the Effect of Memory Width in Automatic Transcription Systems for Polyphonic Piano Music
    Costantini, Giovanni
    Todisco, Massimiliano
    Saggio, Giovanni
    IMCIC'11: THE 2ND INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I, 2011, : 124 - 127