FUSING TRANSCRIPTION RESULTS FROM POLYPHONIC AND MONOPHONIC AUDIO FOR SINGING MELODY TRANSCRIPTION IN POLYPHONIC MUSIC

被引：0

作者：

Zhu, Bilei ^{[1
]}

Wu, Fuzhang ^{[1
]}

Li, Ke ^{[1
]}

Wu, Yongjian ^{[1
]}

Huang, Feiyue ^{[1
]}

Wu, Yunsheng ^{[1
]}

机构：

[1] Tencent Youtu AI Lab, Shenzhen, Peoples R China

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年

关键词：

Singing melody transcription; polyphonic audio; monophonic singing recordings; deep neural network (DNN); pitch sequence selection; SPEECH;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a new system for singing melody transcription from polyphonic songs. Instead of operating solely on polyphonic audio of each song to be processed (as most existing systems do), our system takes as inputs additionally multiple monophonic recordings of people singing the song. To transcribe the singing melody in a song, our system first tracks the singing pitch from polyphonic audio of the song by using a deep neural network (DNN)-based method, and then uses the estimated pitch series as reference to select the pitch sequences extracted from the multiple monophonic singing recordings. The selected monophonic pitch sequences, as well as the DNN pitch series from the polyphonic audio, are then transcribed separately, and their transcriptions results are fused to form the final note sequence. Experimental results show that, by introducing monophonic singings into transcription, the performance of singing melody transcription from polyphonic songs can be significantly improved.

引用

页码：296 / 300

页数：5

共 50 条

[41] Instrument Learning and Sparse NMD for Automatic Polyphonic Music Transcription
Rizzi, Antonello
Antonelli, Mario
Luzi, Massimiliano
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (07) : 1405 - 1415
[42] Application of Auditory Filter-Banks in Polyphonic Music Transcription
Velazquez Lopez, Omar
Oropeza Rodriguez, Jose Luis
Suarez Guerra, Sergio
COMPUTACION Y SISTEMAS, 2022, 26 (04): : 1421 - 1428
[43] Singer Diarization for Polyphonic Music With Unison Singing
Suda, Hitoshi
Saito, Daisuke
Fukayama, Satoru
Nakano, Tomoyasu
Goto, Masataka
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1531 - 1545
[44] AUTOMATIC LYRICS ALIGNMENT AND TRANSCRIPTION IN POLYPHONIC MUSIC: DOES BACKGROUND MUSIC HELP?
Gupta, Chitralekha
Yilmaz, Emre
Li, Haizhou
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 496 - 500
[45] Audio Melody Extraction from Monophonic Turkish Maqam Music
Simsek, Berrak Ozturk
Akan, Aydin
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[46] Towards a Computational Model of Melody Identification in Polyphonic Music
Madsen, Soren Tjagvad
Widmer, Gerhard
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 459 - 464
[47] Extraction of melody from polyphonic music using modified morlet wavelet
Kumar, Neeraj
Kumar, Raubin
Murmu, Govind
Sethy, Prabira Kumar
Sethy, Prabira Kumar (prabirsethy.05@gmail.com), 1600, Elsevier B.V. (80):
[48] Data representations for audio-to-score monophonic music transcription
Roman, Miguel A.
Pertusa, Antonio
Calvo-Zaragoza, Jorge
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
[49] An Effective Approach for Vocal Melody Extraction from Polyphonic Music on GPU
Yao, Guangchao
Zheng, Yao
Xiao, Limin
Ruan, Li
Lin, Zhen
Peng, Junjie
NETWORK AND PARALLEL COMPUTING, NPC 2013, 2013, 8147 : 284 - 297
[50] On the Effect of Memory Width in Automatic Transcription Systems for Polyphonic Piano Music
Costantini, Giovanni
Todisco, Massimiliano
Saggio, Giovanni
IMCIC'11: THE 2ND INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I, 2011, : 124 - 127

← 1 2 3 4 5 →