FUSING TRANSCRIPTION RESULTS FROM POLYPHONIC AND MONOPHONIC AUDIO FOR SINGING MELODY TRANSCRIPTION IN POLYPHONIC MUSIC

被引：0

作者：

Zhu, Bilei ^{[1
]}

Wu, Fuzhang ^{[1
]}

Li, Ke ^{[1
]}

Wu, Yongjian ^{[1
]}

Huang, Feiyue ^{[1
]}

Wu, Yunsheng ^{[1
]}

机构：

[1] Tencent Youtu AI Lab, Shenzhen, Peoples R China

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年

关键词：

Singing melody transcription; polyphonic audio; monophonic singing recordings; deep neural network (DNN); pitch sequence selection; SPEECH;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a new system for singing melody transcription from polyphonic songs. Instead of operating solely on polyphonic audio of each song to be processed (as most existing systems do), our system takes as inputs additionally multiple monophonic recordings of people singing the song. To transcribe the singing melody in a song, our system first tracks the singing pitch from polyphonic audio of the song by using a deep neural network (DNN)-based method, and then uses the estimated pitch series as reference to select the pitch sequences extracted from the multiple monophonic singing recordings. The selected monophonic pitch sequences, as well as the DNN pitch series from the polyphonic audio, are then transcribed separately, and their transcriptions results are fused to form the final note sequence. Experimental results show that, by introducing monophonic singings into transcription, the performance of singing melody transcription from polyphonic songs can be significantly improved.

引用

页码：296 / 300

页数：5

共 50 条

[1] Singing Transcription from Polyphonic Music Using Melody Contour Filtering
He, Zhuang
Feng, Yin
APPLIED SCIENCES-BASEL, 2021, 11 (13):
[2] Automatic Transcription of Flamenco Singing From Polyphonic Music Recordings
Kroher, Nadine
Gomez, Emilia
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 901 - 913
[3] Automatic transcription of melody, bass line, and chords in polyphonic music
Ryynanen, Matti P.
Klapuri, Anssi P.
COMPUTER MUSIC JOURNAL, 2008, 32 (03) : 72 - 86
[4] Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music
Kratimenos, Agelos
Avramidis, Kleanthis
Garoufis, Christos
Zlatintsi, Athanasia
Maragos, Petros
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 156 - 160
[5] POLYPHONIC MUSIC TRANSCRIPTION WITH SEMANTIC SEGMENTATION
Wu, Yu-Te
Chen, Berlin
Su, Li
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 166 - 170
[6] Automatic Transcription of Polyphonic Vocal Music
McLeod, Andrew
Schramm, Rodrigo
Steedman, Mark
Benetos, Emmanouil
APPLIED SCIENCES-BASEL, 2017, 7 (12):
[7] Automatic transcription of piano polyphonic music
Kobzantsev, A
Chazan, D
Zeevi, Y
ISPA 2005: Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005, : 414 - 418
[8] SINGING MELODY EXTRACTION FROM POLYPHONIC MUSIC BASED ON SPECTRAL CORRELATION MODELING
Du, Xingjian
Zhu, Bilei
Kong, Qiuglang
Ma, Zejun
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 241 - 245
[9] Transcription and separation of drum signals from polyphonic music
Gillet, Olivier
Richard, Gael
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 529 - 540
[10] Automatic bass line transcription from streaming polyphonic audio
Ryynanen, Matti
Klapuri, Anssi
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1437 - +

← 1 2 3 4 5 →