Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals

被引:4
|
作者
Reddy, Gurunath M. [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
Predominant Melody; Zero Frequency Filter; Note Onsets; Vocal Notes; Polyphonic Music; Vocals and Non-Vocals; FUNDAMENTAL-FREQUENCY ESTIMATION; SPEECH SIGNALS; AUDIO SIGNALS; TRANSCRIPTION; SEPARATION;
D O I
10.21437/Interspeech.2016-856
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A method based on the production mechanism of the vocals in the composite vocal polyphonic music signal is proposed for vocal melody extraction. In the proposed method, initially the non-pitched percussive source is suppressed by observing its wideband spectral characteristics to emphasise the harmonic content in the mixture signal. Further, the harmonic enhanced signal is segmented into vocal and non-vocal regions by thresholding the salience energy contour. The vocal regions are further divided into vocal note like regions by their spectral transition cues in the frequency domain. The melody contour in each vocal note is extracted by detecting the locations of instant of significant excitation by passing it through adaptive zero frequency filtering (ZFF) in the time domain. The experimental results showed that the proposed method is indeed comparable to the state-of-the-art saliency based melody extraction method.
引用
收藏
页码:3309 / 3313
页数:5
相关论文
共 50 条
  • [31] TONET: TONE-OCTAVE NETWORK FOR SINGING MELODY EXTRACTION FROM POLYPHONIC MUSIC
    Chen, Ke
    Yu, Shuai
    Wang, Cheng-, I
    Li, Wei
    Berg-Kirkpatrick, Taylor
    Dubnov, Shlomo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 621 - 625
  • [32] Extraction of vocal-tract system characteristics from speech signals
    Indian Inst of Technology, Madras, Spain
    IEEE Trans Speech Audio Process, 4 (313-327):
  • [33] Extraction of vocal-tract system characteristics from speech signals
    Yegnanarayana, B
    Veldhuis, RNJ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (04): : 313 - 327
  • [34] VOCAL MELODY EXTRACTION VIA DNN-BASED PITCH ESTIMATION AND SALIENCE-BASED PITCH REFINEMENT
    Gao, Yongwei
    Zhu, Bilei
    Li, Wei
    Li, Ke
    Wu, Yongjian
    Huang, Feiyue
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1000 - 1004
  • [35] GROUP DELAY BASED MELODY MONOPITCH EXTRACTION FROM MUSIC
    Rajan, Rajeev
    Murthy, Hema A.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 186 - 190
  • [36] MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music
    Yu, Shuai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 365 - 373
  • [37] MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music
    Gao, Yuan
    Hu, Ying
    Wang, Liusong
    Huang, Hao
    He, Liang
    INTERSPEECH 2023, 2023, : 5396 - 5400
  • [38] Note-event Detection in Polyphonic Musical Signals based on Harmonic Matching Pursuit and Spectral Smoothness
    Canadas-Quesada, F. J.
    Vera-Candeas, P.
    Ruiz-Reyes, N.
    Mata-Campos, R.
    Carabias-Orti, J. J.
    JOURNAL OF NEW MUSIC RESEARCH, 2008, 37 (03) : 167 - 183
  • [39] Vocal Music Teaching Pharyngeal Training Method Based on Audio Extraction by Big Data Analysis
    Huang, Chunyuan
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [40] A Deep Neural Network based approach for Vocal Extraction from songs
    Sharma, Vasu
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2015, : 116 - 121