A Multi-Scale Fully Convolutional Network for Singing Melody Extraction

被引:0
|
作者
Gao, Ping [1 ]
You, Cheng-You [1 ]
Chi, Tai-Shih [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan
关键词
D O I
10.1109/apsipaasc47483.2019.9023231
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The melody extraction can be considered as a sequence-to-sequence task or a classification task. Many recent models based on semantic segmentation have been proven very effective in melody extraction. In this paper, we built up a fully convolutional network (FCN) for melody extraction from polyphonic music. Inspired by the state-of-the-art architecture of the semantic segmentation, we constructed the encoder in a dense way and designed the decoder accordingly for audio processing. The combined frequency and periodicity (CFP) representation, which contains spectral and cepstral information, was adopted as the input feature of the proposed model. We conducted performance comparison between the proposed model and several methods on various datasets. Experimental results show the proposed model achieves state-of-the-art performance with less computation and fewer parameters.
引用
收藏
页码:1288 / 1293
页数:6
相关论文
共 50 条
  • [1] A MULTI-DILATION AND MULTI-RESOLUTION FULLY CONVOLUTIONAL NETWORK FOR SINGING MELODY EXTRACTION
    Gao, Ping
    You, Cheng-You
    Chi, Tai-Shih
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 551 - 555
  • [2] Multi-scale fully convolutional neural network for building extraction
    Cui W.
    Xiong B.
    Zhang L.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2019, 48 (05): : 597 - 608
  • [3] A Multi-Scale Cascade Fully Convolutional Network Face Detector
    Yang, Zhenheng
    Nevatia, Ramakant
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 633 - 638
  • [4] Multi-Scale Fully Convolutional Network for Face Detection in the Wild
    Bai, Yancheng
    Ghanem, Bernard
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 2078 - 2087
  • [5] Detecting and Classifying Nuclei Using Multi-Scale Fully Convolutional Network
    Xin, Bin
    Yang, Yaning
    Xie, Xiaolan
    Shang, Jiandong
    Liu, Zhengyu
    Peng, Shaoliang
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (10) : 1095 - 1103
  • [6] Salient Object Detection with Chained Multi-Scale Fully Convolutional Network
    Tang, Youbao
    Wu, Xiangqian
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 618 - 626
  • [7] ME-FCN: A Multi-Scale Feature-Enhanced Fully Convolutional Network for Building Footprint Extraction
    Sheng, Hui
    Zhang, Yaoteng
    Zhang, Wei
    Wei, Shiqing
    Xu, Mingming
    Muhammad, Yasir
    REMOTE SENSING, 2024, 16 (22)
  • [8] A Part-Aware Multi-Scale Fully Convolutional Network for Pedestrian Detection
    Yang, Peiyu
    Zhang, Guofeng
    Wang, Lu
    Xu, Lisheng
    Deng, Qingxu
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) : 1125 - 1137
  • [9] Multi-Scale Retinal Vessel Segmentation Based on Fully Convolutional Neural Network
    Zheng Tingyue
    Tang Chen
    Lei Zhenkun
    ACTA OPTICA SINICA, 2019, 39 (02)
  • [10] Ground Crack Recognition Based on Fully Convolutional Network With Multi-Scale Input
    Cheng, Jian
    Ye, Liang
    Guo, Yinan
    Zhang, Jun
    An, Hongbo
    IEEE ACCESS, 2020, 8 : 53034 - 53048