Multipitch tracking in music signals using Echo State Networks

被引:0
|
作者
Steiner, Peter [1 ]
Stone, Simon [1 ]
Birkholz, Peter [1 ]
Jalalvand, Azarakhsh [2 ]
机构
[1] Tech Univ Dresden, Inst Acoust & Speech Commun, Dresden, Germany
[2] Univ Ghent, IMEC, IDLab, Ghent, Belgium
关键词
Reservoir Computing; Echo State Network; Multipitch; RNN; MIR;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Currently, convolutional neural networks (CNNs) define the state of the art for multipitch tracking in music signals. Echo State Networks (ESNs), a recently introduced recurrent neural network architecture, achieved similar results as CNNs for various tasks, such as phoneme or digit recognition. However, they have not yet received much attention in the community of Music Information Retrieval. The core of ESNs is a group of unordered, randomly connected neurons, i.e., the reservoir, by which the low-dimensional input space is non-linearly transformed into a high-dimensional feature space. Because only the weights of the connections between the reservoir and the output are trained using linear regression, ESNs are easier to train than deep neural networks. This paper presents a first exploration of ESNs for the challenging task of multipitch tracking in music signals. The best results presented in this paper were achieved with a bidirectional two-layer ESN with 20 000 neurons in each layer. Although the final F -score of 0.7198 still falls below the state of the art (0.7370), the proposed ESN-based approach serves as a baseline for further investigations of ESNs in audio signal processing in the future.
引用
收藏
页码:126 / 130
页数:5
相关论文
共 50 条
  • [41] Pose regulation of a constrained circular object using Echo State Networks
    Garcia-Rodriguez, Rodolfo
    Parra-Vega, Vicente
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (01) : 583 - 594
  • [42] Exchange Rate Forecasting Using Echo State Networks for Trading Strategies
    Maciel, Leandro
    Gomide, Fernando
    Santos, David
    Ballini, Rosangela
    2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING & ECONOMICS (CIFER), 2014, : 40 - 47
  • [43] Investigating forced transient chaos in monsoon using Echo State Networks
    Kapil, Chandan
    Barde, Vasundhara
    Seemala, Gopi K.
    Dimri, A. P.
    CLIMATE DYNAMICS, 2024, 62 (07) : 5759 - 5768
  • [44] Structural health monitoring of a footbridge using Echo State Networks and NARMAX
    Wootton, Adam J.
    Butcher, John B.
    Kyriacou, Theocharis
    Day, Charles R.
    Haycock, Peter W.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 64 : 152 - 163
  • [45] Direct Control of an Active Tactile Sensor Using Echo State Networks
    Krause, Andre Frank
    Blaesing, Bettina
    Duerr, Volker
    Schack, Thomas
    HUMAN CENTERED ROBOT SYSTEMS: COGNITION, INTERACTION, TECHNOLOGY, 2009, 6 : 11 - +
  • [46] Real-time emotion recognition using echo state networks
    Scherer, Stefan
    Oubbati, Mohamed
    Schwenker, Friedhelm
    Palm, Guenther
    PERCEPTION IN MULTIMODAL DIALOGUE SYSTEMS, PROCEEDINGS, 2008, 5078 : 200 - 204
  • [47] Swing Control of a Lower Extremity Exoskeleton Using Echo State Networks
    Choi, Byunghun
    Seo, Changhoon
    Lee, Sanghoon
    Kim, Byungun
    Kim, Dojong
    IFAC PAPERSONLINE, 2017, 50 (01): : 1328 - 1333
  • [48] Finite Mixture Spectrogram Modeling for Multipitch Tracking Using A Factorial Hidden Markov Model
    Wohlmayr, Michael
    Pernkopf, Franz
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1103 - 1106
  • [49] Efficient Embedded Machine Learning applications using Echo State Networks
    Cerina, I
    Santambrogio, M. D.
    Franco, G.
    Gallicchio, C.
    Micheli, A.
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 1299 - 1302
  • [50] Blood Vessel Segmentation In Retinal Images Using Echo State Networks
    Souahlia, Abdelkerim
    Belatreche, Ammar
    Benyettou, Abdelkader
    Curran, Kevin
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2017, : 91 - 98