Basic filters for convolutional neural networks applied to music: Training or design?

Cited by: 13
Authors
Doerfler, Monika [1]
Grill, Thomas [2]
Rammer, Roswitha [1]
Flexer, Arthur [2]
Affiliations
[1] Univ Vienna, Fac Math, A-1090 Vienna, Austria
[2] Austrian Res Inst Artificial Intelligence OFAI, Freyung 6-6, A-1010 Vienna, Austria
Source
NEURAL COMPUTING & APPLICATIONS | 2020, Vol. 32, No. 4
Keywords
Machine learning; Convolutional neural networks; Adaptive filters; Gabor multipliers; Mel-spectrogram; End-to-end learning; OPERATORS;
DOI
10.1007/s00521-018-3704-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
When convolutional neural networks are used to tackle learning problems based on music or other time series, raw one-dimensional data are commonly preprocessed to obtain spectrogram or mel-spectrogram coefficients, which are then used as input to the actual neural network. In this contribution, we investigate, both theoretically and experimentally, the influence of this pre-processing step on the network's performance and pose the question whether replacing it by applying adaptive or learned filters directly to the raw data can improve learning success. The theoretical results show that approximately reproducing mel-spectrogram coefficients by applying adaptive filters and subsequent time-averaging on the squared amplitudes is in principle possible. We also conducted extensive experimental work on the task of singing voice detection in music. The results of these experiments show that for classification based on convolutional neural networks the features obtained from adaptive filter banks followed by time-averaging the squared modulus of the filters' output perform better than the canonical Fourier transform-based mel-spectrogram coefficients. Alternative adaptive approaches with center frequencies or time-averaging lengths learned from training data perform equally well.
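The feature pipeline named in the abstract (a filter bank applied to the raw signal, followed by time-averaging of the squared modulus of each filter's output) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `filterbank_energy_features`, the Hann-windowed complex filters, the moving-average kernel, and every parameter value below are assumptions chosen for illustration, and the center frequencies are fixed by hand rather than mel-spaced or learned.

```python
import numpy as np

def filterbank_energy_features(signal, sample_rate, center_freqs_hz,
                               filter_len=256, avg_len=128, hop=64):
    """Filter the raw signal, square the modulus, then time-average.

    Hypothetical helper: all names and defaults are illustrative assumptions.
    """
    # Time axis of the filter taps, centered around zero.
    t = (np.arange(filter_len) - filter_len // 2) / sample_rate
    window = np.hanning(filter_len)
    window = window / window.sum()           # unit gain at the center frequency
    averager = np.ones(avg_len) / avg_len    # simple moving-average kernel
    features = []
    for fc in center_freqs_hz:
        # Complex band-pass filter: the window modulated to frequency fc.
        h = window * np.exp(2j * np.pi * fc * t)
        band = np.convolve(signal, h, mode="same")
        energy = np.abs(band) ** 2           # squared modulus of filter output
        smoothed = np.convolve(energy, averager, mode="same")
        features.append(smoothed[::hop])     # subsample to a frame rate
    return np.array(features)                # shape: (n_filters, n_frames)

# Usage: a 1 kHz test tone should excite the filter centered at 1 kHz most.
sample_rate = 8000
time = np.arange(sample_rate) / sample_rate  # one second of samples
tone = np.sin(2 * np.pi * 1000 * time)
center_freqs_hz = [250.0, 500.0, 1000.0, 2000.0]
feats = filterbank_energy_features(tone, sample_rate, center_freqs_hz)
strongest = int(np.argmax(feats.mean(axis=1)))
```

In an adaptive variant of the kind the abstract mentions, the center frequencies or the averaging length would be trainable parameters instead of fixed constants; the sketch keeps them fixed for simplicity.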
Pages: 941-954
Number of pages: 14