Basic filters for convolutional neural networks applied to music: Training or design?

Cited by: 13
Authors
Doerfler, Monika [1]
Grill, Thomas [2]
Bammer, Roswitha [1]
Flexer, Arthur [2]
Affiliations
[1] Univ Vienna, Fac Math, A-1090 Vienna, Austria
[2] Austrian Res Inst Artificial Intelligence OFAI, Freyung 6-6, A-1010 Vienna, Austria
Source
NEURAL COMPUTING & APPLICATIONS | 2020, Vol. 32, No. 4
Keywords
Machine learning; Convolutional neural networks; Adaptive filters; Gabor multipliers; Mel-spectrogram; End-to-end learning; OPERATORS;
DOI
10.1007/s00521-018-3704-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
When convolutional neural networks are used to tackle learning problems based on music or other time series, raw one-dimensional data are commonly preprocessed to obtain spectrogram or mel-spectrogram coefficients, which are then used as input to the actual neural network. In this contribution, we investigate, both theoretically and experimentally, the influence of this pre-processing step on the network's performance and pose the question whether replacing it by applying adaptive or learned filters directly to the raw data can improve learning success. The theoretical results show that approximately reproducing mel-spectrogram coefficients by applying adaptive filters and subsequent time-averaging on the squared amplitudes is in principle possible. We also conducted extensive experimental work on the task of singing voice detection in music. The results of these experiments show that for classification based on convolutional neural networks the features obtained from adaptive filter banks followed by time-averaging the squared modulus of the filters' output perform better than the canonical Fourier transform-based mel-spectrogram coefficients. Alternative adaptive approaches with center frequencies or time-averaging lengths learned from training data perform equally well.
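The pipeline the abstract describes (filter the raw signal, take the squared modulus, then average over time) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `filterbank_features`, the Hann-windowed complex-exponential filters, the box-shaped averaging window, and all parameter values are assumptions chosen only for illustration.

```python
import numpy as np


def filterbank_features(signal, sample_rate, center_freqs,
                        filter_len=512, avg_len=256, hop=128):
    """Band energies: band-pass filter, squared modulus, time-average.

    Hypothetical helper; filter shape and parameters are assumptions.
    """
    # Time axis of the filter, centred on zero.
    t = (np.arange(filter_len) - filter_len / 2) / sample_rate
    window = np.hanning(filter_len)
    features = []
    for fc in center_freqs:
        # Complex band-pass filter: windowed complex exponential at fc,
        # normalised to unit gain at its centre frequency.
        h = window * np.exp(2j * np.pi * fc * t) / np.sum(window)
        y = np.convolve(signal, h, mode="same")
        # Squared modulus of the filter output.
        power = np.abs(y) ** 2
        # Time-averaging with a box window, then subsampling by `hop`.
        avg = np.convolve(power, np.ones(avg_len) / avg_len, mode="same")
        features.append(avg[::hop])
    return np.array(features)


# Usage: one second of a 1 kHz sine, analysed in three bands.
sr = 8000
x = np.sin(2 * np.pi * 1000 * np.arange(sr) / sr)
feats = filterbank_features(x, sr, center_freqs=[250.0, 1000.0, 3000.0])
```

In this sketch the band centred on 1 kHz carries almost all of the energy. Making the centre frequencies or the averaging length trainable, as the abstract mentions, would amount to treating `center_freqs` or `avg_len` as learned parameters rather than fixed inputs.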
Pages: 941-954
Number of pages: 14