Basic filters for convolutional neural networks applied to music: Training or design?

Cited by: 13
Authors
Doerfler, Monika [1 ]
Grill, Thomas [2 ]
Rammer, Roswitha [1 ]
Flexer, Arthur [2 ]
Affiliations
[1] Univ Vienna, Fac Math, A-1090 Vienna, Austria
[2] Austrian Res Inst Artificial Intelligence OFAI, Freyung 6-6, A-1010 Vienna, Austria
Source
NEURAL COMPUTING & APPLICATIONS | 2020 / Vol. 32 / Issue 04
Keywords
Machine learning; Convolutional neural networks; Adaptive filters; Gabor multipliers; Mel-spectrogram; End-to-end learning; OPERATORS;
DOI
10.1007/s00521-018-3704-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
When convolutional neural networks are used to tackle learning problems based on music or other time series, raw one-dimensional data are commonly preprocessed to obtain spectrogram or mel-spectrogram coefficients, which are then used as input to the actual neural network. In this contribution, we investigate, both theoretically and experimentally, the influence of this pre-processing step on the network's performance and pose the question whether replacing it by applying adaptive or learned filters directly to the raw data can improve learning success. The theoretical results show that approximately reproducing mel-spectrogram coefficients by applying adaptive filters and subsequent time-averaging on the squared amplitudes is in principle possible. We also conducted extensive experimental work on the task of singing voice detection in music. The results of these experiments show that for classification based on convolutional neural networks the features obtained from adaptive filter banks followed by time-averaging the squared modulus of the filters' output perform better than the canonical Fourier transform-based mel-spectrogram coefficients. Alternative adaptive approaches with center frequencies or time-averaging lengths learned from training data perform equally well.
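The feature pipeline described in the abstract (a filter bank applied to the raw signal, followed by time-averaging of the squared modulus of each filter output) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the Gaussian-windowed complex exponentials, the function names `gabor_filter_bank` and `averaged_filter_energies`, and all parameter values are assumptions chosen for illustration, and the filters here are fixed rather than learned.

```python
import numpy as np


def gabor_filter_bank(center_freqs_hz, sample_rate, filter_len, bandwidth_hz):
    """Build complex band-pass filters (assumed shape: Gaussian-windowed exponentials)."""
    center_freqs_hz = np.asarray(center_freqs_hz, dtype=float)
    # Time axis in seconds, centred on the middle tap.
    t = (np.arange(filter_len) - filter_len // 2) / sample_rate
    window = np.exp(-np.pi * (bandwidth_hz * t) ** 2)
    carriers = np.exp(2j * np.pi * center_freqs_hz[:, None] * t[None, :])
    filters = window[None, :] * carriers
    # Normalise each filter to unit energy so the bands are comparable.
    return filters / np.linalg.norm(filters, axis=1, keepdims=True)


def averaged_filter_energies(signal, filters, avg_len, hop):
    """Filter the signal, take the squared modulus, then time-average and subsample."""
    outputs = np.stack([np.convolve(signal, h, mode="same") for h in filters])
    energy = np.abs(outputs) ** 2
    kernel = np.ones(avg_len) / avg_len  # moving-average (box) window
    smoothed = np.stack([np.convolve(e, kernel, mode="same") for e in energy])
    return smoothed[:, ::hop]


# Usage: a one-second 440 Hz tone sampled at 8 kHz, analysed in three bands.
sample_rate = 8000
time = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * time)

bank = gabor_filter_bank([220.0, 440.0, 880.0], sample_rate,
                         filter_len=513, bandwidth_hz=50.0)
features = averaged_filter_energies(tone, bank, avg_len=160, hop=80)
strongest_band = int(features.mean(axis=1).argmax())
```

In this sketch `features` has one row per filter and one column per time frame, so it can stand in for a mel-spectrogram as network input. In an adaptive variant of the kind the abstract mentions, quantities such as the center frequencies or the averaging length would be trainable parameters rather than the fixed values used here.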
Pages: 941-954
Number of pages: 14
Related papers
50 in total
  • [11] Adaptive filters in Graph Convolutional Neural Networks
    Apicella, Andrea
    Isgro, Francesco
    Pollastro, Andrea
    Prevete, Roberto
    PATTERN RECOGNITION, 2023, 144
  • [12] Learning the number of filters in convolutional neural networks
    Li, Jue
    Cao, Feng
    Cheng, Honghong
    Qian, Yuhua
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2021, 17 (02) : 75 - 84
  • [13] Convolutional Filters and Neural Networks With Noncommutative Algebras
    Parada-Mayorga, Alejandro
    Butler, Landon
    Ribeiro, Alejandro
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 2683 - 2698
  • [14] Graph Neural Networks With Convolutional ARMA Filters
    Bianchi, Filippo Maria
    Grattarola, Daniele
    Livi, Lorenzo
    Alippi, Cesare
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3496 - 3507
  • [15] Initialization of Convolutional Neural Networks by Gabor Filters
    Ozbulak, Gokhan
    Ekenel, Hazim Kemal
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [16] MIMO Graph Filters for Convolutional Neural Networks
    Gama, Fernando
    Marques, Antonio G.
    Ribeiro, Alejandro
    Leus, Geert
    2018 IEEE 19TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC), 2018, : 651 - 655
  • [17] Latent Training for Convolutional Neural Networks
    Huang, Zi
    Liu, Qi
    Chen, Zhiyuan
    Zhao, Yuming
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ESTIMATION, DETECTION AND INFORMATION FUSION ICEDIF 2015, 2015, : 55 - 60
  • [18] CONVOLUTIONAL RECURRENT NEURAL NETWORKS FOR MUSIC CLASSIFICATION
    Choi, Keunwoo
    Fazekas, Gyorgy
    Sandler, Mark
    Cho, Kyunghyun
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2392 - 2396
  • [19] Training a V1 Like Layer Using Gabor Filters in Convolutional Neural Networks
    Bai, Jun
    Zeng, Yi
    Zhao, Yuxuan
    Zhao, Feifei
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [20] Compression of Convolutional Neural Networks With Divergent Representation of Filters
    Lei, Peng
    Liang, Jiawei
    Zheng, Tong
    Wang, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 4125 - 4137