Basic filters for convolutional neural networks applied to music: Training or design?

被引:13
|
作者
Doerfler, Monika [1 ]
Grill, Thomas [2 ]
Rammer, Roswitha [1 ]
Flexer, Arthur [2 ]
机构
[1] Univ Vienna, Fac Math, A-1090 Vienna, Austria
[2] Austrian Res Inst Artificial Intelligence OFAI, Freyung 6-6, A-1010 Vienna, Austria
来源
NEURAL COMPUTING & APPLICATIONS | 2020年 / 32卷 / 04期
关键词
Machine learning; Convolutional neural networks; Adaptive filters; Gabor multipliers; Mel-spectrogram; End-to-end learning; OPERATORS;
D O I
10.1007/s00521-018-3704-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When convolutional neural networks are used to tackle learning problems based on music or other time series, raw one-dimensional data are commonly preprocessed to obtain spectrogram or mel-spectrogram coefficients, which are then used as input to the actual neural network. In this contribution, we investigate, both theoretically and experimentally, the influence of this pre-processing step on the network's performance and pose the question whether replacing it by applying adaptive or learned filters directly to the raw data can improve learning success. The theoretical results show that approximately reproducing mel-spectrogram coefficients by applying adaptive filters and subsequent time-averaging on the squared amplitudes is in principle possible. We also conducted extensive experimental work on the task of singing voice detection in music. The results of these experiments show that for classification based on convolutional neural networks the features obtained from adaptive filter banks followed by time-averaging the squared modulus of the filters' output perform better than the canonical Fourier transform-based mel-spectrogram coefficients. Alternative adaptive approaches with center frequencies or time-averaging lengths learned from training data perform equally well.
引用
收藏
页码:941 / 954
页数:14
相关论文
共 50 条
  • [41] Convolutional Neural Networks Applied for Skin Lesion Segmentation
    Araujo, Graziela Silva
    Camara-Chavez, Guillermo
    Oliveira, Roberta B.
    2021 XLVII LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2021), 2021,
  • [42] SampleCNN: End-to-End Deep Convolutional Neural Networks Using Very Small Filters for Music Classification
    Lee, Jongpil
    Park, Jiyoung
    Kim, Keunhyoung Luke
    Nam, Juhan
    APPLIED SCIENCES-BASEL, 2018, 8 (01):
  • [43] Object Tracking with Convolutional Neural Networks and Kernelized Correlation Filters
    Li, Dongxuan
    Chen, Wenjie
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 1039 - 1044
  • [44] Face detection using convolutional neural networks and Gabor filters
    Kwolek, B
    ARTIFICIAL NEURAL NETWORKS: BIOLOGICAL INSPIRATIONS - ICANN 2005, PT 1, PROCEEDINGS, 2005, 3696 : 551 - 556
  • [45] Crop Anomaly Identification with Color Filters and Convolutional Neural Networks
    Nardari, Guilherme V.
    Romero, Roseli A. F.
    Guizilini, Vitor C.
    Mareco, Willy E. C.
    Milori, Debora M. B. P.
    Villas-Boas, Paulino R.
    Dias Santos, Igor Araujo
    15TH LATIN AMERICAN ROBOTICS SYMPOSIUM 6TH BRAZILIAN ROBOTICS SYMPOSIUM 9TH WORKSHOP ON ROBOTICS IN EDUCATION (LARS/SBR/WRE 2018), 2018, : 363 - 369
  • [46] Filters in Convolutional Neural Networks as Independent Detectors of Visual Concepts
    Hristov, Anton
    Nisheva, Maria
    Dimov, Dimo
    COMPUTER SYSTEMS AND TECHNOLOGIES, 2019, : 110 - 117
  • [47] Improving Performance of Convolutional Neural Networks by Separable Filters on GPU
    Kang, Hao-Ping
    Lee, Che-Rung
    EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 638 - 649
  • [48] Compressing Convolutional Neural Networks by Pruning Density Peak Filters
    Jang, Yunseok
    Lee, Sangyoun
    Kim, Jaeseok
    IEEE ACCESS, 2021, 9 : 8278 - 8285
  • [49] Reborn Filters: Pruning Convolutional Neural Networks with Limited Data
    Tang, Yehui
    You, Shan
    Xu, Chang
    Han, Jin
    Qian, Chen
    Shi, Boxin
    Xu, Chao
    Zhang, Changshui
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5972 - 5980
  • [50] Training Strategies for Convolutional Neural Networks with Transformed Input
    Khandani, Masoumeh Kalantari
    Mikhael, Wasfy B.
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 1058 - 1061