Music Auto-tagging Algorithm Based on Deep Analysis on Labels

被引:0
|
作者
Wang Z. [1 ]
Zhang R. [1 ]
Gao Y. [1 ]
Xiao Y. [1 ]
机构
[1] School of Software Engineering, South China University of Technology, Guangzhou, 510006, Guangdong
关键词
Deep neural network; Music auto-tagging; Music label vector;
D O I
10.12141/j.issn.1000-565X.180273
中图分类号
学科分类号
摘要
Deep neural network algorithms have made breakthroughs in automatic labeling tasks, but it is still hard to solve the noise data problem in real music dataset. A music auto-tagging algorithm based on deep analysis on labels (DAL) which captures the potential relationship between audio features and music tags was proposed. The algorithm first extracts the audio features through a multi-level convolutional network, and then learn the vector representation of music tags to reduce the adverse effects of noise data. The experimental results show that the proposed algorithm can achieve higher mean area under receiver operating characteristic curve (AUROCC) and outperform other auto-tagging methods. © 2019, Editorial Department, Journal of South China University of Technology. All right reserved.
引用
收藏
页码:71 / 76
页数:5
相关论文
共 28 条
  • [11] Wang S.-J., Chen N., Annotating music with hybrid discriminative restricted Boltzmann machines, Journal of East China University of Science and Techno-logy(Natural Science Edition), 43, 4, pp. 540-545, (2017)
  • [12] Dieleman S., Schrauwen B., End-to-end learning for music audio, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6964-6968, (2014)
  • [13] Li P., Chen N., Feature learning for music auto-tagging using denoising autoencoder, Journal of East China University of Science and Technology(Natural Science Edition), 43, 2, pp. 241-247, (2017)
  • [14] Cantador I., Konstas I., Jose J.M., Categorising social tags to improve folksonomy-based recommendations, Web Semantics: Science, Services and Agents on the World Wide Web, 9, 1, pp. 1-15, (2011)
  • [15] Zeiler M.D., Fergus R., Visualizing and understan-ding convolutional networks, Proceedings of the 13th European Conference on Computer Vision, pp. 818-833, (2014)
  • [16] He K., Zhang X., Ren S., Et al., Deep residual learning for image recognition, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [17] Szegedy C., Liu W., Jia Y., Et al., Going deeper with convolutions, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1-9, (2015)
  • [18] Choi K., Joo D., Kim J., Kapre: on-GPU audio preprocessing layers for a quick implementation of deep neural network models with Keras
  • [19] Ioffe S., Szegedy C., Batch normalization: accelerating deep network training by reducing internal covariate shift, Proceedings of the 32nd International Conference on Machine Learning, pp. 448-456, (2015)
  • [20] Clevert D.A., Unterthiner T., Hochreiter S., Fast and accurate deep network learning by exponential linear units(elus)