TOWARDS BLIND QUALITY ASSESSMENT OF CONCERT AUDIO RECORDINGS USING DEEP NEURAL NETWORKS

被引:0
|
作者
Simou, Nikonas [1 ,3 ]
Mastorakis, Yannis [1 ]
Stefanakis, Nikolaos [1 ,2 ]
机构
[1] Fdn Res & Technol Hellas, Inst Comp Sci, Iraklion 70013, Crete, Greece
[2] Hellen Mediterranean Univ, Dept Mus Technol & Acoust, Rethimnon, Greece
[3] Univ Crete, Dept Comp Sci, Iraklion, Greece
关键词
user generated content; quality assessment; deep neural networks;
D O I
10.1109/icassp40776.2020.9053356
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Live music audio and video recordings represent a large percentage of the huge amount of User Generated Content (UGC) that is available on the internet today. Applications and services related to the management and consumption of this content may significantly benefit from tools able to produce a subjective score of the audio quality. In this work, we apply different Deep Neural Network (DNN) architectures to a simple binary classification problem, that of deciding whether a musical recording is user-generated or of professional quality. Showing that we are able to efficiently address this binary classification problem, we gain some useful insight about factors that may assist the design and affect the performance of a future system that would be able to address the more general problem of blind audio quality assessment.
引用
收藏
页码:3477 / 3481
页数:5
相关论文
共 50 条
  • [1] Towards Blind Audio Quality Assessment using a Convolutional-Recurrent Neural Network
    Zheng, Xiguang
    Zhang, Chen
    2021 13TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2021, : 91 - 96
  • [2] INFERRING SOCIAL CONTEXTS FROM AUDIO RECORDINGS USING DEEP NEURAL NETWORKS
    Asgari, Meysam
    Shafran, Izhak
    Bayestehtashk, Alireza
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [3] Blind Quality Assessment of Multiply Distorted Images Using Deep Neural Networks
    Wang, Zhongling
    Athar, Shahrukh
    Wang, Zhou
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 89 - 101
  • [4] End-to-End Blind Image Quality Assessment Using Deep Neural Networks
    Ma, Kede
    Liu, Wentao
    Zhang, Kai
    Duanmu, Zhengfang
    Wang, Zhou
    Zuo, Wangmeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1202 - 1213
  • [5] End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks
    Liu, Wentao
    Duanmu, Zhengfang
    Wang, Zhou
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 546 - 554
  • [6] Non-intrusive method for audio quality assessment of lossy-compressed music recordings using convolutional neural networks
    Kasperuk, Aleksandra
    Zielinski, Slawomir Krzysztof
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2024, 70 (02) : 331 - 339
  • [7] Blind Calibration of Air Quality Wireless Sensor Networks Using Deep Neural Networks
    Veiga, Tiago
    Ljunggren, Erling
    Bach, Kerstin
    Akselsen, Sigmund
    2021 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (IEEE COINS 2021), 2021, : 111 - 116
  • [8] Echocardiographic Image Quality Assessment Using Deep Neural Networks
    Labs, Robert B.
    Zolgharni, Massoud
    Loo, Jonathan P.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2021), 2021, 12722 : 488 - 502
  • [9] Synthesizing Game Audio Using Deep Neural Networks
    McDonagh, Aoife
    Lemley, Joseph
    Cassidy, Ryan
    Corcoran, Peter
    2018 IEEE GAMES, ENTERTAINMENT, MEDIA CONFERENCE (GEM), 2018, : 312 - 315
  • [10] Audio Event Classification Using Deep Neural Networks
    Kons, Zvi
    Toledo-Ronen, Orith
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1481 - 1485