MusicNet: Compact Convolutional Neural Network for Real-time Background Music Detection

被引:0
|
作者
Reddy, Chandan K. A. [1 ]
Gopal, Vishak [1 ]
Dubey, Harishchandra [1 ]
Matusevych, Sergiy [1 ]
Cutler, Ross [1 ]
Aichner, Robert [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
来源
关键词
Background Music Detection; Acoustic Event Detection; Instrumental Music; Convolutional Neural Networks; In-Model Featurization;
D O I
10.21437/Interspeech.2022-864
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
With the recent growth of remote work, online meetings often encounter challenging audio contexts such as background noise, music, and echo. Accurate real-time detection of music events can help to improve the user experience. In this paper, we present MusicNet, a compact neural model for detecting background music in the real-time communications pipeline. In video meetings, music frequently co-occurs with speech and background noises, making the accurate classification quite challenging. We propose a compact convolutional neural network core preceded by an in-model featurization layer. MusicNet takes 9 seconds of raw audio as input and does not require any model-specific featurization in the product stack. We train our model on the balanced subset of the Audio Set [1] data and validate it on 1000 crowd-sourced real test clips. Finally, we compare MusicNet performance with 20 state-of-the-art models. MusicNet has a true positive rate (TPR) of 81.3% at a 0.1% false-positive rate (FPR), which is significantly better than state-of-the-art models included in our study. MusicNet is also 10x smaller and has 4x faster inference than the best-performing models we benchmarked.
引用
收藏
页码:4162 / 4166
页数:5
相关论文
共 50 条
  • [31] Convolutional neural networks for real-time epileptic seizure detection
    Achilles, Felix
    Tombari, Federico
    Belagiannis, Vasileios
    Loesch, Anna Mira
    Noachtar, Soheyl
    Navab, Nassir
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2018, 6 (03): : 264 - 269
  • [32] Convolutional Neural Networks for Real-Time and Wireless Damage Detection
    Avci, Onur
    Abdeljaber, Osama
    Kiranyaz, Serkan
    Inman, Daniel
    DYNAMICS OF CIVIL STRUCTURES, VOL 2, IMAC 2019, 2020, : 129 - 136
  • [33] Real-time arrhythmia detection using convolutional neural networks
    Vu, Thong
    Petty, Tyler
    Yakut, Kemal
    Usman, Muhammad
    Xue, Wei
    Haas, Francis M.
    Hirsh, Robert A.
    Zhao, Xinghui
    FRONTIERS IN BIG DATA, 2023, 6
  • [34] Real-Time Pedestrian Detection Using Convolutional Neural Networks
    Kuang, Ping
    Ma, Tingsong
    Li, Fan
    Chen, Ziwei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (11)
  • [35] Real-Time Grasp Detection Using Convolutional Neural Networks
    Redmon, Joseph
    Angelova, Anelia
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1316 - 1322
  • [36] Real-time Music Tracking based on a Weightless Neural Network
    de Souza, Diego F. P.
    Franca, Felipe M. G.
    Lima, Priscila M. V.
    2015 9TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS CISIS 2015, 2015, : 64 - 69
  • [37] Real-Time Ground Vehicle Detection in Aerial Infrared Imagery Based on Convolutional Neural Network
    Liu, Xiaofei
    Yang, Tao
    Li, Jing
    ELECTRONICS, 2018, 7 (06)
  • [38] Deep convolutional neural network based real-time abnormal behavior detection in social networks
    Mavaluru, Dinesh
    Mubarakali, Azath
    Narapureddy, Bayapa Reddy
    Ramakrishnan, Jayabrabu
    John, Rajan
    Ravishankar, Nadana
    Karthika, P.
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 111
  • [39] The analysis of Iris image acquisition and real-time detection system using convolutional neural network
    Yanru Liu
    Jiali Xu
    Austin Lin Yee
    The Journal of Supercomputing, 2024, 80 (4) : 4500 - 4532
  • [40] A novel real-time fall detection method based on head segmentation and convolutional neural network
    Yao, Chenguang
    Hu, Jun
    Min, Weidong
    Deng, Zhifeng
    Zou, Song
    Min, Weiqiong
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 1939 - 1949