MusicNet: Compact Convolutional Neural Network for Real-time Background Music Detection

被引:0
|
作者
Reddy, Chandan K. A. [1 ]
Gopal, Vishak [1 ]
Dubey, Harishchandra [1 ]
Matusevych, Sergiy [1 ]
Cutler, Ross [1 ]
Aichner, Robert [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
来源
关键词
Background Music Detection; Acoustic Event Detection; Instrumental Music; Convolutional Neural Networks; In-Model Featurization;
D O I
10.21437/Interspeech.2022-864
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
With the recent growth of remote work, online meetings often encounter challenging audio contexts such as background noise, music, and echo. Accurate real-time detection of music events can help to improve the user experience. In this paper, we present MusicNet, a compact neural model for detecting background music in the real-time communications pipeline. In video meetings, music frequently co-occurs with speech and background noises, making the accurate classification quite challenging. We propose a compact convolutional neural network core preceded by an in-model featurization layer. MusicNet takes 9 seconds of raw audio as input and does not require any model-specific featurization in the product stack. We train our model on the balanced subset of the Audio Set [1] data and validate it on 1000 crowd-sourced real test clips. Finally, we compare MusicNet performance with 20 state-of-the-art models. MusicNet has a true positive rate (TPR) of 81.3% at a 0.1% false-positive rate (FPR), which is significantly better than state-of-the-art models included in our study. MusicNet is also 10x smaller and has 4x faster inference than the best-performing models we benchmarked.
引用
收藏
页码:4162 / 4166
页数:5
相关论文
共 50 条
  • [41] Transfer learning for real-time crater detection on asteroids using a Fully Convolutional Neural Network
    Latorre, F.
    Spiller, D.
    Sasidharan, S. T.
    Basheer, S.
    Curti, F.
    ICARUS, 2023, 394
  • [42] Road Image Enhancement and Real-Time Vehicle Detection at Night Based on Convolutional Neural Network
    Dong, Tao
    Engineering Intelligent Systems, 2024, 32 (02): : 169 - 178
  • [43] Real-time fabric defect detection based on multi-scale convolutional neural network
    Zhao, Shuxuan
    Yin, Li
    Zhang, Jie
    Wang, Junliang
    Zhong, Ray
    IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2020, 2 (04) : 189 - 196
  • [44] A novel real-time fall detection method based on head segmentation and convolutional neural network
    Chenguang Yao
    Jun Hu
    Weidong Min
    Zhifeng Deng
    Song Zou
    Weiqiong Min
    Journal of Real-Time Image Processing, 2020, 17 : 1939 - 1949
  • [45] The analysis of Iris image acquisition and real-time detection system using convolutional neural network
    Liu, Yanru
    Xu, Jiali
    Yee, Austin Lin
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4500 - 4532
  • [46] Real-Time Pain Detection Using Deep Convolutional Neural Network for Facial Expression and Motion
    Pikulkaew, Kornprom
    Boonchieng, Waraporn
    Boonchieng, Ekkarat
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 2, 2023, 448 : 341 - 349
  • [47] ECDet: An Efficient Convolutional Network for Real-time Object Detection
    Wang, Jie
    Zhou, Quan
    Cong, Dechun
    Jin, Xin
    Ou, Weihua
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2020, 2020, 11574
  • [48] Real-time Detection of Multilabel Image Artifacts In an Ophthalmic Instrument Using a Convolutional Neural Network/Deep Neural Network Model
    Sarver, Edwin J.
    Hall, Max
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2021, 62 (08)
  • [49] A neural network approach for the real-time detection of faults
    Yahya Chetouani
    Stochastic Environmental Research and Risk Assessment, 2008, 22 : 339 - 349
  • [50] A neural network approach for the real-time detection of faults
    Chetouani, Yahya
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2008, 22 (03) : 339 - 349