Two-level fusion-based acoustic scene classification

被引:16
|
作者
Waldekar, Shefali [1 ]
Saha, Goutam [1 ]
机构
[1] IIT Kharagpur, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
关键词
Environmental acoustics; Hierarchical classification; Score-fusion; Spectral features; Texture features; OF-FRAMES APPROACH; SUFFICIENT MODEL; RECOGNITION; FEATURES;
D O I
10.1016/j.apacoust.2020.107502
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Growing demands from applications like surveillance, archiving, and context-aware devices have fuelled research towards efficient extraction of useful information from environmental sounds. Assigning a textual label to an audio segment based on the general characteristics of locations or situations is dealt with in acoustic scene classification (ASC). Because of the different nature of audio scenes, a single feature-classifier pair may not efficiently discriminate among environments. Also, the acoustic scenes might vary with the problem under investigation. However, for most of the ASC applications, rather than giving explicit scene labels (like home, park, etc.) a general estimate of the type of surroundings (e.g., indoor or outdoor) might be enough. In this paper, we propose a two-level hierarchical framework for ASC wherein finer labels follow coarse classification. At the first level, texture features extracted from time-frequency representation of the audio samples are used to generate the coarse labels. The system then explores combinations of six well-known spectral features, successfully used in different audio processing fields for second level classification to give finer details of the audio scene. The performance of the proposed system is compared with baseline methods using detection and classification of acoustic scenes and events (DCASE, 2016 and 2017) ASC databases, and found to be superior in terms of classification accuracy. Additionally, the proposed hierarchical method provides important intermediate results as coarse labels that may be useful in certain applications. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Artificial Disc Replacement Combined With Fusion Versus Two-Level Fusion in Cervical Two-Level Disc Disease Point of View
    McAfee, Paul
    SPINE, 2009, 34 (11) : 1160 - 1161
  • [22] A Two-level KNN based Teaching Web Pages Classification Model
    Ma, Dan
    Wang, Hanhu
    Chen, Mei
    2009 INTERNATIONAL CONFERENCE ON NETWORKING AND DIGITAL SOCIETY, VOL 1, PROCEEDINGS, 2009, : 190 - 193
  • [23] Two-level fuzzy evaluation for classification of credits
    Qiong Wang
    Jin-xian Chen
    Journal of Zhejiang University-SCIENCE A, 2002, 3 (3): : 311 - 314
  • [24] A TWO-LEVEL APPROACH TO WEB GENRE CLASSIFICATION
    Waltinger, Ulli
    Mehler, Alexander
    Wegner, Armin
    WEBIST 2009: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2009, : 689 - 692
  • [25] Texture classification by a two-level hybrid scheme
    Pok, G
    Liu, JC
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VII, 1998, 3656 : 614 - 622
  • [26] A Two-level Classification Method for Attacks on the Network
    Li, Yanyan
    Jia, Zhichun
    Han, Qiuyang
    Xing, Xing
    2019 34RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2019, : 279 - 284
  • [27] A Two-Level Rectification Attention Network for Scene Text Recognition
    Wu, Lintai
    Xu, Yong
    Hou, Junhui
    Chen, C. L. Philip
    Liu, Cheng-Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2404 - 2414
  • [28] Two-level fuzzy evaluation for classification of credits
    Wang, Qiong
    Chen, Jin-Xian
    Journal of Zhejinag University: Science, 2002, 3 (03): : 311 - 314
  • [29] Data and Decision Level Fusion-Based Crack Detection for Compressor Blade Using Acoustic and Vibration Signal
    Song, Di
    Ma, Tianchi
    Li, Yang
    Xu, Feiyun
    IEEE SENSORS JOURNAL, 2022, 22 (12) : 12209 - 12218
  • [30] Decision fusion-based approach for content-based image classification
    Thepade S.
    Das R.
    Ghosh S.
    Das, Rik (rikdas78@gmail.com), 2017, Emerald Group Holdings Ltd. (10) : 310 - 331