Two-level fusion-based acoustic scene classification

Cited by: 16
Authors
Waldekar, Shefali [1]
Saha, Goutam [1]
Affiliations
[1] IIT Kharagpur, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
Keywords
Environmental acoustics; Hierarchical classification; Score-fusion; Spectral features; Texture features; OF-FRAMES APPROACH; SUFFICIENT MODEL; RECOGNITION; FEATURES;
DOI
10.1016/j.apacoust.2020.107502
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Growing demands from applications like surveillance, archiving, and context-aware devices have fuelled research towards efficient extraction of useful information from environmental sounds. Acoustic scene classification (ASC) deals with assigning a textual label to an audio segment based on the general characteristics of locations or situations. Because audio scenes differ in nature, a single feature-classifier pair may not efficiently discriminate among environments. Also, the acoustic scenes might vary with the problem under investigation. However, for most ASC applications, rather than giving explicit scene labels (like home, park, etc.), a general estimate of the type of surroundings (e.g., indoor or outdoor) might be enough. In this paper, we propose a two-level hierarchical framework for ASC wherein finer labels follow coarse classification. At the first level, texture features extracted from time-frequency representations of the audio samples are used to generate the coarse labels. The system then explores combinations of six well-known spectral features, successfully used in different audio processing fields, for second-level classification to give finer details of the audio scene. The performance of the proposed system is compared with baseline methods using the Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 and 2017 ASC databases and is found to be superior in terms of classification accuracy. Additionally, the proposed hierarchical method provides important intermediate results as coarse labels that may be useful in certain applications. © 2020 Elsevier Ltd. All rights reserved.
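The two-level idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the scene hierarchy, the toy scorers, and the mean-based score fusion are all assumptions made for illustration, since the abstract does not specify the classifiers, the six spectral features, or the fusion rule.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical coarse-to-fine grouping; the paper's actual grouping is not
# given in the abstract.
HIERARCHY: Dict[str, List[str]] = {
    "indoor": ["home", "office"],
    "outdoor": ["park", "street"],
}

# A scorer maps a feature vector to per-label scores (higher is better).
Scorer = Callable[[Sequence[float]], Dict[str, float]]


def fuse_scores(score_sets: List[Dict[str, float]],
                labels: List[str]) -> Dict[str, float]:
    """Mean score fusion (an assumed rule) over the candidate labels only."""
    return {
        label: sum(s.get(label, 0.0) for s in score_sets) / len(score_sets)
        for label in labels
    }


def classify_two_level(texture_feats: Sequence[float],
                       spectral_feats: List[Sequence[float]],
                       coarse_scorer: Scorer,
                       fine_scorers: List[Scorer]) -> Tuple[str, str]:
    """Level 1 picks a coarse label; level 2 fuses fine scores within it."""
    coarse_scores = coarse_scorer(texture_feats)
    coarse = max(coarse_scores, key=coarse_scores.get)
    candidates = HIERARCHY[coarse]
    # One scorer per spectral feature set; scores are fused before deciding.
    score_sets = [f(x) for f, x in zip(fine_scorers, spectral_feats)]
    fused = fuse_scores(score_sets, candidates)
    fine = max(fused, key=fused.get)
    return coarse, fine


# Toy stand-ins for trained classifiers (hypothetical, fixed outputs).
def toy_coarse(feats: Sequence[float]) -> Dict[str, float]:
    mean = sum(feats) / len(feats)
    return {"indoor": 1.0 - mean, "outdoor": mean}


def toy_fine_a(feats: Sequence[float]) -> Dict[str, float]:
    return {"home": 0.2, "office": 0.1, "park": 0.6, "street": 0.4}


def toy_fine_b(feats: Sequence[float]) -> Dict[str, float]:
    return {"home": 0.3, "office": 0.2, "park": 0.3, "street": 0.7}


if __name__ == "__main__":
    result = classify_two_level([0.8, 0.9], [[0.0], [0.0]],
                                toy_coarse, [toy_fine_a, toy_fine_b])
    print(result)
```

The coarse label is returned alongside the fine one, mirroring the abstract's point that the intermediate result may be useful on its own.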
Pages: 11
Related papers
50 items in total
  • [1] Two-Stage Fusion-Based Audiovisual Remote Sensing Scene Classification
    Wang, Yaming
    Liu, Yiyang
    Huang, Wenqing
    Ye, Xiaoping
    Jiang, Mingfeng
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [2] Two-Level Feature Representation for Aerial Scene Classification
    Gan, Jinrui
    Li, Qingyong
    Zhang, Zhen
    Wang, Jianzhu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (11) : 1626 - 1630
  • [3] A new method of construction waste classification based on two-level fusion
    Song, Lin
    Zhao, Huixuan
    Ma, Zongfang
    Song, Qi
    PLOS ONE, 2022, 17 (12):
  • [4] Fusion-based holistic road scene understanding
    Huang, Wenqi
    Zhang, Fuzheng
    Xu, Aidong
    Chen, Huajun
    Li, Peng
    JOURNAL OF ENGINEERING-JOE, 2018, (16): 1623 - 1628
  • [5] Semisupervised Two-Level Fusion-Based Autoencoded Approach for Low-Cost Domain Adaptation of Remotely Sensed Images
    Chakraborty, Shounak
    Roy, Moumita
    Melganie, Farid
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (07) : 1041 - 1045
  • [6] Two-Level Fusion to Improve Emotion Classification in Spoken Dialogue Systems
    Lopez-Cozar, Ramon
    Callejas, Zoraida
    Kroul, Martin
    Nouza, Jan
    Silovsky, Jan
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 617 - +
  • [7] A two-level classification-based color constancy
    Mohammad Mehdi Faghih
    Mohsen Ebrahimi Moghaddam
    Signal, Image and Video Processing, 2015, 9 : 1299 - 1316
  • [8] A two-level classification-based color constancy
    Faghih, Mohammad Mehdi
    Moghaddam, Mohsen Ebrahimi
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (06) : 1299 - 1316
  • [9] Remote Sensing Scene Classification Based on Decision-Level Fusion
    Li, Xiaobin
    Jiang, Bitao
    Sun, Tong
    Wang, Shengjin
    PROCEEDINGS OF 2018 IEEE 4TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2018), 2018: 393 - 397
  • [10] Scene Parsing and Fusion-Based Continuous Traversable Region Formation
    Xiao, Xuhong
    Ng, Gee Wah
    Tan, Yuan Sin
    Chuan, Yeo Ye
    COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I, 2015, 9008 : 383 - 398