Environmental sound recognition using short-time feature aggregation

Cited by: 0
Authors
Gerard Roma
Perfecto Herrera
Waldo Nogueira
Affiliations
[1] Georgia Institute of Technology, School of Literature, Media and Communication
[2] Universitat Pompeu Fabra, Music Technology Group
[3] Medical University Hannover and Cluster of Excellence Hearing4all, Department of Otolaryngology
Keywords
Audio databases; Event detection; Environmental sound recognition; Audio features; Recurrence quantification analysis; Pattern recognition;
DOI
Not available
Abstract
Recognition of environmental sound is usually based on two main architectures, depending on whether the model is trained with frame-level features or with aggregated descriptions of acoustic scenes or events. The former architecture is appropriate for applications where target categories are known in advance, while the latter affords a less supervised approach. In this paper, we propose a framework for environmental sound recognition based on blind segmentation and feature aggregation. We describe a new set of descriptors, based on Recurrence Quantification Analysis (RQA), which can be extracted from the similarity matrix of a time series of audio descriptors. We analyze their usefulness for recognition of acoustic scenes and events in addition to standard feature aggregation. Our results show the potential of non-linear time series analysis techniques for dealing with environmental sounds.
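As a rough illustration of the idea in the abstract, the sketch below computes two common RQA measures (recurrence rate and determinism) from the similarity matrix of a sequence of frame-level feature vectors. It is not the authors' implementation: the function names, the Euclidean distance, the fixed `radius` threshold, and the `min_line` parameter are all assumptions made for this example.

```python
import numpy as np


def recurrence_matrix(features, radius):
    """Binary recurrence matrix: 1 where two frames are within `radius`.

    `features` has shape (n_frames, n_dims). Euclidean distance and a
    fixed threshold are assumptions of this sketch.
    """
    diffs = features[:, None, :] - features[None, :, :]
    dist = np.linalg.norm(diffs, axis=-1)
    return (dist <= radius).astype(int)


def diagonal_line_lengths(rec):
    """Lengths of runs of 1s along each diagonal above the main one."""
    n = rec.shape[0]
    lengths = []
    for offset in range(1, n):
        run = 0
        for value in np.diagonal(rec, offset=offset):
            if value:
                run += 1
            else:
                if run:
                    lengths.append(run)
                run = 0
        if run:
            lengths.append(run)
    return lengths


def rqa_summary(features, radius, min_line=2):
    """Recurrence rate and determinism over the upper triangle."""
    rec = recurrence_matrix(features, radius)
    n = rec.shape[0]
    lengths = diagonal_line_lengths(rec)
    recurrent_points = sum(lengths)
    n_cells = n * (n - 1) // 2
    # Fraction of frame pairs that recur.
    recurrence_rate = recurrent_points / n_cells if n_cells else 0.0
    # Fraction of recurrent points lying on diagonal lines of at
    # least `min_line` points.
    in_lines = sum(length for length in lengths if length >= min_line)
    determinism = in_lines / recurrent_points if recurrent_points else 0.0
    return {"recurrence_rate": recurrence_rate, "determinism": determinism}
```

In this sketch, a strictly repeating feature sequence yields long diagonal lines and therefore high determinism, while a sequence with no repeated states yields a recurrence rate of zero. Summary values like these could be appended to standard aggregated statistics (for example, per-segment means and variances) before classification.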
Pages: 457 - 475
Number of pages: 18
Related papers
50 in total
  • [41] Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition
    Himawan, Ivan
    Motlicek, Petr
    Sridharan, Sridha
    Dean, David
    Tjondronegoro, Dian
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 741 - 745
  • [42] Personal sound zones in the short-time Fourier transform domain with relaxed reverberation
    Tang, Jun
    Zhu, Wenye
    Li, Xiaofei
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2025, 157 (02): : 778 - 796
  • [43] Altmetrics versus traditional bibliometrics Short-time lag and short-time life?
    Fassoulaki, Argyro
    Vassi, Aimilia
    Kardasis, Antonios
    Chantziara, Vasiliki
    EUROPEAN JOURNAL OF ANAESTHESIOLOGY, 2020, 37 (10) : 944 - 946
  • [44] SHORT-TIME CORRELATION FUNCTION AND SHORT-TIME ENERGY SPECTRUM OF A RANDOM PROCESS
    DENISENK.AN
    TELECOMMUNICATIONS AND RADIO ENGINEER-USSR, 1968, (10): : 65 - &
  • [45] Maximizing environmental sound recognition and speech intelligibility using time-frequency masking
    Johnson, Eric M.
    Healy, Eric W.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03):
  • [46] Training using Short-time Features for OSA Discrimination
    Sepulveda-Cano, L. M.
    Alvarez-Meza, A. M.
    Castellanos-Dominguez, G.
    2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 9 - 12
  • [47] Using short-time Fourier Transform in machinery diagnosis
    Safizadeh, MS
    Lakis, AA
    Thomas, M
    COMADEM '99, PROCEEDINGS, 1999, : 125 - 130
  • [48] Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform
    Makeyev, Oleksandr
    Sazonov, Edward
    Schuckers, Stephanie
    Melanson, Ed
    Neuman, Michael
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1601 - +
  • [49] Short-time Domain Feature Extraction Method of Sound Signal of High Voltage Circuit Breaker Mechanical Fault Based on KS Test
    Wang, Xiaoming
    Lin, Xiangyu
    Li, Wenwei
    Li, Haiyong
    2021 8TH INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION, IFEEA, 2021, : 583 - 587
  • [50] Fault feature extraction of rolling element bearings based on short-time processing
    Chen, Fan
    JOURNAL OF VIBROENGINEERING, 2022, 24 (02) : 317 - 330