Environmental sound recognition using short-time feature aggregation

Authors
Gerard Roma
Perfecto Herrera
Waldo Nogueira
Affiliations
[1] Georgia Institute of Technology, School of Literature, Media and Communication
[2] Universitat Pompeu Fabra, Music Technology Group
[3] Medical University Hannover and Cluster of Excellence Hearing4all, Department of Otolaryngology
Source
Journal of Intelligent Information Systems | 2018 / Vol. 51
Keywords
Audio databases; Event detection; Environmental sound recognition; Audio features; Recurrence quantification analysis; Pattern recognition
DOI
Not available
Abstract
Recognition of environmental sound is usually based on two main architectures, depending on whether the model is trained with frame-level features or with aggregated descriptions of acoustic scenes or events. The former architecture is appropriate for applications where target categories are known in advance, while the latter affords a less supervised approach. In this paper, we propose a framework for environmental sound recognition based on blind segmentation and feature aggregation. We describe a new set of descriptors, based on Recurrence Quantification Analysis (RQA), which can be extracted from the similarity matrix of a time series of audio descriptors. We analyze their usefulness for recognition of acoustic scenes and events in addition to standard feature aggregation. Our results show the potential of non-linear time series analysis techniques for dealing with environmental sounds.
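As a rough illustration of how RQA descriptors can be read off a similarity matrix, the sketch below thresholds pairwise frame distances into a binary recurrence matrix and computes two standard RQA measures, recurrence rate and determinism. It is not the authors' implementation: the abstract does not list the exact descriptors, so the Euclidean distance, the fixed `threshold`, the `min_len` parameter, and all function names are assumptions made for this example.

```python
import numpy as np


def recurrence_matrix(features, threshold):
    """Binary recurrence matrix from a (frames x dims) feature sequence.

    Assumption: Euclidean distance with a fixed threshold.
    """
    diff = features[:, None, :] - features[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    return (dist <= threshold).astype(int)


def diagonal_line_lengths(rec):
    """Lengths of runs of ones on each diagonal above the main one."""
    lengths = []
    for k in range(1, rec.shape[0]):
        run = 0
        for value in np.diagonal(rec, offset=k):
            if value:
                run += 1
            elif run:
                lengths.append(run)
                run = 0
        if run:
            lengths.append(run)
    return lengths


def rqa_descriptors(rec, min_len=2):
    """Recurrence rate and determinism of a symmetric recurrence matrix."""
    n = rec.shape[0]
    # Recurrent points off the main diagonal (self-matches are excluded).
    off_diagonal = rec.sum() - np.trace(rec)
    recurrence_rate = off_diagonal / (n * n - n)

    lengths = np.array(diagonal_line_lengths(rec), dtype=float)
    total = lengths.sum()
    # Determinism: share of recurrent points lying on diagonal lines
    # of at least min_len points.
    determinism = lengths[lengths >= min_len].sum() / total if total > 0 else 0.0
    return {"recurrence_rate": float(recurrence_rate),
            "determinism": float(determinism)}


# Toy input: a one-dimensional "descriptor" that repeats every 4 frames.
frames = np.tile([0.0, 1.0, 2.0, 3.0], 4).reshape(-1, 1)
rec = recurrence_matrix(frames, threshold=0.1)
descriptors = rqa_descriptors(rec)
```

For the strictly periodic toy input, every recurrent point lies on a long diagonal line, so determinism is 1.0. A noisy or irregular texture would scatter isolated points and lower that value. In a pipeline like the one the abstract describes, such values would be appended to the aggregated feature vector of each segment.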
Pages: 457-475
Page count: 18