Environmental sound recognition using short-time feature aggregation

Cited by: 0

Authors:

Gerard Roma

Perfecto Herrera

Waldo Nogueira

Affiliations:

[1] Georgia Institute of Technology, School of Literature, Media and Communication

[2] Universitat Pompeu Fabra, Music Technology Group

[3] Medical University Hannover and Cluster of Excellence Hearing4all, Department of Otolaryngology

Source:

Journal of Intelligent Information Systems | 2018 / Volume 51

Keywords:

Audio databases; Event detection; Environmental sound recognition; Audio features; Recurrence quantification analysis; Pattern recognition

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Recognition of environmental sound is usually based on two main architectures, depending on whether the model is trained with frame-level features or with aggregated descriptions of acoustic scenes or events. The former architecture is appropriate for applications where target categories are known in advance, while the latter affords a less supervised approach. In this paper, we propose a framework for environmental sound recognition based on blind segmentation and feature aggregation. We describe a new set of descriptors, based on Recurrence Quantification Analysis (RQA), which can be extracted from the similarity matrix of a time series of audio descriptors. We analyze their usefulness for recognition of acoustic scenes and events in addition to standard feature aggregation. Our results show the potential of non-linear time series analysis techniques for dealing with environmental sounds.
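As a rough illustration of the idea of extracting RQA descriptors from a similarity matrix of frame-level features, the sketch below computes two common RQA measures (recurrence rate and determinism) with NumPy. The abstract does not specify the distance measure, threshold, or exact descriptor set, so the Euclidean distance, the `radius` and `min_line` parameters, and all function names here are assumptions made for illustration only, not the authors' method.

```python
import numpy as np


def recurrence_matrix(features, radius):
    """Binary matrix with 1 where two frames lie within `radius` (Euclidean, assumed)."""
    # features has shape (n_frames, n_dims)
    diffs = features[:, None, :] - features[None, :, :]
    dist = np.linalg.norm(diffs, axis=-1)
    return (dist <= radius).astype(int)


def diagonal_line_lengths(rec):
    """Lengths of runs of ones along every diagonal above the main diagonal."""
    n = rec.shape[0]
    lengths = []
    for k in range(1, n):
        run = 0
        for value in np.diagonal(rec, offset=k):
            if value:
                run += 1
            else:
                if run:
                    lengths.append(run)
                run = 0
        if run:
            lengths.append(run)
    return lengths


def rqa_summary(features, radius, min_line=2):
    """Recurrence rate and determinism for a sequence of at least two frames."""
    rec = recurrence_matrix(features, radius)
    n = rec.shape[0]
    # Recurrence rate: share of recurrent points, excluding the main diagonal.
    off_diagonal = rec.sum() - np.trace(rec)
    recurrence_rate = off_diagonal / (n * n - n)
    # Determinism: share of recurrent points lying on diagonal lines
    # of length >= min_line (upper triangle only; the matrix is symmetric).
    lengths = diagonal_line_lengths(rec)
    total = sum(lengths)
    in_lines = sum(length for length in lengths if length >= min_line)
    determinism = in_lines / total if total else 0.0
    return {"recurrence_rate": recurrence_rate, "determinism": determinism}


# Toy usage: a strictly periodic one-dimensional "feature" sequence.
toy = np.array([[0.0], [1.0], [0.0], [1.0], [0.0], [1.0]])
summary = rqa_summary(toy, radius=0.1)
```

For a periodic toy sequence like this one, every recurrent point falls on a diagonal line, which is the kind of temporal structure such descriptors are meant to capture alongside plain feature aggregation.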


Pages: 457-475

Page count: 18

