Acoustic Classification of Cat Breed Based on Time and Frequency Domain Features

被引:2
|
作者
Raccagni, William [1 ]
Ntalampiras, Stavros [1 ]
机构
[1] Univ Milan, Milan, Italy
关键词
D O I
10.23919/FRUCT53335.2021.9599975
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emerging field of Bioacoustics has been presenting significant research activity lately, and thanks to the use of machine learning methods, several tools and methodologies have been established for identifying certain patterns and meanings in animal vocalizations. Animal sounds can vary over time in intensity and patterns produced between different breeds of the same species, both for physiological reasons and for different emotional states and needs. Pets, such as dogs and cats, are no exception, thus allowing a vocal distinction between breeds. This article studies classification of the cat breed, in particular on the Maine Coon and European Shorthair breed, based on the public audio dataset "CatMeows". To this end, we employed features coming from time and frequency domain capturing relevant information as regard to the present audio structure. Subsequently, audio pattern recognition was carried out by means of k-means clustering, k-NN, and multilayer perceptron learning models. After extensive experiments, we obtained very promising results , with an average accuracy that runs around 98%. In particular, time-domain features presented a strong contribution, as demonstrated by the results using k-means.
引用
收藏
页码:184 / 189
页数:6
相关论文
共 50 条
  • [31] Confidence measures for acoustic detection of film slates based on time-domain features
    Schlosser, Markus S.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 137 - 140
  • [32] Early Chatter Detection based on Logistic Regression with Time and Frequency Domain Features
    Ding, Longyang
    Sun, Yuxin
    Xiong, Zhenhua
    2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2017, : 1052 - 1057
  • [33] Research on Temporal Features of LEMP Based on Laplace Wavelet in Time and Frequency Domain
    Li, Qin
    Zhong, Jianwei
    Ai, Qing
    Gao, Shihong
    MIPPR 2015: MULTISPECTRAL IMAGE ACQUISITION, PROCESSING, AND ANALYSIS, 2015, 9811
  • [34] A Dementia Classification Framework Using Frequency and Time-Frequency Features Based on EEG Signals
    Durongbhan, Pholpat
    Zhao, Yifan
    Chen, Liangyu
    Zis, Panagiotis
    De Marco, Matteo
    Unwin, Zoe C.
    Venneri, Annalena
    He, Xiongxiong
    Li, Sheng
    Zhao, Yitian
    Blackburn, Daniel J.
    Sarrigiannis, Ptolemaios G.
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2019, 27 (05) : 826 - 835
  • [35] Online multi-object tracking based on time and frequency domain features
    Nazarloo, Mahbubeh
    Yadollahzadeh-Tabari, Meisam
    Motameni, Homayun
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2022, 16 (01): : 19 - 28
  • [36] CLASSIFICATION OF THE EXTENT OF WALL THINNING IN PIPES BASED ON SIMULATIONS IN THE TIME AND FREQUENCY DOMAIN
    Alobaidi, Wissam M.
    Sandgren, Eric
    PROCEEDINGS OF THE ASME PRESSURE VESSELS AND PIPING CONFERENCE, 2016, VOL 5, 2017,
  • [37] Real-Time Vehicle Classification Based on Frequency Domain Energy Spectrum
    Zhang, Pengfei
    Li, Haijian
    Dong, Honghui
    Jia, Limin
    Jin, Maojing
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 539 - 546
  • [38] Research of brake sound acoustic features extraction based on frequency-domain blind deconvolution
    Pan, Nan
    Yi, Zeguang
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 2892 - 2895
  • [39] Underwater shell type target classification using time-frequency features of acoustic backscatter signals
    Abeysekera, SS
    PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1998, : 593 - 596
  • [40] Classification of EEG Signals Using Time Domain Features
    Yazici, Mustafa
    Ulutas, Mustafa
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 2358 - 2361