Robust audio fingerprinting based on GammaChirp frequency cepstral coefficients and chroma

被引:3
|
作者
Chen, N. [1 ]
Xiao, H. D. [2 ]
Zhu, J. [3 ]
机构
[1] E China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China
[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
D O I
10.1049/el.2013.3554
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel auditory feature that combines an auditory model and music theory is proposed for audio fingerprinting. First, the input audio is filtered by a GammaChirp (GC) filterbank to model the cochlear frequency selectivity. Then, the output of the filterbank is downsampled and decorrelated by a discrete cosine transform to obtain the GammaChirp frequency cepstral coefficients (GCFCCs). Next, some lowest order GCFCCs are projected onto the chroma to characterise both melodic and harmonic information of the input. Finally, non-negative matrix factorisation is applied to the chroma matrix to reduce its dimension while maintaining its discriminative power. The experimental results illustrate that the proposed scheme achieves a stabler identification rate and lower computational complexity than the schemes based on the Mel-frequency cepstral coefficients. © The Institution of Engineering and Technology 2014.
引用
收藏
页码:241 / U174
页数:2
相关论文
共 50 条
  • [31] Robust Audio Fingerprinting Based on Local Spectral Luminance Maxima Scheme
    Shi, Yong-zhe
    Zhang, Wei-Qiang
    Liu, Jia
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2496 - 2499
  • [32] A Robust Audio Fingerprinting Method for Content-Based Copy Detection
    Ouali, Chahid
    Dumouchel, Pierre
    Gupta, Vishwa
    2014 12TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2014,
  • [33] Modulation frequency features for audio fingerprinting
    Sukittanon, S
    Atlas, LE
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1773 - 1776
  • [34] Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients
    Milner, Ben
    Darch, Jonathan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 338 - 347
  • [35] Damped Oscillator Cepstral Coefficients for Robust Speech Recognition
    Mitra, Vikramjit
    Franco, Horacio
    Graciarena, Martin
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 886 - 890
  • [36] Frequency filtering for a highly robust audio fingerprinting scheme in a real-noise environment
    Park, Mansoo
    Kim, Hoi-Rin
    Ro, Yong Man
    Kim, Munchurl
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (07) : 2324 - 2327
  • [37] A Robust Feature Extraction Algorithm for Audio Fingerprinting
    Chen, Jianping
    Huang, Tiejun
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 887 - +
  • [38] Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
    Eskidere, Omer
    Gurhanli, Ahmet
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [39] Cough Recognition Based on Mel Frequency Cepstral Coefficients and Dynamic Time Warping
    Zhu, Chunmei
    Liu, Baojun
    Li, Ping
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING AND CONTROL SYSTEMS (MECS2015), 2016, : 326 - 329
  • [40] Improved DTW Speech Recognition Algorithm Based on the MEL Frequency Cepstral Coefficients
    Wei Ming-zhe
    Li Xi
    Ren Li-mian
    12TH ANNUAL MEETING OF CHINA ASSOCIATION FOR SCIENCE AND TECHNOLOGY ON INFORMATION AND COMMUNICATION TECHNOLOGY AND SMART GRID, 2010, : 235 - 238