Robust audio fingerprinting based on GammaChirp frequency cepstral coefficients and chroma

Cited by: 3

Authors:

Chen, N. [1]

Xiao, H. D. [2]

Zhu, J. [3]

Affiliations:

[1] E China Univ Sci & Technol, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China

[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China

[3] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

Source:

ELECTRONICS LETTERS | 2014 / Vol. 50 / Issue 04

Funding:

Natural Science Foundation of Shanghai; National Natural Science Foundation of China;

DOI:

10.1049/el.2013.3554

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

A novel auditory feature that combines an auditory model and music theory is proposed for audio fingerprinting. First, the input audio is filtered by a GammaChirp (GC) filterbank to model the cochlear frequency selectivity. Then, the output of the filterbank is downsampled and decorrelated by a discrete cosine transform to obtain the GammaChirp frequency cepstral coefficients (GCFCCs). Next, a few of the lowest-order GCFCCs are projected onto the chroma to characterise both the melodic and harmonic information of the input. Finally, non-negative matrix factorisation is applied to the chroma matrix to reduce its dimension while maintaining its discriminative power. The experimental results illustrate that the proposed scheme achieves a more stable identification rate and lower computational complexity than schemes based on Mel-frequency cepstral coefficients. © The Institution of Engineering and Technology 2014.
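
Two of the generic steps named in the abstract, decorrelation by a discrete cosine transform and dimension reduction by non-negative matrix factorisation, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names (`dct_ii`, `nmf`, `frobenius_error`), the unnormalised DCT-II form, and the Lee-Seung multiplicative updates are all assumptions chosen for illustration. The GammaChirp filterbank and the chroma projection are omitted because the abstract does not give their parameters.

```python
import math
import random


def dct_ii(frame):
    """Unnormalised DCT-II of one frame of filterbank outputs (assumed form)."""
    n = len(frame)
    return [
        sum(x * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
            for i, x in enumerate(frame))
        for k in range(n)
    ]


def matmul(a, b):
    """Matrix product of two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def frobenius_error(v, w, h):
    """Frobenius norm of v - w @ h."""
    wh = matmul(w, h)
    return math.sqrt(sum((v[i][j] - wh[i][j]) ** 2
                         for i in range(len(v)) for j in range(len(v[0]))))


def nmf(v, rank, iters=200, seed=0, eps=1e-9):
    """Factorise a non-negative matrix v (m x n) as w (m x rank) @ h (rank x n)
    with Lee-Seung multiplicative updates (an assumed choice of NMF solver)."""
    rng = random.Random(seed)
    m, n = len(v), len(v[0])
    w = [[rng.random() + eps for _ in range(rank)] for _ in range(m)]
    h = [[rng.random() + eps for _ in range(n)] for _ in range(rank)]
    for _ in range(iters):
        # Update h with w fixed.
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n)]
             for i in range(rank)]
        # Update w with h fixed.
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(m)]
    return w, h
```

In a pipeline of the kind the abstract describes, `dct_ii` would be applied per frame to the downsampled filterbank outputs, and `nmf` would be applied to the non-negative chroma matrix, with the low-rank factor serving as the compact fingerprint. Both of these usage details are assumptions about how the pieces would fit together.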

Pages: 241 / U174

Number of pages: 2
