COVER SONG IDENTIFICATION WITH 2D FOURIER TRANSFORM SEQUENCES

被引:0
|
作者
Seetharaman, Prem [1 ]
Rafii, Zajar [2 ]
机构
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Gracenote Inc, Emeryville, CA USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
关键词
Cover song identification; audio finger-printing; Constant Q transform; 2D Fourier transform; adaptive thresholding;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We approach cover song identification using a novel time-series representation of audio based on the 2DFT. The audio is represented as a sequence of magnitude 2D Fourier Transforms (2DFT). This representation is robust to key changes, timbral changes, and small local tempo deviations. We look at cross-similarity between these time-series, and extract a distance measure that is invariant to music structure changes. Our approach is state-of-the-art on a recent cover song dataset, and expands on previous work using the 2DFT for music representation and work on live song recognition.
引用
收藏
页码:616 / 620
页数:5
相关论文
共 50 条
  • [1] Cover song search based on magnitude and phase of the 2D Fourier transform
    Seo, Jin Soo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2018, 37 (06): : 518 - 524
  • [2] Revised 2D Fast Fourier Transform
    Pupeikis, Rimantas
    2015 OPEN CONFERENCE OF ELECTRICAL, ELECTRONIC AND INFORMATION SCIENCES (ESTREAM), 2015,
  • [3] Comparison of 2D S-Transform Profilometry and 2D Windowed Fourier Transform Profilornetry
    Chen, Wenjing
    Shen, Qiuju
    Zhong, Min
    OPTIK, 2013, 124 (24): : 6732 - 6736
  • [4] A GENERAL FORM OF 2D FOURIER TRANSFORM EIGENFUNCTIONS
    Pei, Soo-Chang
    Liu, Chun-Lin
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3701 - 3704
  • [5] 2D Discrete Fourier Transform on Sliding Windows
    Park, Chun-Su
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (03) : 901 - 907
  • [7] Sliding 2D Discrete Fractional Fourier Transform
    Liu, Yu
    Miao, Hongxia
    Zhang, Feng
    Tao, Ran
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (12) : 1733 - 1737
  • [8] The extended fourier transform for 2D spectral estimation
    Armstrong, GS
    Mandelshtam, VA
    JOURNAL OF MAGNETIC RESONANCE, 2001, 153 (01) : 22 - 31
  • [9] Fingerprint Matching by Using 2D Discrete Cosine Transform And 2D Fourier Transforms
    Insankeovilay, Souksamay
    Choomchuay, Somsak
    Hamamoto, Kazuhiko
    5TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON 2012), 2012,
  • [10] Fingerprint Matching by Using 2D Discrete Cosine Transform And 2D Fourier Transforms
    Insankeovilay, Souksamay
    Choomchuay, Somsak
    Hamamoto, Kazuhiko
    5TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON 2012), 2012, : 53 - 54