COVER SONG IDENTIFICATION WITH 2D FOURIER TRANSFORM SEQUENCES

被引:0
|
作者
Seetharaman, Prem [1 ]
Rafii, Zajar [2 ]
机构
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Gracenote Inc, Emeryville, CA USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
关键词
Cover song identification; audio finger-printing; Constant Q transform; 2D Fourier transform; adaptive thresholding;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We approach cover song identification using a novel time-series representation of audio based on the 2DFT. The audio is represented as a sequence of magnitude 2D Fourier Transforms (2DFT). This representation is robust to key changes, timbral changes, and small local tempo deviations. We look at cross-similarity between these time-series, and extract a distance measure that is invariant to music structure changes. Our approach is state-of-the-art on a recent cover song dataset, and expands on previous work using the 2DFT for music representation and work on live song recognition.
引用
收藏
页码:616 / 620
页数:5
相关论文
共 50 条
  • [31] FPGA ARCHITECTURE FOR 2D DISCRETE FOURIER TRANSFORM BASED ON 2D DECOMPOSITION FOR LARGE-SIZED DATA
    Kim, Jung Sub
    Yu, Chi-Li
    Deng, Lanping
    Kestur, Srinidhi
    Narayanan, Vijaykrishnan
    Chakrabarti, Chaitali
    SIPS: 2009 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS, 2009, : 121 - +
  • [32] FPGA Architecture for 2D Discrete Fourier Transform Based on 2D Decomposition for Large-sized Data
    Yu, Chi-Li
    Kim, Jung-Sub
    Deng, Lanping
    Kestur, Srinidhi
    Narayanan, Vijaykrishnan
    Chakrabarti, Chaitali
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 64 (01): : 109 - 122
  • [33] Extractions of Channel Wave in a Coal Seam Based on the 2D Fourier Transform
    Wang, Ji
    Zhu, Shujie
    Chen, Jianyuan
    Hu, Jiwu
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL VI, 2011, : 604 - 607
  • [34] Femtosecond and 2D Fourier transform experiments on Jahn-Teller dynamics
    Jonas, DM
    Farrow, DA
    Qian, W
    Smith, ER
    Ferro, AA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U278 - U278
  • [35] Polarization-dependent optical 2D Fourier transform spectroscopy of semiconductors
    Zhang, Tianhao
    Kuznetsova, Irina
    Meier, Torsten
    Li, Xiaoclin
    Mirin, Richard P.
    Thomas, Peter
    Cundiff, Steven T.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (36) : 14227 - 14232
  • [36] New encryption method of 2D image by use of the fractional Fourier transform
    Yoshimura, Hiroyuki
    Iwai, Reiko
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 2179 - +
  • [37] Algorithm 991: The 2D Tree Sliding Window Discrete Fourier Transform
    Richardson, Lee F.
    Eddy, William F.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2019, 45 (01):
  • [38] Analysis of planar anisotropy of fibre systems by using 2D Fourier transform
    Tunak, Maros
    Linka, Ales
    FIBRES & TEXTILES IN EASTERN EUROPE, 2007, 15 (5-6) : 86 - 90
  • [39] 2D fourier transform for global analysis and classification of meibomian gland images
    Ciezar, Kamila
    Pochylski, Mikolaj
    OCULAR SURFACE, 2020, 18 (04): : 865 - 870
  • [40] Extractions of Channel Wave in a Coal Seam Based on the 2D Fourier Transform
    Wang, Ji
    Zhu, Shujie
    Chen, Jianyuan
    Hu, Jiwu
    2011 AASRI CONFERENCE ON INFORMATION TECHNOLOGY AND ECONOMIC DEVELOPMENT (AASRI-ITED 2011), VOL 3, 2011, : 381 - 384