Analysis of minimum distances in high-dimensional musical spaces

被引:42
|
作者
Casey, Michael [1 ,2 ]
Rhodes, Christophe [1 ]
Slaney, Malcolm [3 ]
机构
[1] Univ London Goldsmiths Coll, Dept Comp, London SE14 6NW, England
[2] Dartmouth Coll, Dept Mus, Hanover, NH 03755 USA
[3] Yahoo Res Inc, Santa Clara, CA 95054 USA
基金
英国工程与自然科学研究理事会;
关键词
audio shingles; distance distributions; locality-sensitive hashing; matched-filter distance; music similarity;
D O I
10.1109/TASL.2008.925883
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose an automatic method for measuring content-based music similarity, enhancing the current generation of music search engines and recommender systems. Many previous approaches to track similarity require brute-force, pair-wise processing between all audio features in a database and therefore are not practical for large collections. However, in an Internet-connected world, where users have access to millions of musical tracks, efficiency is crucial. Our approach uses features extracted from unlabeled audio data and near-neigbor retrieval using a distance threshold, determined by analysis, to solve a range of retrieval tasks. The tasks require temporal features-analogous to the technique of shingling used for text retrieval. To measure similarity, we count pairs of audio shingles, between a query and target track, that are below a distance threshold. The distribution of between-shingle distances is different for each database; therefore, we present an analysis of the distribution of minimum distances between shingles and a method for estimating a distance threshold for optimal retrieval performance. The method is compatible with locality-sensitive hashing (LSH)-allowing implementation with retrieval times several orders of magnitude faster than those using exhaustive distance computations. We evaluate the performance of our proposed method on three contrasting music similarity tasks: retrieval of mis-attributed recordings (fingerprint), retrieval of the same work performed by different artists (cover songs), and retrieval of edited and sampled versions of a query track by remix artists (remixes). Our method achieves near-perfect performance in the first two tasks and 75% precision at 70% recall in the third task. Each task was performed on a test database comprising 4.5 million audio shingles.
引用
收藏
页码:1015 / 1028
页数:14
相关论文
共 50 条
  • [21] Stable moduli spaces of high-dimensional handlebodies
    Botvinnik, Boris
    Perlmutter, Nathan
    JOURNAL OF TOPOLOGY, 2017, 10 (01) : 101 - 163
  • [22] High-dimensional ellipsoids converge to Gaussian spaces
    Kazukawa, Daisuke
    Shioya, Takashi
    JOURNAL OF THE MATHEMATICAL SOCIETY OF JAPAN, 2024, 76 (02) : 473 - 501
  • [23] Stable moduli spaces of high-dimensional manifolds
    Galatius, Soren
    Randal-Williams, Oscar
    ACTA MATHEMATICA, 2014, 212 (02) : 257 - 377
  • [24] Aspects of Particle Filtering in High-Dimensional Spaces
    van Leeuwen, Peter Jan
    DYNAMIC DATA-DRIVEN ENVIRONMENTAL SYSTEMS SCIENCE, DYDESS 2014, 2015, 8964 : 251 - 262
  • [25] Pattern selection in high-dimensional parameter spaces
    Viehoever, Georg
    Ward, Brian
    Stock, Hans-Juergen
    OPTICAL MICROLITHOGRAPHY XXV, PTS 1AND 2, 2012, 8326
  • [26] Differential evolution in high-dimensional search spaces
    Olorunda, Olusegun
    Engelbrecht, Andries P.
    2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 1934 - 1941
  • [27] Optimal outlier removal in high-dimensional spaces
    Dunagan, J
    Vempala, S
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2004, 68 (02) : 335 - 373
  • [28] Constructive Kissing Numbers in High-Dimensional Spaces
    Lanju Xu
    Journal of Systems Science and Complexity, 2007, 20 : 30 - 40
  • [29] Mining Projected Clusters in High-Dimensional Spaces
    Bouguessa, Mohamed
    Wang, Shengrui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (04) : 507 - 522
  • [30] High-dimensional distributed semantic spaces for utterances
    Karlgren, Jussi
    Kanerva, Pentti
    NATURAL LANGUAGE ENGINEERING, 2019, 25 (04) : 503 - 518