Analysis of minimum distances in high-dimensional musical spaces

被引:42
|
作者
Casey, Michael [1 ,2 ]
Rhodes, Christophe [1 ]
Slaney, Malcolm [3 ]
机构
[1] Univ London Goldsmiths Coll, Dept Comp, London SE14 6NW, England
[2] Dartmouth Coll, Dept Mus, Hanover, NH 03755 USA
[3] Yahoo Res Inc, Santa Clara, CA 95054 USA
基金
英国工程与自然科学研究理事会;
关键词
audio shingles; distance distributions; locality-sensitive hashing; matched-filter distance; music similarity;
D O I
10.1109/TASL.2008.925883
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose an automatic method for measuring content-based music similarity, enhancing the current generation of music search engines and recommender systems. Many previous approaches to track similarity require brute-force, pair-wise processing between all audio features in a database and therefore are not practical for large collections. However, in an Internet-connected world, where users have access to millions of musical tracks, efficiency is crucial. Our approach uses features extracted from unlabeled audio data and near-neigbor retrieval using a distance threshold, determined by analysis, to solve a range of retrieval tasks. The tasks require temporal features-analogous to the technique of shingling used for text retrieval. To measure similarity, we count pairs of audio shingles, between a query and target track, that are below a distance threshold. The distribution of between-shingle distances is different for each database; therefore, we present an analysis of the distribution of minimum distances between shingles and a method for estimating a distance threshold for optimal retrieval performance. The method is compatible with locality-sensitive hashing (LSH)-allowing implementation with retrieval times several orders of magnitude faster than those using exhaustive distance computations. We evaluate the performance of our proposed method on three contrasting music similarity tasks: retrieval of mis-attributed recordings (fingerprint), retrieval of the same work performed by different artists (cover songs), and retrieval of edited and sampled versions of a query track by remix artists (remixes). Our method achieves near-perfect performance in the first two tasks and 75% precision at 70% recall in the third task. Each task was performed on a test database comprising 4.5 million audio shingles.
引用
收藏
页码:1015 / 1028
页数:14
相关论文
共 50 条
  • [1] On the Behavior of Intrinsically High-Dimensional Spaces: Distances, Direct and Reverse Nearest Neighbors, and Hubness
    Angiulli, Fabrizio
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [2] Statistical analysis on high-dimensional spheres and shape spaces
    Dryden, IL
    ANNALS OF STATISTICS, 2005, 33 (04): : 1643 - 1665
  • [3] EM in high-dimensional spaces
    Draper, BA
    Elliott, DL
    Hayes, J
    Baek, K
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (03): : 571 - 577
  • [4] Towards topological analysis of high-dimensional feature spaces
    Wagner, Hubert
    Dlotko, Pawel
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 121 : 21 - 26
  • [5] The mathematics of high-dimensional spaces
    Rogers, D
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1998, 215 : U524 - U524
  • [6] PRINCIPAL COMPONENT ANALYSIS IN VERY HIGH-DIMENSIONAL SPACES
    Lee, Young Kyung
    Lee, Eun Ryung
    Park, Byeong U.
    STATISTICA SINICA, 2012, 22 (03) : 933 - 956
  • [7] Topological visual analysis of clusterings in high-dimensional information spaces
    Oesterling, Patrick
    Jähnichen, Patrick
    Heyer, Gerhard
    Scheuermann, Gerik
    Physical Review Letters, 2021, 127 (05)
  • [8] Topological visual analysis of clusterings in high-dimensional information spaces
    Oesterling, Patrick
    Jaehnichen, Patrick
    Heyer, Gerhard
    Scheuermann, Gerik
    IT-INFORMATION TECHNOLOGY, 2015, 57 (01): : 3 - 10
  • [9] Containment problems in high-dimensional spaces
    Ishigami, Y
    GRAPHS AND COMBINATORICS, 1995, 11 (04) : 327 - 335
  • [10] Torelli spaces of high-dimensional manifolds
    Ebert, Johannes
    Randal-Williams, Oscar
    JOURNAL OF TOPOLOGY, 2015, 8 (01) : 38 - 64