Analysis of minimum distances in high-dimensional musical spaces

被引:42
|
作者
Casey, Michael [1 ,2 ]
Rhodes, Christophe [1 ]
Slaney, Malcolm [3 ]
机构
[1] Univ London Goldsmiths Coll, Dept Comp, London SE14 6NW, England
[2] Dartmouth Coll, Dept Mus, Hanover, NH 03755 USA
[3] Yahoo Res Inc, Santa Clara, CA 95054 USA
基金
英国工程与自然科学研究理事会;
关键词
audio shingles; distance distributions; locality-sensitive hashing; matched-filter distance; music similarity;
D O I
10.1109/TASL.2008.925883
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose an automatic method for measuring content-based music similarity, enhancing the current generation of music search engines and recommender systems. Many previous approaches to track similarity require brute-force, pair-wise processing between all audio features in a database and therefore are not practical for large collections. However, in an Internet-connected world, where users have access to millions of musical tracks, efficiency is crucial. Our approach uses features extracted from unlabeled audio data and near-neigbor retrieval using a distance threshold, determined by analysis, to solve a range of retrieval tasks. The tasks require temporal features-analogous to the technique of shingling used for text retrieval. To measure similarity, we count pairs of audio shingles, between a query and target track, that are below a distance threshold. The distribution of between-shingle distances is different for each database; therefore, we present an analysis of the distribution of minimum distances between shingles and a method for estimating a distance threshold for optimal retrieval performance. The method is compatible with locality-sensitive hashing (LSH)-allowing implementation with retrieval times several orders of magnitude faster than those using exhaustive distance computations. We evaluate the performance of our proposed method on three contrasting music similarity tasks: retrieval of mis-attributed recordings (fingerprint), retrieval of the same work performed by different artists (cover songs), and retrieval of edited and sampled versions of a query track by remix artists (remixes). Our method achieves near-perfect performance in the first two tasks and 75% precision at 70% recall in the third task. Each task was performed on a test database comprising 4.5 million audio shingles.
引用
收藏
页码:1015 / 1028
页数:14
相关论文
共 50 条
  • [31] Structure and visualization of high-dimensional conductance spaces
    Taylor, Adam L.
    Hickey, Timothy J.
    Prinz, Astrid A.
    Marder, Eve
    JOURNAL OF NEUROPHYSIOLOGY, 2006, 96 (02) : 891 - 905
  • [32] Constructive kissing numbers in high-dimensional spaces
    Xu, Lanju
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2007, 20 (01) : 30 - 40
  • [33] ASPECTS OF HIGH-DIMENSIONAL THEORIES IN EMBEDDING SPACES
    MAIA, MD
    MECKLENBURG, W
    JOURNAL OF MATHEMATICAL PHYSICS, 1984, 25 (10) : 3047 - 3050
  • [34] Adaptive Indexing in High-Dimensional Metric Spaces
    Lampropoulos, Konstantinos
    Zardbani, Fatemeh
    Mamoulis, Nikos
    Karras, Panagiotis
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (10): : 2525 - 2537
  • [35] Balancing Geometry and Density: Path Distances on High-Dimensional Data
    Little, Anna
    McKenzie, Daniel
    Murphy, James M.
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2022, 4 (01): : 72 - 99
  • [36] BrePartition: Optimized High-Dimensional kNN Search With Bregman Distances
    Song, Yang
    Gu, Yu
    Zhang, Rui
    Yu, Ge
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) : 1053 - 1065
  • [37] Minimum Distance Lasso for robust high-dimensional regression
    Lozano, Aurelie C.
    Meinshausen, Nicolai
    Yang, Eunho
    ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (01): : 1296 - 1340
  • [38] Robustness Analysis of Eleven Linear Classifiers in Extremely High-Dimensional Feature Spaces
    Lausser, Ludwig
    Kestler, Hans A.
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2010, 5998 : 72 - 83
  • [39] High-Fidelity Frequency Converter in High-Dimensional Spaces
    Yuan, Jinpeng
    Wang, Xuewen
    Chen, Gang
    Wang, Lirong
    Xiao, Liantuan
    Jia, Suotang
    LASER & PHOTONICS REVIEWS, 2024, 18 (11)
  • [40] High-dimensional discriminant analysis
    Bouveyron, Charles
    Girard, Stephane
    Schmid, Cordelia
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2007, 36 (13-16) : 2607 - 2623