Fast Redescription Mining Using Locality-Sensitive Hashing

Cited by: 0
Authors
Karjalainen, Maiju [1]
Galbrun, Esther [1]
Miettinen, Pauli [1]
Affiliations
[1] Univ Eastern Finland, Kuopio, Finland
Keywords
Redescription mining; Locality-sensitive hashing
DOI
10.1007/978-3-031-70368-3_8
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Redescription mining is a data analysis technique that has found applications in diverse fields. The most used redescription mining approaches involve two phases: finding matching pairs among data attributes and extending the pairs. This process is relatively efficient when the number of attributes remains limited and when the attributes are Boolean, but becomes almost intractable when the data consist of many numerical attributes. In this paper, we present new algorithms that perform the matching and extension orders of magnitude faster than the existing approaches. Our algorithms are based on locality-sensitive hashing with a tailored approach to handle the discretisation of numerical attributes as used in redescription mining.
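To illustrate the kind of matching step the abstract describes, the following is a minimal sketch of generic MinHash banding over Boolean attributes, where each attribute is represented by its support (the set of rows in which it holds). It is not the authors' algorithm: the tailored handling of discretised numerical attributes is not shown, and all names (`minhash_signature`, `lsh_candidate_pairs`, `num_bands`, `rows_per_band`) are hypothetical choices made for this example.

```python
import random
from collections import defaultdict
from itertools import combinations

PRIME = 2_147_483_647  # large prime; row indices are assumed to be smaller than this


def minhash_signature(support, hash_params):
    """MinHash signature of a non-empty set of row indices."""
    return [min((a * r + b) % PRIME for r in support) for a, b in hash_params]


def lsh_candidate_pairs(supports, num_bands=20, rows_per_band=3, seed=0):
    """Return attribute pairs that share at least one LSH bucket.

    supports: dict mapping attribute name -> set of row indices where it is true.
    Attributes with empty support are skipped.
    """
    rng = random.Random(seed)
    n_hashes = num_bands * rows_per_band
    hash_params = [
        (rng.randrange(1, PRIME), rng.randrange(0, PRIME)) for _ in range(n_hashes)
    ]
    signatures = {
        name: minhash_signature(rows, hash_params)
        for name, rows in supports.items()
        if rows
    }

    candidates = set()
    for band in range(num_bands):
        start = band * rows_per_band
        buckets = defaultdict(list)
        for name, sig in signatures.items():
            # Attributes whose signatures agree on the whole band share a bucket.
            buckets[tuple(sig[start:start + rows_per_band])].append(name)
        for names in buckets.values():
            for left, right in combinations(sorted(names), 2):
                candidates.add((left, right))
    return candidates


def jaccard(s, t):
    """Exact Jaccard similarity, used to verify candidate pairs."""
    return len(s & t) / len(s | t)


if __name__ == "__main__":
    supports = {
        "a": {1, 2, 3, 4},
        "b": {1, 2, 3, 4},
        "c": {10, 11, 12},
    }
    for left, right in sorted(lsh_candidate_pairs(supports)):
        print(left, right, jaccard(supports[left], supports[right]))
```

Only the candidate pairs need an exact Jaccard check, which is where the saving over comparing all attribute pairs would come from; the band and row counts trade recall against the number of candidates.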
Pages: 124-142
Number of pages: 19