kNNVWC: An Efficient k-Nearest Neighbors Approach Based on Various-Widths Clustering

被引:31
|
作者
Almalawi, Abdul Mohsen [1 ]
Fahad, Adil [2 ]
Tari, Zahir [3 ]
Cheema, Muhammad Aamir [4 ]
Khalil, Ibrahim [3 ]
机构
[1] King Abdulaziz Univ, Sch Comp Sci & Informat Technol, Jeddah 21413, Saudi Arabia
[2] Al Baha Univ, Coll Comp Sci & Informat Technol, Dept Comp Sci, Al Baha, Saudi Arabia
[3] RMIT Univ, Sch Comp Sci & Informat Technol CSIT, Distributed Syst & Networking DSN Discipline, Melbourne, Vic 3000, Australia
[4] Monash Univ, Fac Informat Technol, Clayton, Vic 3168, Australia
基金
澳大利亚研究理事会;
关键词
Clustering; K-nearest neighbour; high dimensionality; performance; SCADA; DISTANCE-BASED OUTLIERS; ALGORITHM; SEARCH;
D O I
10.1109/TKDE.2015.2460735
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbor approach (k-NN) has been extensively used as a powerful non-parametric technique in many scientific and engineering applications. However, this approach incurs a large computational cost. Hence, this issue has become an active research field. In this work, a novel k-NN approach based on various-widths clustering, named kNNVWC, to efficiently find k-NNs for a query object from a given data set, is presented. kNNVWC does clustering using various widths, where a data set is clustered with a global width first and each produced cluster that meets the predefined criteria is recursively clustered with its own local width that suits its distribution. This reduces the clustering time, in addition to balancing the number of produced clusters and their respective sizes. Maximum efficiency is achieved by using triangle inequality to prune unlikely clusters. Experimental results demonstrate that kNNVWC performs well in finding k-NNs for query objects compared to a number of k-NN search algorithms, especially for a data set with high dimensions, various distributions and large size.
引用
收藏
页码:68 / 81
页数:14
相关论文
共 50 条
  • [1] kNNVWC: An Efficient k-Nearest Neighbours Approach based on Various-Widths Clustering
    Almalawi, Abdulmohsen
    Fahad, Adil
    Tari, Zahir
    Cheema, Muhammad Aamir
    Khalil, Ibrahim
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1572 - 1573
  • [2] An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
    Raneem Qaddoura
    Hossam Faris
    Ibrahim Aljarah
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 675 - 714
  • [3] An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
    Qaddoura, Raneem
    Faris, Hossam
    Aljarah, Ibrahim
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 675 - 714
  • [4] K-nearest neighbors clustering algorithm
    Gauza, Dariusz
    Zukowska, Anna
    Nowak, Robert
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2014, 2014, 9290
  • [5] Information theoretic clustering using a k-nearest neighbors approach
    Vikjord, Vidar V.
    Jenssen, Robert
    PATTERN RECOGNITION, 2014, 47 (09) : 3070 - 3081
  • [6] Relative density based K-nearest neighbors clustering algorithm
    Liu, QB
    Deng, S
    Lu, CH
    Wang, B
    Zhou, YF
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 133 - 137
  • [7] Density peaks clustering based on k-nearest neighbors sharing
    Fan, Tanghuai
    Yao, Zhanfeng
    Han, Longzhe
    Liu, Baohong
    Lv, Li
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [8] AN APPROXIMATE CLUSTERING TECHNIQUE BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    AUTOMATION AND REMOTE CONTROL, 1992, 53 (10) : 1592 - 1598
  • [9] Search for top-k spatial objects based on various-widths clustering
    Yu, Shoujian
    Feng, Guangyi
    2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2017, : 362 - 367
  • [10] Graph Clustering Using Mutual K-Nearest Neighbors
    Sardana, Divya
    Bhatnagar, Raj
    ACTIVE MEDIA TECHNOLOGY, AMT 2014, 2014, 8610 : 35 - 48