A strong coreset algorithm to accelerate OPF as a graph-based machine learning in large-scale problems

被引:1
|
作者
Bostani, Hamid [1 ]
Sheikhan, Mansour [2 ]
Mahboobi, Behrad [3 ]
机构
[1] Islamic Azad Univ, South Tehran Branch, Young Researchers & Elite Club, Tehran, Iran
[2] Islamic Azad Univ, Dept Elect Engn, South Tehran Branch, Tehran, Iran
[3] Islamic Azad Univ, Commun Comp & Ind Network Res Ctr, Dept Elect & Comp Engn, Sci & Res Branch, Tehran, Iran
基金
美国国家科学基金会;
关键词
Coreset; Optimum-path forest; Large-scale problems; Massive datasets; OPTIMUM-PATH FOREST; INTRUSION DETECTION; CLASSIFICATION; HYBRID;
D O I
10.1016/j.ins.2020.10.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimum-path forest (OPF) is one of the efficient graph-based frameworks that can determine the patterns of input dataset by extracting the optimal partitions of graph obtained through encoding data into a graph. Since OPF was introduced based on simple assumptions without considering the requirements of large-scale problems, this machine learning is an effective algorithm only for a reasonable size of input datasets. To provide a scalable OPF, this study introduces a strong coreset for accelerating OPF algorithm. Applying this approach can expedite OPF procedure, especially when it is working on massive datasets. Accordingly, a novel algebra is developed to represent the problem of OPF as an optimization problem for the proposed coreset definition. A novel coreset construction algorithm that can approximate the OPF solutions is subsequently proposed in order to improve the OPF construction speed. The simulation results of diverse experiments on various benchmark datasets illustrate computation gain and superiority of the proposed algorithm in terms of the construction and classification speeds as compared to the original algorithm while displaying reliably accurate performance. The presented coreset construction algorithm performs the training and testing phases of OPF up to 6.1 and 4.9 times faster than before, respectively. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:424 / 441
页数:18
相关论文
共 50 条
  • [31] Reducing Computational Complexity of Factor Graph-Based Belief Propagation Algorithm for Detection in Large-Scale MIMO Systems
    Abbaszadeh, Iman
    Darabi, Mostafa
    Ardebilipour, Mehrdad
    Maham, Behrouz
    2019 IEEE 30TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2019, : 1264 - 1269
  • [32] Graph-Based Wrong IsA Relation Detection in a Large-Scale Lexical Taxonomy
    Liang, Jiaqing
    Xiao, Yanghua
    Zhang, Yi
    Hwang, Seung-won
    Wang, Haixun
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1178 - 1184
  • [33] A Probabilistic Computing Approach of Attack Graph-Based Nodes in Large-scale Network
    Ye Yun
    Xu Xi-shan
    Qi Zhi-chang
    2011 3RD INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY ESIAT 2011, VOL 10, PT A, 2011, 10 : 3 - 8
  • [34] Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets
    Lee, Wen-Yu
    Hsieh, Liang-Chi
    Wu, Guan-Long
    Hsu, Winston
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (03) : 295 - 302
  • [35] A Study on Reachability Problems of Large-scale Graph
    Ma, Jing-yan
    Zhang, Ke-hong
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND TECHNOLOGY (ICCST 2015), 2015, : 244 - 254
  • [36] Petascale computing for large-scale graph problems
    Bader, David A.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 166 - 169
  • [37] Petascale computing for large-scale graph problems
    Bader, David A.
    CISIS 2008: THE SECOND INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, PROCEEDINGS, 2008, : 779 - 779
  • [38] Distributed Learning Algorithm for Distributed PV Large-Scale Access to Power Grid Based on Machine Learning
    Lei, Zhen
    Yang, Yong-biao
    Xu, Xiao-hui
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT I, 2019, 301 : 439 - 447
  • [39] Large-scale OPF Based on Voltage Grading and Network Partition
    Yang, Yude
    Wu, Zhongbiao
    Zhang, Yiyi
    Wei, Hua
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2016, 2 (02): : 56 - 61
  • [40] A scalable algorithm for graph-based active learning
    Zhao, Wentao
    Long, Jun
    Zhu, En
    Liu, Yun
    FRONTIERS IN ALGORITHMICS, 2008, 5059 : 311 - 322