Scalable inductive learning on partitioned data

Cited by: 0
Authors
Chen, QJ [1]
Wu, XD [1]
Zhu, XQ [1]
Affiliations
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the rapid advancement of information technology, scalability has become a necessity for learning algorithms to deal with large, real-world data repositories. In this paper, scalability is accomplished through a data reduction technique, which partitions a large data set into subsets, applies a learning algorithm to each subset sequentially or concurrently, and then integrates the learned results. Five strategies to achieve scalability (Rule-Example Conversion, Rule Weighting, Iteration, Good Rule Selection, and Data Dependent Rule Selection) are identified, and seven corresponding scalable schemes are designed and developed. A substantial number of experiments have been performed to evaluate these schemes. Experimental results demonstrate that, through data reduction, some of our schemes can effectively generate accurate classifiers from weak classifiers generated from data subsets. Furthermore, our schemes require significantly less training time than generating a global classifier.
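As an illustration only, the partition, learn, and integrate pipeline described in the abstract might be sketched in Python as follows. Everything named here is an assumption made for the example: the round-robin `partition`, the single-threshold weak learner `learn_stump`, and the unweighted majority vote in `integrate` are hypothetical stand-ins, not any of the seven schemes or five strategies developed in the paper.

```python
from collections import Counter

def partition(data, k):
    """Split the data into k disjoint subsets (round-robin; an assumed choice)."""
    return [data[i::k] for i in range(k)]

def learn_stump(subset):
    """Hypothetical weak learner: the best single-feature threshold rule
    for binary labels (0/1), fitted on one subset only."""
    best = None
    n_features = len(subset[0][0])
    for f in range(n_features):
        for x, _ in subset:
            t = x[f]
            # Try both label orientations for the rule "x[f] > t".
            for lo, hi in ((0, 1), (1, 0)):
                correct = sum((hi if xi[f] > t else lo) == yi
                              for xi, yi in subset)
                if best is None or correct > best[0]:
                    best = (correct, f, t, lo, hi)
    _, f, t, lo, hi = best
    return lambda x: hi if x[f] > t else lo

def integrate(classifiers):
    """Combine the subset classifiers by unweighted majority vote
    (a simple stand-in for the paper's integration strategies)."""
    def predict(x):
        votes = Counter(c(x) for c in classifiers)
        return votes.most_common(1)[0][0]
    return predict

# Toy data: one numeric feature, label 1 when the feature is at least 10.
data = [((float(i),), int(i >= 10)) for i in range(20)]
subsets = partition(data, 3)
ensemble = integrate([learn_stump(s) for s in subsets])
accuracy = sum(ensemble(x) == y for x, y in data) / len(data)
```

Each call to `learn_stump` sees only one subset, so the per-subset training steps are independent and could run sequentially or concurrently, which is the source of the training-time saving the abstract refers to.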
Pages: 391-403
Number of pages: 13
Related papers
50 in total
  • [21] Scalable Neural Data Server: A Data Recommender for Transfer Learning
    Cao, Tianshi
    Doubov, Sasha
    Acuna, David
    Fidler, Sanja
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [22] Scalable acceleration of inductive logic programs
    Fidjeland, A
    Luk, W
    Muggleton, S
    2002 IEEE INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), PROCEEDINGS, 2002, : 252 - 259
  • [23] QuickFOIL: Scalable Inductive Logic Programming
    Zeng, Qiang
    Patel, Jignesh M.
    Page, David
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (03): : 197 - 208
  • [24] Learning algorithms for vector quantization using vertically partitioned data with IoT
    Hirofumi Miyajima
    Noritaka Shigei
    Hiromi Miyajima
    Norio Shiratori
    Artificial Life and Robotics, 2021, 26 : 283 - 290
  • [25] Classification with boosting of extreme learning machine over arbitrarily partitioned data
    Catak, Ferhat Ozgur
    SOFT COMPUTING, 2017, 21 (09) : 2269 - 2281
  • [26] Classification with boosting of extreme learning machine over arbitrarily partitioned data
    Ferhat Özgür Çatak
    Soft Computing, 2017, 21 : 2269 - 2281
  • [27] Learning algorithms for vector quantization using vertically partitioned data with IoT
    Miyajima, Hirofumi
    Shigei, Noritaka
    Miyajima, Hiromi
    Shiratori, Norio
    ARTIFICIAL LIFE AND ROBOTICS, 2021, 26 (03) : 283 - 290
  • [28] Secure and scalable deduplication of horizontally partitioned health data for privacy-preserving distributed statistical computation
    Kassaye Yitbarek Yigzaw
    Antonis Michalas
    Johan Gustav Bellika
    BMC Medical Informatics and Decision Making, 17
  • [29] Secure and scalable deduplication of horizontally partitioned health data for privacy-preserving distributed statistical computation
    Yigzaw, Kassaye Yitbarek
    Michalas, Antonis
    Bellika, Johan Gustav
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2017, 17 : 1 - 19
  • [30] Efficient and Scalable Initialization of Partitioned Coupled Simulations with preCICE
    Totounferoush, Amin
    Simonis, Frederic
    Uekermann, Benjamin
    Schulte, Miriam
    ALGORITHMS, 2021, 14 (06)