Scalable inductive learning on partitioned data

被引:0
|
作者
Chen, QJ [1 ]
Wu, XD [1 ]
Zhu, XQ [1 ]
机构
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid advancement of information technology, scalability has become a necessity for learning algorithms to deal with large, real-world data repositories. In this paper, scalability is accomplished through a data reduction technique, which partitions a large data set into subsets, applies a learning algorithm on each subset sequentially or concurrently, and then integrates the learned results. Five strategies to achieve scalability (Rule-Example Conversion, Rule Weighting, Iteration, Good Rule Selection, and Data Dependent Rule Selection) are identified and seven corresponding scalable schemes are designed and developed. A substantial number of experiments have been performed to evaluate these schemes. Experimental results demonstrate that through data reduction some of our schemes can effectively generate accurate classifiers from weak classifiers generated from data subsets. Furthermore, our schemes require significantly less training time than that of generating a global classifier.
引用
收藏
页码:391 / 403
页数:13
相关论文
共 50 条
  • [31] PRISAD: A partitioned rendering infrastructure for scalable accordion drawing
    Slack, J
    Hildebrand, K
    Munzner, T
    INFOVIS 05: IEEE SYMPOSIUM ON INFORMATION VISUALIZATION, PROCEEDINGS, 2005, : 41 - 48
  • [32] Equivariance and Invariance Inductive Bias for Learning from Insufficient Data
    Wad, Tan
    Sun, Qianru
    Pranata, Sugiri
    Jayashree, Karlekar
    Zhang, Hanwang
    COMPUTER VISION, ECCV 2022, PT XI, 2022, 13671 : 241 - 258
  • [33] Scalable Manifold Learning for Big Data with Apache Spark
    Schoeneman, Frank
    Zola, Jaroslaw
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 272 - 281
  • [34] Towards scalable and data efficient learning of Markov boundaries
    Pena, Jose M.
    Nilsson, Roland
    Bjorkegren, Johan
    Tegner, Jesper
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2007, 45 (02) : 211 - 232
  • [35] Inductive learning from numerical and symbolic data: An integrated framework
    Esposito, Floriana
    Malerba, Donato
    Marengo, Vittorio
    Intelligent Data Analysis, 2001, 5 (06) : 445 - 461
  • [36] On the Accuracy of Meta-learning for Scalable Data Mining
    Chan P.K.
    Stolfo S.J.
    Journal of Intelligent Information Systems, 1997, 8 (1) : 5 - 28
  • [37] Learning Interpretable Rules for Scalable Data Representation and Classification
    Wang, Zhuo
    Zhang, Wei
    Liu, Ning
    Wang, Jianyong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 1121 - 1133
  • [38] INDUCTIVE LEARNING
    HARDIE, CD
    EDUCATIONAL THEORY, 1975, 25 (01) : 40 - 44
  • [39] Inductive Learning
    吴信东
    JournalofComputerScienceandTechnology, 1993, (02) : 118 - 132
  • [40] Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data
    Castiglia, Timothy
    Das, Anirban
    Wang, Shiqiang
    Patterson, Stacy
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,